I used runpod.io before buying a pair of 3090s. They make it easy to run vLLM too, so you can experiment with different models.
I also rented a GPU VM from them and ran Hugging Face models on it. That did require a lot more coding and learning.