Loading Comparison
Fetching pricing data and provider information...
Loading Comparison
Fetching pricing data and provider information...
Compare GPU and LLM inference API pricing between Google Cloud and Vast.ai. Find the best rates for AI training, inference, and ML workloads.
Provider 1
Provider 2
Average Price Difference: $0.91/hour between comparable GPUs
| GPU Model ↑ | Google Cloud Price | Vast.ai Price | Price Diff ↕ | Sources |
|---|---|---|---|---|
A10 24GB VRAM • Vast.ai | Not Available | — | ||
A10 24GB VRAM • | ||||
A100 PCIE 40GB VRAM • Vast.ai | Not Available | — | ||
A100 PCIE 40GB VRAM • | ||||
A40 48GB VRAM • Vast.ai | Not Available | — | ||
A40 48GB VRAM • | ||||
L4 24GB VRAM • Vast.ai | Not Available | — | ||
L4 24GB VRAM • | ||||
L40 40GB VRAM • Vast.ai | Not Available | — | ||
L40 40GB VRAM • | ||||
RTX 3070 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 3070 8GB VRAM • | ||||
RTX 3070 Ti 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 3070 Ti 8GB VRAM • | ||||
RTX 3080 10GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 3080 10GB VRAM • | ||||
RTX 3080 Ti 12GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 3080 Ti 12GB VRAM • | ||||
RTX 3090 24GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 3090 24GB VRAM • | ||||
RTX 3090 Ti 24GB VRAM • Vast.ai | Not Available | — | ||
RTX 3090 Ti 24GB VRAM • | ||||
RTX 4060 8GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 4060 8GB VRAM • | ||||
RTX 4060 Ti 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 4060 Ti 8GB VRAM • | ||||
RTX 4070 12GB VRAM • Vast.ai | Not Available | 4x GPU | — | |
RTX 4070 12GB VRAM • | ||||
RTX 4070 Ti 12GB VRAM • Vast.ai | Not Available | — | ||
RTX 4070 Ti 12GB VRAM • | ||||
A10 24GB VRAM • Vast.ai | Not Available | — | ||
A10 24GB VRAM • | ||||
A100 PCIE 40GB VRAM • Vast.ai | Not Available | — | ||
A100 PCIE 40GB VRAM • | ||||
A40 48GB VRAM • Vast.ai | Not Available | — | ||
A40 48GB VRAM • | ||||
L4 24GB VRAM • Vast.ai | Not Available | — | ||
L4 24GB VRAM • | ||||
L40 40GB VRAM • Vast.ai | Not Available | — | ||
L40 40GB VRAM • | ||||
RTX 3070 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 3070 8GB VRAM • | ||||
RTX 3070 Ti 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 3070 Ti 8GB VRAM • | ||||
RTX 3080 10GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 3080 10GB VRAM • | ||||
RTX 3080 Ti 12GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 3080 Ti 12GB VRAM • | ||||
RTX 3090 24GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 3090 24GB VRAM • | ||||
RTX 3090 Ti 24GB VRAM • Vast.ai | Not Available | — | ||
RTX 3090 Ti 24GB VRAM • | ||||
RTX 4060 8GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
RTX 4060 8GB VRAM • | ||||
RTX 4060 Ti 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 4060 Ti 8GB VRAM • | ||||
RTX 4070 12GB VRAM • Vast.ai | Not Available | 4x GPU | — | |
RTX 4070 12GB VRAM • | ||||
RTX 4070 Ti 12GB VRAM • Vast.ai | Not Available | — | ||
RTX 4070 Ti 12GB VRAM • | ||||
Explore how these providers compare to other popular GPU cloud services
Compare Google Cloud with another leading provider
Compare Google Cloud with another leading provider
Compare Google Cloud with another leading provider
Compare Google Cloud with another leading provider
Compare Google Cloud with another leading provider
Compare Vast.ai with another leading provider
Scalable virtual machines with a wide range of machine types, including GPUs.
Managed Kubernetes service for deploying and managing containerized applications.
Event-driven serverless compute platform.
Fully managed serverless platform for containerized applications.
Unified ML platform for building, deploying, and managing ML models.
Short-lived compute instances at a significant discount, suitable for fault-tolerant workloads.
Prices set by supply and demand across the platform with no list prices or hidden fees
GPU Cloud for full control, Serverless for zero-ops inference, Clusters for large-scale training
CLI, Python SDK, and REST API for programmatic GPU provisioning
Scale from $5 to 20,000 GPUs across 40+ data centers without contracts or minimums
Offers customizable virtual machines running in Google's data centers.
Managed Kubernetes service for running containerized applications.
Serverless compute platform for running code in response to events.
On-demand instances across 40+ data centers and 20,000+ GPUs
Deploy models as endpoints with autoscaling to zero
Dedicated multi-node GPU clusters with InfiniBand networking
Pay for compute capacity per hour or per second, with no long-term commitments.
Automatic discounts for running instances for a significant portion of the month.
Save up to 57% with a 1-year or 3-year commitment to a minimum level of resource usage.
Save up to 80% for fault-tolerant workloads that can be interrupted.
Guaranteed uptime with per-second billing. Best for production workloads.
50%+ cheaper preemptible instances. Best for fault-tolerant batch training.
Up to 50% off with 1, 3, or 6 month commitments. Guaranteed capacity with volume discounts.
Set up a project in the Google Cloud Console.
Set up a billing account to pay for resource usage.
Select Compute Engine, GKE, Cloud Functions, or Cloud Run based on your needs.
Launch a VM instance, configure a Kubernetes cluster, or deploy a function/application.
Use the Cloud Console, command-line tools, or APIs to manage your resources.
Start with as little as $5. No contracts, no minimums.
Filter by model, VRAM, price, and availability across the platform
Launch instances in seconds. Scale up or down anytime.
40+ regions and 120+ zones worldwide.
Role-based (free), Standard, Enhanced and Premium support plans. Comprehensive documentation, community forums, and training resources.
40+ data centers with global coverage including community and enterprise providers
24/7 expert support, comprehensive documentation, Discord community, CLI and SDK tools