Compare GPU and LLM inference API pricing between Google Cloud and Latitude.sh. Find the best rates for AI training, inference, and ML workloads.
| GPU Model | Google Cloud Price | Latitude.sh Price | Price Diff | Sources |
|---|---|---|---|---|
| A100 SXM 80GB VRAM • Latitude.sh | Not Available | | — | |
| A30 24GB VRAM • Latitude.sh | Not Available | | — | |
| GH200 96GB VRAM • Latitude.sh | Not Available | | — | |
| H100 NVL 94GB VRAM • Latitude.sh | Not Available | 8x GPU | — | |
| H100 SXM 80GB VRAM • Latitude.sh | Not Available | | — | |
| L40 40GB VRAM • Latitude.sh | Not Available | | — | |
| L40S 48GB VRAM • Latitude.sh | Not Available | | — | |
| RTX 6000 Ada 48GB VRAM • Latitude.sh | Not Available | | — | |
| RTX 6000 Pro 96GB VRAM • Latitude.sh | Not Available | | — | |
| RTX A5000 24GB VRAM • Latitude.sh | Not Available | | — | |
| RTX A6000 48GB VRAM • Latitude.sh | Not Available | | — | |
| RTX PRO 6000 96GB VRAM • Latitude.sh | Not Available | 8x GPU | — | |
| Tesla T4 16GB VRAM • Google Cloud | | Not Available | — | |
| Tesla V100 32GB VRAM • Google Cloud | | Not Available | — | |
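The page does not say how the Price Diff column is calculated. The following is a minimal sketch, assuming the difference is expressed as a percentage relative to the first provider's hourly price and left blank when either provider does not list the GPU. The function names `price_diff_percent` and `format_diff` are hypothetical.

```python
from typing import Optional


def price_diff_percent(price_a: Optional[float],
                       price_b: Optional[float]) -> Optional[float]:
    """Percent by which provider B is more (+) or less (-) expensive than A.

    Returns None when either price is missing, which matches the
    "Not Available" rows in the table (assumed convention).
    """
    if price_a is None or price_b is None or price_a == 0:
        return None
    return (price_b - price_a) / price_a * 100.0


def format_diff(diff: Optional[float]) -> str:
    """Render a difference for display; missing values become 'n/a'."""
    return "n/a" if diff is None else f"{diff:+.1f}%"


# Hypothetical hourly prices, for illustration only.
print(format_diff(price_diff_percent(2.0, 3.0)))
print(format_diff(price_diff_percent(None, 3.0)))
```

Because every GPU in the table is listed by only one of the two providers, a calculation like this would yield no difference for any row, which is consistent with the blank Price Diff column.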
Explore how these providers compare to other popular GPU cloud services
Compare Google Cloud with another leading provider
Scalable virtual machines with a wide range of machine types, including GPUs.
Managed Kubernetes service for deploying and managing containerized applications.
Event-driven serverless compute platform.
Fully managed serverless platform for containerized applications.
Unified ML platform for building, deploying, and managing ML models.
Short-lived compute instances at a significant discount, suitable for fault-tolerant workloads.
Provision servers with user data, RAID, and SSH over a documented REST API.
RTX 6000 Ada, H100, and L40S capacity with dual 100 Gbps networking on multi-GPU nodes.
20 locations spanning the US, Europe, Latin America, and Asia-Pacific.
Hourly and monthly rates published for GPU and CPU bare metal.
Offers customizable virtual machines running in Google's data centers.
Managed Kubernetes service for running containerized applications.
Serverless compute platform for running code in response to events.
Dedicated GPU servers tuned for AI training and inference.
Virtualized GPU plans for quick starts and cost-effective deployment.
General-purpose dedicated servers with high core counts and NVMe.
Pay for compute capacity per hour or per second, with no long-term commitments.
Automatic discounts for running instances for a significant portion of the month.
Save up to 57% with a 1-year or 3-year commitment to a minimum level of resource usage.
Save up to 80% for fault-tolerant workloads that can be interrupted.
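To make the discount figures above concrete, here is a small sketch that applies the quoted upper bounds (57% for commitments, 80% for interruptible workloads) to an on-demand hourly rate. The on-demand price used is a hypothetical placeholder, not a published rate, and real savings depend on machine type and region.

```python
# Upper-bound discounts quoted in the text above; actual savings vary.
MAX_COMMITTED_USE_DISCOUNT = 0.57
MAX_SPOT_DISCOUNT = 0.80


def discounted_rate(on_demand_hourly: float, discount: float) -> float:
    """Return the hourly rate after applying a fractional discount (0 to 1)."""
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    return on_demand_hourly * (1.0 - discount)


# Hypothetical on-demand price, for illustration only.
on_demand = 10.00
committed = discounted_rate(on_demand, MAX_COMMITTED_USE_DISCOUNT)
spot = discounted_rate(on_demand, MAX_SPOT_DISCOUNT)
print(f"committed use (best case): ${committed:.2f}/hr")
print(f"interruptible (best case): ${spot:.2f}/hr")
```

These are best-case figures, since the text says "up to"; treat them as a lower bound on cost rather than an estimate.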
Hourly billing for dedicated GPU servers including RTX 6000 Ada configurations with transparent published pricing.
Cost-effective virtualized GPU options with H100 and L40S available in select regions.
Hourly and monthly billing options for CPU-focused bare metal servers with regional pricing variations.
Published hourly rates for all GPU and CPU plans with pricing that varies by region (US, Brazil, Europe, APAC).
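Where both hourly and monthly billing are offered, the choice depends on expected usage. The sketch below computes the break-even point under the assumption that the monthly rate is a flat fee and the hourly rate is billed only for hours used. The rates shown are hypothetical placeholders, not published prices.

```python
def break_even_hours(hourly_rate: float, monthly_rate: float) -> float:
    """Hours of use per month above which the flat monthly rate is cheaper."""
    if hourly_rate <= 0:
        raise ValueError("hourly_rate must be positive")
    return monthly_rate / hourly_rate


def cheaper_option(hourly_rate: float, monthly_rate: float,
                   expected_hours: float) -> str:
    """Return 'hourly' or 'monthly' for the expected monthly usage."""
    hourly_total = hourly_rate * expected_hours
    return "hourly" if hourly_total <= monthly_rate else "monthly"


# Hypothetical rates, for illustration only.
print(break_even_hours(hourly_rate=2.0, monthly_rate=1000.0))
print(cheaper_option(2.0, 1000.0, expected_hours=300))
print(cheaper_option(2.0, 1000.0, expected_hours=700))
```

Since pricing varies by region, the break-even point should be recomputed for each location being considered.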
Set up a project in the Google Cloud Console.
Set up a billing account to pay for resource usage.
Select Compute Engine, GKE, Cloud Functions, or Cloud Run based on your needs.
Launch a VM instance, configure a Kubernetes cluster, or deploy a function/application.
Use the Cloud Console, command-line tools, or APIs to manage your resources.
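As one illustration of the command-line route, the sketch below assembles a `gcloud compute instances create` command for a VM with a single Tesla T4 attached. It only builds and prints the command; it does not run it. The instance name, project ID, zone, and machine type are placeholder assumptions, and the helper `build_create_instance_command` is hypothetical. GPU availability differs by zone, so check the zone before using it.

```python
import shlex
from typing import List


def build_create_instance_command(name: str, project: str, zone: str,
                                  machine_type: str, accelerator_type: str,
                                  accelerator_count: int = 1) -> List[str]:
    """Build (but do not execute) a gcloud command for a GPU VM."""
    return [
        "gcloud", "compute", "instances", "create", name,
        f"--project={project}",
        f"--zone={zone}",
        f"--machine-type={machine_type}",
        f"--accelerator=type={accelerator_type},count={accelerator_count}",
        # VMs with attached GPUs cannot live-migrate during host maintenance.
        "--maintenance-policy=TERMINATE",
    ]


# Placeholder values, for illustration only.
cmd = build_create_instance_command(
    name="gpu-demo",
    project="my-project",
    zone="us-central1-a",
    machine_type="n1-standard-4",
    accelerator_type="nvidia-tesla-t4",
)
print(shlex.join(cmd))
```

Printing the command first makes it easy to review the project, zone, and accelerator settings before anything is provisioned and billed.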
40+ regions and 120+ zones worldwide.
Role-based (free), Standard, Enhanced, and Premium support plans, plus comprehensive documentation, community forums, and training resources.
20 locations across Dallas, Los Angeles, New York, Chicago, Ashburn, Miami, London (2), Frankfurt (2), Amsterdam, Sao Paulo (2), Mexico City, Buenos Aires, Bogota, Santiago, Singapore, Tokyo (2), and Sydney (2).
API reference, a sales contact, and a trust center; platform tooling exposes SSH, RAID, and user-data options at provisioning time.