Compare GPU and LLM inference API pricing between Amazon AWS and Google Cloud. Find the best rates for AI training, inference, and ML workloads.
Average Price Difference: $0.37/hour between comparable GPUs
| GPU Model | Amazon AWS Price | Google Cloud Price | Price Diff |
|---|---|---|---|
| A10 (24GB VRAM) | Price not shown | Not Available | n/a |
| A100 SXM (80GB VRAM) | $2.74/hour, 8x GPU configuration (updated 4/16/2026) ★ Best Price | Not Available | n/a |
| H100 SXM (80GB VRAM) | $6.88/hour, 8x GPU configuration (updated 4/16/2026) ★ Best Price | Not Available | n/a |
| H200 (141GB VRAM) | $7.91/hour, 8x GPU configuration (updated 4/16/2026) ★ Best Price | Not Available | n/a |
| L4 (24GB VRAM) | Price not shown | Not Available | n/a |
| L40S (48GB VRAM) | Price not shown | Not Available | n/a |
| Tesla T4 (16GB VRAM) | $0.53/hour (updated 4/16/2026) | $0.16/hour (updated 3/31/2026) ★ Best Price | +$0.37 (+228.8%) |
| Tesla V100 (32GB VRAM) | Not Available | Price not shown | n/a |
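The price difference reported for the Tesla T4 can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function name `price_difference` is hypothetical, and it assumes the percentage is taken relative to the cheaper (Google Cloud) price. With the rounded prices shown on this page the percentage comes out at roughly 231%, slightly above the listed +228.8%, which presumably reflects unrounded source prices.

```python
def price_difference(price_a: float, price_b: float) -> tuple[float, float]:
    """Return (absolute difference, percent difference relative to price_b).

    price_b is treated as the baseline, so a positive result means
    price_a is more expensive than price_b.
    """
    if price_b <= 0:
        raise ValueError("baseline price must be positive")
    diff = price_a - price_b
    return diff, diff / price_b * 100


# Rounded hourly Tesla T4 prices as displayed in the comparison table.
aws_t4 = 0.53
gcp_t4 = 0.16

diff, pct = price_difference(aws_t4, gcp_t4)
print(f"Amazon AWS vs Google Cloud: {diff:+.2f} USD/hour ({pct:+.1f}%)")
```

Only the Tesla T4 has a price from both providers here, which is why the average difference between comparable GPUs equals the T4 difference of $0.37/hour.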
Amazon AWS features:
Extensive network of data centers across multiple regions worldwide.
Flexible pricing model with no upfront commitments required.
Comprehensive security tools and compliance certifications.
Automatic adjustment of resources based on demand.
Extensive ecosystem of services that work seamlessly together.
Comprehensive suite of tools for development, deployment, and management.
Google Cloud features:
Compute Engine: Scalable virtual machines with a wide range of machine types, including GPUs.
Google Kubernetes Engine (GKE): Managed Kubernetes service for deploying and managing containerized applications.
Cloud Functions: Event-driven serverless compute platform.
Cloud Run: Fully managed serverless platform for containerized applications.
Vertex AI: Unified ML platform for building, deploying, and managing ML models.
Preemptible VMs: Short-lived compute instances at a significant discount, suitable for fault-tolerant workloads.
Amazon AWS compute services:
Amazon EC2: Virtual servers in the cloud with a wide range of instance types.
Amazon ECS: Fully managed container orchestration service.
Amazon EKS: Managed Kubernetes service for container orchestration.
Google Cloud compute services:
Compute Engine: Customizable virtual machines running in Google's data centers.
Google Kubernetes Engine (GKE): Managed Kubernetes service for running containerized applications.
Cloud Functions: Serverless compute platform for running code in response to events.
Amazon AWS pricing options:
On-Demand: Pay for compute capacity by the second with no long-term commitments.
Spot Instances: Use spare EC2 capacity at up to 90% off the On-Demand price.
Reserved Instances: Save up to 72% compared to On-Demand pricing with a 1- or 3-year commitment.
Savings Plans: Save up to 72% on compute usage with a 1- or 3-year commitment to a consistent amount of usage.
Google Cloud pricing options:
On-demand: Pay for compute capacity per hour or per second, with no long-term commitments.
Sustained use discounts: Automatic discounts for running instances for a significant portion of the month.
Committed use discounts: Save up to 57% with a 1-year or 3-year commitment to a minimum level of resource usage.
Preemptible VMs: Save up to 80% for fault-tolerant workloads that can be interrupted.
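To make the "up to" discount figures above concrete, the following sketch computes the lowest hourly rate each option could reach from a given on-demand rate. It is a rough illustration, not a pricing calculator: the $1.00/hour on-demand rate is hypothetical, the option labels and the helper name `best_case_rate` are assumptions, and actual discounts depend on instance type, region, and term.

```python
# Maximum advertised discounts quoted above ("up to" figures, best case only).
MAX_DISCOUNT = {
    ("Amazon AWS", "spare capacity (interruptible)"): 0.90,
    ("Amazon AWS", "1- or 3-year commitment"): 0.72,
    ("Google Cloud", "1- or 3-year commitment"): 0.57,
    ("Google Cloud", "interruptible workloads"): 0.80,
}


def best_case_rate(on_demand_rate: float, max_discount: float) -> float:
    """Return the lowest possible hourly rate under a maximum discount."""
    if on_demand_rate < 0:
        raise ValueError("on-demand rate must be non-negative")
    if not 0 <= max_discount < 1:
        raise ValueError("discount must be in the range [0, 1)")
    return on_demand_rate * (1 - max_discount)


# Hypothetical on-demand rate, chosen only to make the percentages easy to read.
on_demand = 1.00

for (provider, option), discount in MAX_DISCOUNT.items():
    floor = best_case_rate(on_demand, discount)
    print(f"{provider:12s} {option:32s} from ${floor:.2f}/hour")
```

Because every figure is a ceiling on the discount, the computed rates are lower bounds on cost; a real workload will usually pay more.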
Getting started with Amazon AWS:
1. Create an AWS account to access the cloud platform.
2. Select from EC2, Lambda, or container services based on your workload needs.
3. Configure and launch your first compute instance or container.
4. Configure security groups and access controls for your resources.
5. Use Amazon CloudWatch and AWS Compute Optimizer to monitor performance and reduce costs.
Getting started with Google Cloud:
1. Set up a project in the Google Cloud Console.
2. Set up a billing account to pay for resource usage.
3. Select Compute Engine, GKE, Cloud Functions, or Cloud Run based on your needs.
4. Launch a VM instance, configure a Kubernetes cluster, or deploy a function or application.
5. Use the Cloud Console, command-line tools, or APIs to manage your resources.
Amazon AWS availability and support:
30+ regions and 100+ availability zones worldwide.
Basic (free), Developer, Business, and Enterprise support plans with varying response times and features. Extensive documentation, forums, and training resources.
Google Cloud availability and support:
40+ regions and 120+ zones worldwide.
Role-based (free), Standard, Enhanced, and Premium support plans. Comprehensive documentation, community forums, and training resources.