Skip to main content
Amazon AWS logo

Amazon AWS

Comprehensive cloud platform with global reach

Classical hyperscaler🇺🇸 USinferenceenterprisemulti-provider

Last reviewed Mar 14, 2026

AWS provides a comprehensive suite of cloud computing services, including compute, storage, and GPU solutions for diverse workloads.

7
GPU Models
$0.53
From / hour
89
LLM Models
$0.02
From / 1M input

Available GPUs

Hourly on-demand pricing. Click column headers to sort.

Prices last updated: May 17, 2026

GPU Model
Memory
GPUs
Price / hr
Updated
Source
A1024GB
1×4×8×
$1.01/hr
5/17/2026
A100 SXM80GB
8×
$2.74/hr
5/17/2026
H100 SXM80GB
8×
$6.88/hr
5/17/2026
H200141GB
8×
$7.91/hr
5/17/2026
L424GB
1×4×8×
$0.80/hr
5/17/2026
L40S48GB
1×4×8×
$1.86/hr
5/17/2026
Tesla T416GB
1×4×8×
$0.53/hr
5/17/2026

LLM API Pricing

Pay-per-token pricing. Prices shown per 1M tokens.

Prices last updated: May 17, 2026

Pricing
ModelCreatorContextInput/1MOutput/1MUpdated
Amazon128K$0.035$0.1405/17/2026
Google131K$0.040$0.0805/17/2026
NVIDIA131K$0.060$0.2305/17/2026
NVIDIA262K$0.060$0.2405/17/2026
Amazon300K$0.060$0.2405/17/2026
OpenAI128K$0.070$0.2005/17/2026
Zhipu128K$0.070$0.4005/17/2026
Google131K$0.090$0.2905/17/2026
Meta128K$0.100$0.1005/17/2026
Mistral32K$0.100$0.3005/17/2026

Pros & Cons

Advantages

  • Broad range of compute options including GPUs
  • Highly scalable and reliable infrastructure
  • Pay-as-you-go pricing with cost optimization tools
  • Extensive global network of data centers
  • Rich ecosystem of integrated services and tools

Limitations

  • Complex pricing structure
  • Steep learning curve for new users
  • Potential for unexpected costs without proper management

Key Features

Global Infrastructure

Extensive network of data centers across multiple regions worldwide

Pay-as-you-go Pricing

Flexible pricing model with no upfront commitments required

Advanced Security

Comprehensive security tools and compliance certifications

Auto Scaling

Automatically adjust resources based on demand

Integrated Services

Extensive ecosystem of services that work seamlessly together

Developer Tools

Comprehensive suite of tools for development, deployment, and management

Compute Services

Amazon EC2

Virtual servers in the cloud with a wide range of instance types.

Amazon ECS

Fully managed container orchestration service.

  • Support for Docker containers
  • Integration with other AWS services
  • Automated cluster management and scheduling

Amazon EKS

Managed Kubernetes service for container orchestration.

  • Certified Kubernetes conformant
  • Integrates with AWS networking and security services
  • Supports both EC2 and Fargate launch types

AWS Lambda

Serverless compute service for running code without managing servers.

  • Automatic scaling and high availability
  • Pay only for compute time consumed
  • Supports multiple programming languages

Inference Services

Amazon Bedrock

Fully managed service providing access to foundation models from AI21 Labs, Anthropic, Cohere, Meta, Mistral, Stability AI, and Amazon through a unified API.

  • Multi-Provider Access: Access models from multiple AI providers through a single API
  • Model Customization: Fine-tune models on your own data with continued pretraining
  • Guardrails: Built-in safety controls and content filtering

Pricing Models

  • On-Demand: Pay per input/output token with no minimum commitment
  • Provisioned Throughput: Reserved model units for consistent performance

Pricing Options

OptionDetails
On-Demand InstancesPay for compute capacity by the second with no long-term commitments.
Spot InstancesUse spare EC2 capacity at up to 90% off the On-Demand price.
Reserved InstancesSave up to 72% compared to On-Demand pricing with a 1 or 3-year commitment.
Savings PlansSave up to 72% on compute usage with a 1 or 3-year commitment to a consistent amount of usage.

Availability & Support

Regions

30+ regions and 100+ availability zones worldwide.

Support

Basic (free), Developer, Business, Enterprise support plans with varying response times and features. Extensive documentation, forums, and training resources.

Getting Started

  1. 1

    Sign up for AWS

    Create an AWS account to access the cloud platform.

  2. 2

    Choose a compute service

    Select from EC2, Lambda, or container services based on your workload needs.

  3. 3

    Launch an instance

    Configure and launch your first compute instance or container.

  4. 4

    Set up security

    Configure security groups and access controls for your resources.

  5. 5

    Monitor and optimize

    Use AWS CloudWatch and Compute Optimizer to monitor performance and reduce costs.

Compare Providers

Find the best prices for the same GPUs from other providers

IO.NET logo

IO.NET

6 shared GPUs with Amazon AWS

Sesterce logo

Sesterce

6 shared GPUs with Amazon AWS

Koyeb logo

Koyeb

5 shared GPUs with Amazon AWS