Loading Comparison
Fetching pricing data and provider information...
Loading Comparison
Fetching pricing data and provider information...
Compare GPU and LLM inference API pricing between Deep Infra and Vast.ai. Find the best rates for AI training, inference, and ML workloads.
Provider 1
Provider 2
Average Price Difference: $0.42/hour between comparable GPUs
| GPU Model ↑ | Deep Infra Price | Vast.ai Price | Price Diff ↕ | Sources |
|---|---|---|---|---|
A10 24GB VRAM • Vast.ai | Not Available | — | ||
A10 24GB VRAM • | ||||
A100 PCIE 40GB VRAM • Vast.ai | Not Available | — | ||
A100 PCIE 40GB VRAM • | ||||
A100 SXM 80GB VRAM • Deep InfraVast.ai | ↑+$0.42(+91.4%) | |||
A100 SXM 80GB VRAM • $0.89/hour Updated: 5/14/2026 $0.47/hour Updated: 5/11/2026 ★Best Price Price Difference:↑+$0.42(+91.4%) | ||||
A40 48GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
A40 48GB VRAM • | ||||
B200 192GB VRAM • Deep Infra | Not Available | — | ||
B200 192GB VRAM • | ||||
H100 SXM 80GB VRAM • Deep Infra | Not Available | — | ||
H100 SXM 80GB VRAM • | ||||
H200 141GB VRAM • Deep Infra | Not Available | — | ||
H200 141GB VRAM • | ||||
HGX B300 288GB VRAM • Deep Infra | Not Available | — | ||
HGX B300 288GB VRAM • | ||||
L4 24GB VRAM • Vast.ai | Not Available | — | ||
L4 24GB VRAM • | ||||
L40 40GB VRAM • Vast.ai | Not Available | — | ||
L40 40GB VRAM • | ||||
L40S 48GB VRAM • Vast.ai | Not Available | — | ||
L40S 48GB VRAM • | ||||
RTX 3070 8GB VRAM • Vast.ai | Not Available | 4x GPU | — | |
RTX 3070 8GB VRAM • | ||||
RTX 3070 Ti 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 3070 Ti 8GB VRAM • | ||||
RTX 3080 10GB VRAM • Vast.ai | Not Available | 4x GPU | — | |
RTX 3080 10GB VRAM • | ||||
RTX 3080 Ti 12GB VRAM • Vast.ai | Not Available | — | ||
RTX 3080 Ti 12GB VRAM • | ||||
A10 24GB VRAM • Vast.ai | Not Available | — | ||
A10 24GB VRAM • | ||||
A100 PCIE 40GB VRAM • Vast.ai | Not Available | — | ||
A100 PCIE 40GB VRAM • | ||||
A100 SXM 80GB VRAM • Deep InfraVast.ai | ↑+$0.42(+91.4%) | |||
A100 SXM 80GB VRAM • $0.89/hour Updated: 5/14/2026 $0.47/hour Updated: 5/11/2026 ★Best Price Price Difference:↑+$0.42(+91.4%) | ||||
A40 48GB VRAM • Vast.ai | Not Available | 2x GPU | — | |
A40 48GB VRAM • | ||||
B200 192GB VRAM • Deep Infra | Not Available | — | ||
B200 192GB VRAM • | ||||
H100 SXM 80GB VRAM • Deep Infra | Not Available | — | ||
H100 SXM 80GB VRAM • | ||||
H200 141GB VRAM • Deep Infra | Not Available | — | ||
H200 141GB VRAM • | ||||
HGX B300 288GB VRAM • Deep Infra | Not Available | — | ||
HGX B300 288GB VRAM • | ||||
L4 24GB VRAM • Vast.ai | Not Available | — | ||
L4 24GB VRAM • | ||||
L40 40GB VRAM • Vast.ai | Not Available | — | ||
L40 40GB VRAM • | ||||
L40S 48GB VRAM • Vast.ai | Not Available | — | ||
L40S 48GB VRAM • | ||||
RTX 3070 8GB VRAM • Vast.ai | Not Available | 4x GPU | — | |
RTX 3070 8GB VRAM • | ||||
RTX 3070 Ti 8GB VRAM • Vast.ai | Not Available | — | ||
RTX 3070 Ti 8GB VRAM • | ||||
RTX 3080 10GB VRAM • Vast.ai | Not Available | 4x GPU | — | |
RTX 3080 10GB VRAM • | ||||
RTX 3080 Ti 12GB VRAM • Vast.ai | Not Available | — | ||
RTX 3080 Ti 12GB VRAM • | ||||
Explore how these providers compare to other popular GPU cloud services
Compare Deep Infra with another leading provider
Compare Deep Infra with another leading provider
Compare Deep Infra with another leading provider
Compare Deep Infra with another leading provider
Compare Deep Infra with another leading provider
Compare Deep Infra with another leading provider
OpenAI-compatible endpoints for 100+ models with autoscaling and pay-per-token billing
B200 instances with SSH access spin up in about 10 seconds and bill hourly
Deploy your own Hugging Face models onto dedicated A100, H100, H200, B200, or B300 GPUs
Published per-GPU hourly rates for A100, H100, H200, B200, and B300 with competitive pricing
All hosted models run on H100 or A100 hardware tuned for low latency
Comprehensive APIs for text, vision, image generation, video generation, speech recognition, and text-to-speech
Prices set by supply and demand across the platform with no list prices or hidden fees
GPU Cloud for full control, Serverless for zero-ops inference, Clusters for large-scale training
CLI, Python SDK, and REST API for programmatic GPU provisioning
Scale from $5 to 20,000 GPUs across 40+ data centers without contracts or minimums
Hosted model APIs with autoscaling on H100/A100 hardware.
On-demand GPU nodes with SSH access for custom workloads.
On-demand instances across 40+ data centers and 20,000+ GPUs
Deploy models as endpoints with autoscaling to zero
Dedicated multi-node GPU clusters with InfiniBand networking
OpenAI-compatible inference APIs with pay-per-request billing on H100/A100 hardware
Published transparent hourly pricing for A100, H100, H200, B200, and B300 GPUs with pay-as-you-go billing
Flexible hourly billing for dedicated instances with no prepayments or contracts required
Guaranteed uptime with per-second billing. Best for production workloads.
50%+ cheaper preemptible instances. Best for fault-tolerant batch training.
Up to 50% off with 1, 3, or 6 month commitments. Guaranteed capacity with volume discounts.
Sign up (GitHub-supported) and open the Deep Infra dashboard
Add a payment method to unlock GPU rentals and API usage
Choose serverless APIs or dedicated A100, H100, H200, B200, or B300 instances
Start instances with SSH access or call the OpenAI-compatible API endpoints
Track spend and instance status from the dashboard and shut down when idle
Start with as little as $5. No contracts, no minimums.
Filter by model, VRAM, price, and availability across the platform
Launch instances in seconds. Scale up or down anytime.
Region list not published on the GPU Instances page; promo mentions Nebraska availability alongside multi-region autoscaling messaging.
Documentation site, dashboard guidance, Discord community link, and contact-sales options.
40+ data centers with global coverage including community and enterprise providers
24/7 expert support, comprehensive documentation, Discord community, CLI and SDK tools