Loading Comparison
Fetching pricing data and provider information...
Loading Comparison
Fetching pricing data and provider information...
Compare GPU and LLM inference API pricing between CoreWeave and Replicate. Find the best rates for AI training, inference, and ML workloads.
Provider 1
Provider 2
Average Price Difference: $1.42/hour between comparable GPUs
| GPU Model ↑ | CoreWeave Price | Replicate Price | Price Diff ↕ | Sources |
|---|---|---|---|---|
A100 SXM 80GB VRAM • CoreWeaveReplicate | 8x GPU | ↓$2.34(46.4%) | ||
B200 192GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
B200 192GB VRAM • | ||||
GB200 384GB VRAM • CoreWeave | 41x GPU | Not Available | — | |
GB200 384GB VRAM • | ||||
GH200 96GB VRAM • CoreWeave | Not Available | — | ||
GH200 96GB VRAM • | ||||
H100 SXM 80GB VRAM • CoreWeaveReplicate | 8x GPU | ↑+$0.67(+12.1%) | ||
H200 141GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
H200 141GB VRAM • | ||||
L40 40GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
L40 40GB VRAM • | ||||
L40S 48GB VRAM • CoreWeaveReplicate | 8x GPU | ↓$1.26(35.9%) | ||
RTX PRO 6000 96GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
RTX PRO 6000 96GB VRAM • | ||||
Tesla T4 16GB VRAM • Replicate | Not Available | — | ||
Tesla T4 16GB VRAM • | ||||
A100 SXM 80GB VRAM • CoreWeaveReplicate | 8x GPU | ↓$2.34(46.4%) | ||
B200 192GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
B200 192GB VRAM • | ||||
GB200 384GB VRAM • CoreWeave | 41x GPU | Not Available | — | |
GB200 384GB VRAM • | ||||
GH200 96GB VRAM • CoreWeave | Not Available | — | ||
GH200 96GB VRAM • | ||||
H100 SXM 80GB VRAM • CoreWeaveReplicate | 8x GPU | ↑+$0.67(+12.1%) | ||
H200 141GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
H200 141GB VRAM • | ||||
L40 40GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
L40 40GB VRAM • | ||||
L40S 48GB VRAM • CoreWeaveReplicate | 8x GPU | ↓$1.26(35.9%) | ||
RTX PRO 6000 96GB VRAM • CoreWeave | 8x GPU | Not Available | — | |
RTX PRO 6000 96GB VRAM • | ||||
Tesla T4 16GB VRAM • Replicate | Not Available | — | ||
Tesla T4 16GB VRAM • | ||||
Explore how these providers compare to other popular GPU cloud services
Compare CoreWeave with another leading provider
Compare CoreWeave with another leading provider
Compare CoreWeave with another leading provider
Compare CoreWeave with another leading provider
Compare CoreWeave with another leading provider
Compare CoreWeave with another leading provider
Purpose-built AI-native platform with Kubernetes-native developer experience
First-to-market access to the latest NVIDIA GPUs including H100, H200, and Blackwell architecture
Unified security, talent services, and observability platform for large-scale AI operations
High-performance clusters with InfiniBand networking for optimal scale-out connectivity
Access thousands of open-source models including LLMs, image generators, and more
Consistent REST API across all models with webhooks for async processing
Deploy your own models using Cog containerization
Automatic scaling with cold-start optimization
On-demand and reserved GPU instances with latest NVIDIA hardware
High-performance CPU instances to complement GPU workloads
Pay-per-hour GPU and CPU instances with flexible scaling
Committed usage discounts up to 60% over on-demand pricing
No ingress, egress, or transfer fees for data movement
Charged per model run based on compute time and hardware
Limited free predictions for new users
Sign up for CoreWeave Cloud platform access
Select from latest NVIDIA GPUs including H100, H200, and Blackwell architecture
Use Kubernetes-native tools for workload deployment and scaling
Sign up at replicate.com with GitHub or email
Copy your API token from account settings
Use the API or Python client to run any model
Deployments across North America with expanding global presence
24/7 support from dedicated engineering teams, comprehensive documentation, and Kubernetes expertise
US-based infrastructure with global CDN
Documentation, Discord community, email support