Loading Comparison
Fetching pricing data and provider information...
Loading Comparison
Fetching pricing data and provider information...
Compare GPU and LLM inference API pricing between Cohere and RunPod. Find the best rates for AI training, inference, and ML workloads.
Provider 1
Provider 2
| GPU Model ↑ | Cohere Price | RunPod Price | Price Diff ↕ | Sources |
|---|---|---|---|---|
A100 PCIE 40GB VRAM • RunPod | Not Available | — | ||
A100 PCIE 40GB VRAM • | ||||
A100 SXM 80GB VRAM • RunPod | Not Available | — | ||
A100 SXM 80GB VRAM • | ||||
A2 16GB VRAM • RunPod | Not Available | — | ||
A2 16GB VRAM • | ||||
B200 192GB VRAM • RunPod | Not Available | — | ||
B200 192GB VRAM • | ||||
H100 NVL 94GB VRAM • RunPod | Not Available | — | ||
H100 NVL 94GB VRAM • | ||||
H100 PCIe 80GB VRAM • RunPod | Not Available | — | ||
H100 PCIe 80GB VRAM • | ||||
H100 SXM 80GB VRAM • RunPod | Not Available | — | ||
H100 SXM 80GB VRAM • | ||||
H200 141GB VRAM • RunPod | Not Available | — | ||
H200 141GB VRAM • | ||||
HGX B300 288GB VRAM • RunPod | Not Available | — | ||
HGX B300 288GB VRAM • | ||||
L40 40GB VRAM • RunPod | Not Available | — | ||
L40 40GB VRAM • | ||||
L40S 48GB VRAM • RunPod | Not Available | — | ||
L40S 48GB VRAM • | ||||
RTX 3070 8GB VRAM • RunPod | Not Available | 2x GPU | — | |
RTX 3070 8GB VRAM • | ||||
RTX 3080 10GB VRAM • RunPod | Not Available | 2x GPU | — | |
RTX 3080 10GB VRAM • | ||||
RTX 3080 Ti 12GB VRAM • RunPod | Not Available | 2x GPU | — | |
RTX 3080 Ti 12GB VRAM • | ||||
RTX 3090 24GB VRAM • RunPod | Not Available | — | ||
RTX 3090 24GB VRAM • | ||||
A100 PCIE 40GB VRAM • RunPod | Not Available | — | ||
A100 PCIE 40GB VRAM • | ||||
A100 SXM 80GB VRAM • RunPod | Not Available | — | ||
A100 SXM 80GB VRAM • | ||||
A2 16GB VRAM • RunPod | Not Available | — | ||
A2 16GB VRAM • | ||||
B200 192GB VRAM • RunPod | Not Available | — | ||
B200 192GB VRAM • | ||||
H100 NVL 94GB VRAM • RunPod | Not Available | — | ||
H100 NVL 94GB VRAM • | ||||
H100 PCIe 80GB VRAM • RunPod | Not Available | — | ||
H100 PCIe 80GB VRAM • | ||||
H100 SXM 80GB VRAM • RunPod | Not Available | — | ||
H100 SXM 80GB VRAM • | ||||
H200 141GB VRAM • RunPod | Not Available | — | ||
H200 141GB VRAM • | ||||
HGX B300 288GB VRAM • RunPod | Not Available | — | ||
HGX B300 288GB VRAM • | ||||
L40 40GB VRAM • RunPod | Not Available | — | ||
L40 40GB VRAM • | ||||
L40S 48GB VRAM • RunPod | Not Available | — | ||
L40S 48GB VRAM • | ||||
RTX 3070 8GB VRAM • RunPod | Not Available | 2x GPU | — | |
RTX 3070 8GB VRAM • | ||||
RTX 3080 10GB VRAM • RunPod | Not Available | 2x GPU | — | |
RTX 3080 10GB VRAM • | ||||
RTX 3080 Ti 12GB VRAM • RunPod | Not Available | 2x GPU | — | |
RTX 3080 Ti 12GB VRAM • | ||||
RTX 3090 24GB VRAM • RunPod | Not Available | — | ||
RTX 3090 24GB VRAM • | ||||
Explore how these providers compare to other popular GPU cloud services
Compare Cohere with another leading provider
Compare Cohere with another leading provider
Compare Cohere with another leading provider
Compare Cohere with another leading provider
Compare Cohere with another leading provider
Compare Cohere with another leading provider
High-performance language models supporting 23 languages with Command, Command R, and Command R+ variants
Research-grade multilingual models (8B and 32B) excelling across diverse languages
Multimodal semantic search and relevance optimization for retrieval-augmented generation
Deploy via cloud API, virtual private cloud, on-premises, or Cohere-managed Model Vault
Enterprise-ready AI platform for workplace productivity with intelligent search
Fine-tune models on proprietary data for domain-specific applications
Access to a wide range of GPU types with enterprise-grade security
Only pay for the compute time you actually use
Programmatically manage your GPU instances via REST API
Pods typically ready in 20-30 s
SSH & VS Code tunnels built-in
Automatic migration on pre-empt
On‑demand single‑node GPU instances with flexible templates and storage.
Spin up multi‑node GPU clusters in minutes with auto networking.
Per million token pricing starting at $0.30/$0.60 for Command-light
Free tier with rate limiting for development and testing
Custom pricing for dedicated deployments, Model Vault, and on-premises
Sign up at dashboard.cohere.com
Generate a trial or production API key from the dashboard
pip install cohere (Python) or npm install cohere-ai (TypeScript)
Call the Chat or Generate endpoint with your API key
Sign up for RunPod using your email or GitHub account
Add a credit card or cryptocurrency payment method
Select a template and GPU type to launch your first instance
Global API access with deployment options including AWS, GCP, Azure, and on-premises installations
Documentation, API reference, cookbooks, Discord community, and enterprise support options