LLM Pricing
Compare per-token rates across 10+ providers and 32+ models
Updated daily
Showing 58 prices, sorted by input price (lowest first)
| Model | Creator | Provider | Context (tokens) | Modalities | Knowledge cutoff | Input ($/1M tokens) | Output ($/1M tokens) |
|---|---|---|---|---|---|---|---|
| Llama 3.1 8B | Meta | Groq | 128K | Aa | Dec 2023 | $0.050 | $0.080 |
| Llama 3.1 8B | Meta | Deep Infra | 128K | Aa | Dec 2023 | $0.055 | $0.055 |
| Gemma 2 9B | Google | Deep Infra | 8K | Aa | Feb 2024 | $0.060 | $0.060 |
| Gemini 2.0 Flash | Google | Google Cloud | 1.0M | Aa | Aug 2024 | $0.100 | $0.400 |
| Gemma 2 27B | Google | Deep Infra | 8K | Aa | Feb 2024 | $0.130 | $0.130 |
| GPT-4o mini | OpenAI | OpenAI | 128K | Aa | Oct 2023 | $0.150 | $0.600 |
| Llama 3.1 8B | Meta | Together AI | 128K | Aa | Dec 2023 | $0.180 | $0.180 |
| Llama 3.1 8B | Meta | Fireworks AI | 128K | Aa | Dec 2023 | $0.200 | $0.200 |
| Mixtral 8x7B | Mistral | Deep Infra | 32K | Aa | Sep 2023 | $0.240 | $0.240 |
| Mixtral 8x7B | Mistral | Groq | 32K | Aa | Sep 2023 | $0.240 | $0.240 |
| Llama 3.1 70B | Meta | Deep Infra | 128K | Aa | Dec 2023 | $0.350 | $0.400 |
| Llama 3.3 70B | Meta | Deep Infra | 128K | Aa | Mar 2024 | $0.350 | $0.400 |
| Qwen 2.5 72B | Alibaba | Deep Infra | 128K | Aa | Mar 2024 | $0.350 | $0.400 |
| GPT-4.1 mini | OpenAI | OpenAI | 1.0M | Aa | Jun 2024 | $0.400 | $1.60 |
| DeepSeek V3 | DeepSeek | Deep Infra | 64K | Aa | Jun 2024 | $0.490 | $0.890 |
| Mixtral 8x7B | Mistral | Fireworks AI | 32K | Aa | Sep 2023 | $0.500 | $0.500 |
| Gemini 3 Flash | Google | Google Cloud | 1.0M | Aa | Jun 2025 | $0.500 | $3.00 |
| DeepSeek R1 | DeepSeek | Deep Infra | 64K | Aa | Nov 2024 | $0.550 | $2.19 |
| Llama 3.3 70B | Meta | Groq | 128K | Aa | Mar 2024 | $0.590 | $0.790 |
| Mixtral 8x7B | Mistral | Together AI | 32K | Aa | Sep 2023 | $0.600 | $0.600 |
| Llama 3.1 70B | Meta | Replicate | 128K | Aa | Dec 2023 | $0.650 | $0.650 |
| Claude 3.5 Haiku | Anthropic | Anthropic | 200K | Aa | Apr 2024 | $0.800 | $4.00 |
| Llama 3.1 70B | Meta | Together AI | 128K | Aa | Dec 2023 | $0.880 | $0.880 |
| Llama 3.3 70B | Meta | Together AI | 128K | Aa | Mar 2024 | $0.880 | $0.880 |
| Qwen 2.5 72B | Alibaba | Together AI | 128K | Aa | Mar 2024 | $0.900 | $0.900 |
| DeepSeek V3 | DeepSeek | Together AI | 64K | Aa | Jun 2024 | $0.900 | $0.900 |
| Llama 3.1 70B | Meta | Fireworks AI | 128K | Aa | Dec 2023 | $0.900 | $0.900 |
| Llama 3.3 70B | Meta | Fireworks AI | 128K | Aa | Mar 2024 | $0.900 | $0.900 |
| Qwen 2.5 72B | Alibaba | Fireworks AI | 128K | Aa | Mar 2024 | $0.900 | $0.900 |
| DeepSeek V3 | DeepSeek | Fireworks AI | 64K | Aa | Jun 2024 | $0.900 | $0.900 |
| Llama 3.3 70B | Meta | Amazon AWS | 128K | Aa | Mar 2024 | $0.990 | $0.990 |
| Claude Haiku 4.5 | Anthropic | Anthropic | 200K | Aa | Apr 2025 | $1.00 | $5.00 |
| o3-mini | OpenAI | OpenAI | 200K | Aa | Oct 2024 | $1.10 | $4.40 |
| o1-mini | OpenAI | OpenAI | 128K | Aa | Jun 2024 | $1.10 | $4.40 |
| GPT-5 | OpenAI | OpenAI | 200K | Aa | Dec 2024 | $1.25 | $10.00 |
| GPT-5.1 | OpenAI | OpenAI | 256K | Aa | Mar 2025 | $1.50 | $12.00 |
| GPT-5.2 | OpenAI | Microsoft Azure | 400K | Aa | Jun 2025 | $1.75 | $14.00 |
| GPT-5.2 | OpenAI | OpenAI | 400K | Aa | Jun 2025 | $1.75 | $14.00 |
| Llama 3.1 405B | Meta | Deep Infra | 128K | Aa | Dec 2023 | $1.79 | $1.79 |
| Mistral Large | Mistral | Deep Infra | 128K | Aa | Jan 2024 | $2.00 | $6.00 |
| GPT-4.1 | OpenAI | OpenAI | 1.0M | Aa | Jun 2024 | $2.00 | $8.00 |
| Mistral Large | Mistral | Together AI | 128K | Aa | Jan 2024 | $2.00 | $6.00 |
| Mistral Large | Mistral | Fireworks AI | 128K | Aa | Jan 2024 | $2.00 | $6.00 |
| Gemini 3 Pro | Google | Google Cloud | 1.0M | Aa | Jun 2025 | $2.00 | $12.00 |
| GPT-4o | OpenAI | Microsoft Azure | 128K | Aa | Oct 2023 | $2.50 | $10.00 |
| GPT-4o | OpenAI | OpenAI | 128K | Aa | Oct 2023 | $2.50 | $10.00 |
| Claude Sonnet 4.5 | Anthropic | Amazon AWS | 200K | Aa | Apr 2025 | $3.00 | $15.00 |
| DeepSeek R1 | DeepSeek | Together AI | 64K | Aa | Nov 2024 | $3.00 | $7.00 |
| DeepSeek R1 | DeepSeek | Fireworks AI | 64K | Aa | Nov 2024 | $3.00 | $8.00 |
| Llama 3.1 405B | Meta | Fireworks AI | 128K | Aa | Dec 2023 | $3.00 | $3.00 |
| Claude 3.5 Sonnet | Anthropic | Anthropic | 200K | Aa | Apr 2024 | $3.00 | $15.00 |
| Claude Sonnet 4.5 | Anthropic | Anthropic | 200K | Aa | Apr 2025 | $3.00 | $15.00 |
| Llama 3.1 405B | Meta | Together AI | 128K | Aa | Dec 2023 | $3.50 | $3.50 |
| Claude Opus 4.5 | Anthropic | Amazon AWS | 200K | Aa | Apr 2025 | $5.00 | $25.00 |
| Claude Opus 4.5 | Anthropic | Anthropic | 200K | Aa | Apr 2025 | $5.00 | $25.00 |
| Llama 3.1 405B | Meta | Replicate | 128K | Aa | Dec 2023 | $9.50 | $9.50 |
| o1 | OpenAI | OpenAI | 200K | Aa | Jun 2024 | $15.00 | $60.00 |
| Claude 3 Opus | Anthropic | Anthropic | 200K | Aa | Aug 2023 | $15.00 | $75.00 |
Understanding LLM Pricing
Input vs Output Tokens
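LLM APIs charge separately for input tokens (your prompts) and output tokens (model responses). Output tokens usually cost more than input tokens: in the table above the ratio ranges from 1x (many hosted open-weight models) to 8x (the GPT-5 family), with 4-5x common among proprietary models.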
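To make the input/output split concrete, here is a minimal sketch of the per-request arithmetic. The `Rate` class, the `request_cost` helper, and the token counts are hypothetical and chosen only for illustration; the two rates are copied from the GPT-4o mini row of the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rate:
    """Per-million-token prices in USD (hypothetical helper type)."""
    input_per_million: float
    output_per_million: float


def request_cost(rate: Rate, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request: each side is billed at its own rate."""
    return (
        input_tokens * rate.input_per_million
        + output_tokens * rate.output_per_million
    ) / 1_000_000


# Rates taken from the GPT-4o mini row of the table.
gpt_4o_mini = Rate(input_per_million=0.150, output_per_million=0.600)

# Assumed workload: a 2,000-token prompt and a 500-token response.
cost = request_cost(gpt_4o_mini, input_tokens=2_000, output_tokens=500)
print(f"${cost:.6f}")
```

In this example the 500 output tokens cost as much as the 2,000 input tokens, because the output rate is four times the input rate.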
Context Windows
The context window is the maximum number of tokens a model can process in a single request. Larger context windows allow for longer conversations and analysis of longer documents.
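As a rough illustration, a pre-flight check can compare a request's token budget against the context window before sending it. The sketch below assumes the window has to hold both the prompt and the space reserved for the response, and it treats the table's "8K" and "128K" as 8,000 and 128,000 tokens; the `fits_context` helper and the token counts are hypothetical.

```python
def fits_context(prompt_tokens: int, max_output_tokens: int, context_window: int) -> bool:
    """Return True if the prompt plus the reserved output fits in the window."""
    return prompt_tokens + max_output_tokens <= context_window


# Context sizes from the table, read as round thousands (an assumption).
GEMMA_2_9B_CONTEXT = 8_000
LLAMA_3_1_8B_CONTEXT = 128_000

# Assumed request: a 20,000-token document with 1,000 tokens reserved for the reply.
prompt_tokens = 20_000
max_output_tokens = 1_000

print(fits_context(prompt_tokens, max_output_tokens, GEMMA_2_9B_CONTEXT))    # False
print(fits_context(prompt_tokens, max_output_tokens, LLAMA_3_1_8B_CONTEXT))  # True
```

Real token counts depend on each model's tokenizer, so the same text may not produce the same count across providers.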
Open vs Proprietary Models
Open-weight models (Llama, Mistral) are often cheaper per token but may require more tuning. Proprietary models (GPT-4, Claude) typically offer better out-of-the-box performance.
Batch Discounts
Some providers, including OpenAI and Anthropic, offer discounts of around 50% for batch (asynchronous) API usage. Consider batch APIs for workloads that are not time-sensitive to reduce costs.
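A quick way to judge whether batching is worthwhile is to price the same workload both ways. This sketch assumes a flat 50% discount applied to both input and output tokens, which may not match every provider's terms; the `workload_cost` helper and the monthly token volumes are hypothetical, and the rates come from the Claude Sonnet 4.5 rows of the table.

```python
def workload_cost(
    input_tokens: int,
    output_tokens: int,
    input_per_million: float,
    output_per_million: float,
    discount: float = 0.0,
) -> float:
    """Return the USD cost of a workload, with an optional fractional discount."""
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    standard = (
        input_tokens * input_per_million + output_tokens * output_per_million
    ) / 1_000_000
    return standard * (1.0 - discount)


# Assumed monthly volume: 10M input tokens and 2M output tokens.
# Rates from the Claude Sonnet 4.5 rows: $3.00 in, $15.00 out per 1M tokens.
standard = workload_cost(10_000_000, 2_000_000, 3.00, 15.00)
batched = workload_cost(10_000_000, 2_000_000, 3.00, 15.00, discount=0.5)

print(f"standard: ${standard:.2f}")  # standard: $60.00
print(f"batched:  ${batched:.2f}")   # batched:  $30.00
```

The saving scales linearly with volume, so the trade-off is mainly about whether the workload can tolerate the delay of asynchronous processing.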