LLM Pricing

Compare per-token rates across 10+ providers and 32+ LLMs

Updated daily

Showing 58 prices
Model | Developer | Provider | Input per 1M tokens | Output per 1M tokens | Context | Date
Llama 3.1 8B | Meta | Groq | $0.050 | $0.080 | 128K | Dec 2023
Llama 3.1 8B | Meta | Deep Infra | $0.055 | $0.055 | 128K | Dec 2023
Gemma 2 9B | Google | Deep Infra | $0.060 | $0.060 | 8K | Feb 2024
Gemini 2.0 Flash | Google | Google Cloud | $0.100 | $0.400 | 1.0M | Aug 2024
Gemma 2 27B | Google | Deep Infra | $0.130 | $0.130 | 8K | Feb 2024
GPT-4o mini | OpenAI | OpenAI | $0.150 | $0.600 | 128K | Oct 2023
Llama 3.1 8B | (not listed) | (not listed) | $0.180 | $0.180 | 128K | Dec 2023
Llama 3.1 8B | (not listed) | (not listed) | $0.200 | $0.200 | 128K | Dec 2023
Mixtral 8x7B | Mistral | Deep Infra | $0.240 | $0.240 | 32K | Sep 2023
Mixtral 8x7B | Mistral | Groq | $0.240 | $0.240 | 32K | Sep 2023
Llama 3.1 70B | Meta | Deep Infra | $0.350 | $0.400 | 128K | Dec 2023
Llama 3.3 70B | Meta | Deep Infra | $0.350 | $0.400 | 128K | Mar 2024
Qwen 2.5 72B | Alibaba | Deep Infra | $0.350 | $0.400 | 128K | Mar 2024
GPT-4.1 mini | OpenAI | OpenAI | $0.400 | $1.60 | 1.0M | Jun 2024
DeepSeek V3 | DeepSeek | Deep Infra | $0.490 | $0.890 | 64K | Jun 2024
Mixtral 8x7B | Mistral | Fireworks AI | $0.500 | $0.500 | 32K | Sep 2023
Gemini 3 Flash | Google | Google Cloud | $0.500 | $3.00 | 1.0M | Jun 2025
DeepSeek R1 | DeepSeek | Deep Infra | $0.550 | $2.19 | 64K | Nov 2024
Llama 3.3 70B | Meta | Groq | $0.590 | $0.790 | 128K | Mar 2024
Mixtral 8x7B | Mistral | Together AI | $0.600 | $0.600 | 32K | Sep 2023
Llama 3.1 70B | Meta | Replicate | $0.650 | $0.650 | 128K | Dec 2023
Claude 3.5 Haiku | Anthropic | Anthropic | $0.800 | $4.00 | 200K | Apr 2024
Llama 3.1 70B | (not listed) | (not listed) | $0.880 | $0.880 | 128K | Dec 2023
Llama 3.3 70B | (not listed) | (not listed) | $0.880 | $0.880 | 128K | Mar 2024
Qwen 2.5 72B | Alibaba | Together AI | $0.900 | $0.900 | 128K | Mar 2024
DeepSeek V3 | DeepSeek | Together AI | $0.900 | $0.900 | 64K | Jun 2024
Llama 3.1 70B | (not listed) | (not listed) | $0.900 | $0.900 | 128K | Dec 2023
Llama 3.3 70B | (not listed) | (not listed) | $0.900 | $0.900 | 128K | Mar 2024
Qwen 2.5 72B | Alibaba | Fireworks AI | $0.900 | $0.900 | 128K | Mar 2024
DeepSeek V3 | DeepSeek | Fireworks AI | $0.900 | $0.900 | 64K | Jun 2024
Llama 3.3 70B | Meta | Amazon AWS | $0.990 | $0.990 | 128K | Mar 2024
Claude Haiku 4.5 | Anthropic | Anthropic | $1.00 | $5.00 | 200K | Apr 2025
o3-mini | OpenAI | OpenAI | $1.10 | $4.40 | 200K | Oct 2024
o1-mini | OpenAI | OpenAI | $1.10 | $4.40 | 128K | Jun 2024
GPT-5 | OpenAI | OpenAI | $1.25 | $10.00 | 200K | Dec 2024
GPT-5.1 | OpenAI | OpenAI | $1.50 | $12.00 | 256K | Mar 2025
GPT-5.2 | (not listed) | (not listed) | $1.75 | $14.00 | 400K | Jun 2025
GPT-5.2 | OpenAI | OpenAI | $1.75 | $14.00 | 400K | Jun 2025
Llama 3.1 405B | Meta | Deep Infra | $1.79 | $1.79 | 128K | Dec 2023
Mistral Large | Mistral | Deep Infra | $2.00 | $6.00 | 128K | Jan 2024
GPT-4.1 | OpenAI | OpenAI | $2.00 | $8.00 | 1.0M | Jun 2024
Mistral Large | Mistral | Together AI | $2.00 | $6.00 | 128K | Jan 2024
Mistral Large | Mistral | Fireworks AI | $2.00 | $6.00 | 128K | Jan 2024
Gemini 3 Pro | Google | Google Cloud | $2.00 | $12.00 | 1.0M | Jun 2025
GPT-4o | (not listed) | (not listed) | $2.50 | $10.00 | 128K | Oct 2023
GPT-4o | OpenAI | OpenAI | $2.50 | $10.00 | 128K | Oct 2023
Claude Sonnet 4.5 | Anthropic | Amazon AWS | $3.00 | $15.00 | 200K | Apr 2025
DeepSeek R1 | DeepSeek | Together AI | $3.00 | $7.00 | 64K | Nov 2024
DeepSeek R1 | DeepSeek | Fireworks AI | $3.00 | $8.00 | 64K | Nov 2024
Llama 3.1 405B | (not listed) | (not listed) | $3.00 | $3.00 | 128K | Dec 2023
Claude 3.5 Sonnet | Anthropic | Anthropic | $3.00 | $15.00 | 200K | Apr 2024
Claude Sonnet 4.5 | Anthropic | Anthropic | $3.00 | $15.00 | 200K | Apr 2025
Llama 3.1 405B | (not listed) | (not listed) | $3.50 | $3.50 | 128K | Dec 2023
Claude Opus 4.5 | Anthropic | Amazon AWS | $5.00 | $25.00 | 200K | Apr 2025
Claude Opus 4.5 | Anthropic | Anthropic | $5.00 | $25.00 | 200K | Apr 2025
Llama 3.1 405B | Meta | Replicate | $9.50 | $9.50 | 128K | Dec 2023
o1 | OpenAI | OpenAI | $15.00 | $60.00 | 200K | Jun 2024
Claude 3 Opus | Anthropic | Anthropic | $15.00 | $75.00 | 200K | Aug 2023

Understanding LLM Pricing

Input vs Output Tokens

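LLM APIs charge separately for input tokens (your prompts) and output tokens (model responses). Output tokens usually cost more: in the table above, the OpenAI, Anthropic, and Gemini models charge 4-8x the input rate for output (for example, GPT-4o at $2.50 in and $10.00 out), while many hosted open models charge the same rate for both.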
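
To make the arithmetic concrete, here is a minimal sketch of how a per-request cost follows from per-1M-token rates. The function name `request_cost` and the token counts are illustrative assumptions; the rates are the GPT-4o mini figures listed in the table.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_rate: float, output_rate: float) -> float:
    """Return the USD cost of one request.

    Rates are quoted in USD per 1M tokens, as in the table above.
    """
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 500 response tokens.
# Rates: GPT-4o mini as listed above ($0.150 in, $0.600 out per 1M tokens).
cost = request_cost(2_000, 500, input_rate=0.150, output_rate=0.600)
print(f"${cost:.6f}")
```

In this example the 500 output tokens cost as much as the 2,000 input tokens, which is why response length often dominates the bill for chat-style workloads.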

Context Windows

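The context window is the maximum number of tokens a model can process in a single request, generally counting both the prompt and the generated response. Larger context windows allow longer conversations and longer documents; the models listed here range from 8K to 1.0M tokens.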
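
As a rough illustration, the sketch below checks whether a request fits a model's context window. It rests on two assumptions that are not stated on this page: that a label such as "128K" means 128,000 tokens (the conservative reading; some models use 1,024-based sizes), and that prompt and response share one window. The names `parse_context` and `fits` are hypothetical helpers, and token counts are taken as given rather than computed with a tokenizer.

```python
def parse_context(label: str) -> int:
    """Convert a label such as '128K' or '1.0M' to a token count.

    Assumption: K = 1,000 and M = 1,000,000.
    """
    multipliers = {"K": 1_000, "M": 1_000_000}
    return round(float(label[:-1]) * multipliers[label[-1].upper()])


def fits(prompt_tokens: int, max_output_tokens: int, context_label: str) -> bool:
    """Return True if prompt plus reserved output fits in the window."""
    return prompt_tokens + max_output_tokens <= parse_context(context_label)


# Hypothetical request: a 7,500-token prompt with 1,000 tokens reserved for output.
print(fits(7_500, 1_000, "8K"))    # 8K window, as listed for Gemma 2 9B
print(fits(7_500, 1_000, "128K"))  # 128K window, as listed for Llama 3.1 8B
```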

Open vs Proprietary Models

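Open-source models (Llama, Mistral) are often cheaper but may require more tuning, and the price for the same model varies by provider: Llama 3.1 405B is listed above at anywhere from $1.79 to $9.50 per 1M input tokens. Proprietary models (GPT-4, Claude) typically offer better out-of-the-box performance.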

Batch Discounts

Many providers discount batch (asynchronous) API usage, often by 50%. For workloads that are not time-sensitive, moving requests to a batch API can reduce costs substantially.
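
The effect of a batch discount on a monthly bill can be estimated with a few lines of arithmetic. This is a sketch under stated assumptions: the function name `monthly_cost`, the traffic volumes, and the share of traffic moved to batch are all hypothetical, and the 50% discount is the figure quoted above rather than any specific provider's published rate.

```python
def monthly_cost(input_tokens: int, output_tokens: int,
                 input_rate: float, output_rate: float,
                 batch_share: float = 0.0,
                 batch_discount: float = 0.5) -> float:
    """Estimate monthly USD spend given per-1M-token rates.

    batch_share is the fraction of traffic (0.0 to 1.0) sent through a
    batch API; batch_discount is the assumed discount on that traffic.
    """
    base = (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000
    return base * (1 - batch_share * batch_discount)


# Hypothetical workload: 100M input and 20M output tokens per month at
# $3.00 in / $15.00 out per 1M tokens (the Claude Sonnet 4.5 rates above).
realtime = monthly_cost(100_000_000, 20_000_000, 3.00, 15.00)
mixed = monthly_cost(100_000_000, 20_000_000, 3.00, 15.00, batch_share=0.6)
print(f"all real-time: ${realtime:.2f}")
print(f"60% batched:   ${mixed:.2f}")
```

Under these assumptions, moving 60% of the traffic to batch lowers the estimate from $600.00 to $420.00, a 30% saving.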