LLM Pricing
Compare per-token rates across 18 providers and 214 LLM models
Updated daily
Top Picks Right Now
Click to filter tableInputs
View
Showing 1 of 1 chat models · 5 records — filteredMiniMax M2.5
Understanding LLM Pricing
Input vs Output Tokens
LLM APIs charge separately for input tokens (your prompts) and output tokens (model responses). Output tokens are usually 2-5x more expensive than input tokens.
Context Windows
The context window determines how much text a model can process at once. Larger context windows allow for longer conversations and document analysis.
Open vs Proprietary Models
Open-source models (Llama, Mistral) are often cheaper but may require more tuning. Proprietary models (GPT-4, Claude) typically offer better out-of-box performance.
Batch Discounts
Many providers offer 50% discounts for batch/async API usage. Consider batch APIs for non-time-sensitive workloads to reduce costs.