Skip to main content

LLM Pricing

Compare per-token rates across 18 providers and 214 LLM models

Updated daily

Top Picks Right Now

Click to filter table
Inputs
View
Showing 1 of 1 chat models · 5 records — filteredMiniMax M2.5

Understanding LLM Pricing

Input vs Output Tokens

LLM APIs charge separately for input tokens (your prompts) and output tokens (model responses). Output tokens are usually 2-5x more expensive than input tokens.

Context Windows

The context window determines how much text a model can process at once. Larger context windows allow for longer conversations and document analysis.

Open vs Proprietary Models

Open-source models (Llama, Mistral) are often cheaper but may require more tuning. Proprietary models (GPT-4, Claude) typically offer better out-of-box performance.

Batch Discounts

Many providers offer 50% discounts for batch/async API usage. Consider batch APIs for non-time-sensitive workloads to reduce costs.

LLM API Pricing Comparison: Compare 18 Providers | ComputePrices.com