MiniMax M2.5
MiniMax M2.5 is MiniMax's flagship text generation model with a 128K token context window and tool calling capabilities.
API Pricing
Cheapest on OpenRouter — 51% below avg| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.118 | $0.990 | 70.7 t/s | 2.0s | 4/14/2026 | |
| $0.300 | $1.20 | 70.7 t/s | 2.0s | 4/14/2026 | |
| $0.300 | $1.20 | 70.7 t/s | 2.0s | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- MiniMax
- Family
- MiniMax
- Tier
- Flagship
- Context Window
- 128K
- Modalities
- Text
Capabilities
- Tool Calling
- Yes
- Open Source
- No
- Subtypes
- Chat Completion
- Aliases
- minimax-m2-5, minimax-m2-family
Strengths & Limitations
- 128,000 token context window for processing lengthy documents
- Tool calling support enables API integrations and structured outputs
- 58.52 tokens per second generation speed for responsive interactions
- Flagship tier positioning within MiniMax's model family
- Specialized capabilities for Chinese language processing
- Chat completion interface for conversational applications
- Developed by MiniMax with focus on regional market needs
- Text-only modality without image or multimodal input support
- Proprietary model with no open source weights available
- Limited global availability compared to major cloud providers
- Smaller context window than some competing flagship models
- Time to first token of 1,386ms may impact real-time applications
Key Features
About MiniMax M2.5
Common Use Cases
MiniMax M2.5 suits organizations requiring flagship-tier language processing with particular emphasis on Chinese markets and applications. The 128K context window makes it appropriate for document analysis, lengthy conversational sessions, and complex reasoning tasks that require maintaining context across extended interactions. Tool calling capabilities enable integration into business workflows, customer service automation, and agentic applications that need to interact with external systems. The model's regional optimization makes it valuable for companies operating in Chinese-speaking markets or requiring specialized understanding of regional language nuances and cultural context.
Frequently Asked Questions
How much does MiniMax M2.5 cost per million tokens?
MiniMax M2.5 pricing varies by provider and region. Check the pricing table above for current rates across all available providers offering this model.
What is MiniMax M2.5 best used for?
MiniMax M2.5 is best suited for complex conversational AI applications, document processing requiring its 128K context window, and business workflows that leverage its tool calling capabilities. It's particularly valuable for organizations operating in Chinese-speaking markets or requiring regional language expertise.
Does MiniMax M2.5 support multimodal inputs like images?
No, MiniMax M2.5 is a text-only model that supports chat completion interfaces but does not process images or other multimedia inputs. It focuses exclusively on text generation and understanding tasks.