
MiniMax 01

MiniMax 01 is MiniMax's flagship multimodal model with text and image capabilities, featuring a massive 1M+ token context window for processing extensive documents and conversations.

Context 1.0M
Tier Flagship
Modalities text, image
Input from $0.200 / 1M tokens, across 1 provider

API Pricing

Provider     Input / 1M   Output / 1M   Updated
(unnamed)    $0.200       $1.10         4/14/2026

Prices updated daily. Last check: 4/14/2026
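
To make the per-million rates concrete, here is a minimal sketch of a per-request cost estimate. It takes the listed rates as $0.200 per 1M input tokens and $1.10 per 1M output tokens as of 4/14/2026; the function and constant names are illustrative and not part of any MiniMax or provider API.

```python
from decimal import Decimal

# Rates taken from the pricing table on this page (USD per 1M tokens,
# last checked 4/14/2026). Update them if the listed prices change.
INPUT_USD_PER_MILLION = Decimal("0.200")
OUTPUT_USD_PER_MILLION = Decimal("1.10")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: a 500,000-token prompt with a 2,000-token reply.
    print(estimate_cost_usd(500_000, 2_000))
```

At these rates, a prompt that fills the whole 1,000,192-token window costs roughly $0.20 in input tokens, so output length tends to dominate cost only for generation-heavy workloads.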

Model Details

General

Creator: MiniMax
Family: MiniMax
Tier: Flagship
Context Window: 1.0M tokens
Modalities: Text, Image

Capabilities

Tool Calling: No
Open Source: No

Strengths & Limitations

Strengths

  • 1,000,192-token context window for processing very long documents
  • Multimodal input: accepts both text and images
  • Flagship-tier model designed for complex reasoning tasks
  • Context window large enough to retain extensive conversation history
  • Vision capabilities for document analysis and image understanding

Limitations

  • No tool calling or function calling
  • Proprietary model with no open-source release
  • No audio or video input; modalities are limited to text and image
  • Newer entrant with a shorter track record than longer-established providers

Key Features

1,000,192 token context window
Text input and generation
Image input processing
Vision-language understanding
Long document processing
Multimodal conversation capabilities
Extended context retention

About MiniMax 01

MiniMax 01 is MiniMax's flagship model and the most advanced offering in its model family, positioned against the high-capability models of other major providers. It accepts both text and image inputs and has a context window of 1,000,192 tokens, large enough for very long documents, extended conversations, and large multimodal inputs.

The model handles vision tasks alongside text generation, so it suits applications that need to interpret textual and visual information together. It targets enterprise and developer use cases that call for reasoning across both modalities and for maintaining context over very long interactions. It does not support tool calling; its strength is direct text and image processing where extensive context is essential.

Common Use Cases

MiniMax 01 is well suited to applications that need extensive context retention and multimodal processing. Its context window fits long document analysis, legal document review, research tasks involving large datasets, and extended technical consultations. The vision capabilities support document OCR and analysis, image-based question answering, and multimodal content creation workflows. Organizations that process lengthy transcripts, analyze large codebases, or maintain context across very long conversations benefit most from the model's context capacity. Applications that require tool integration or function calling must implement those capabilities externally.
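
Before sending a large document set, it can help to check whether it is likely to fit in the context window. The sketch below is only an approximation: it assumes about 4 characters per token, which is a common rule of thumb and not MiniMax's actual tokenizer, and it assumes that space reserved for the reply counts against the same 1,000,192-token window. All names here are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 1_000_192   # figure listed on this page
ASSUMED_CHARS_PER_TOKEN = 4         # rough heuristic, NOT the model's real tokenizer


def estimate_tokens(text: str) -> int:
    """Rough token estimate: ceiling of len(text) / ASSUMED_CHARS_PER_TOKEN."""
    return -(-len(text) // ASSUMED_CHARS_PER_TOKEN)


def fits_in_context(documents: list[str], reserved_output_tokens: int = 4_096) -> bool:
    """Return True if all documents together fit the estimated input budget."""
    budget = CONTEXT_WINDOW_TOKENS - reserved_output_tokens
    return sum(estimate_tokens(doc) for doc in documents) <= budget


def pack_into_batches(documents: list[str],
                      reserved_output_tokens: int = 4_096) -> list[list[str]]:
    """Greedily group documents, in order, into batches that each fit the budget."""
    budget = CONTEXT_WINDOW_TOKENS - reserved_output_tokens
    batches: list[list[str]] = []
    current: list[str] = []
    used = 0
    for doc in documents:
        need = estimate_tokens(doc)
        if need > budget:
            raise ValueError("a single document exceeds the context budget")
        if current and used + need > budget:
            # Current batch is full: close it and start a new one.
            batches.append(current)
            current, used = [], 0
        current.append(doc)
        used += need
    if current:
        batches.append(current)
    return batches
```

For production use, replace the character heuristic with token counts reported by the provider, since real counts vary by language and content and image inputs are not covered by this estimate.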

Frequently Asked Questions

How much does MiniMax 01 cost per million tokens?

As of the last price check on 4/14/2026, the one listed provider charges $0.200 per 1M input tokens and $1.10 per 1M output tokens. Prices are refreshed daily, so check the pricing table above for current rates.

What is MiniMax 01 best used for?

MiniMax 01 excels at tasks requiring extensive context retention and multimodal processing, such as long document analysis, research across large datasets, extended technical conversations, and vision-language tasks involving both text and images.

Does MiniMax 01 support tool calling or function calling?

No, MiniMax 01 does not have built-in tool calling capabilities. Applications requiring function calling or API integration would need to implement these features through external orchestration layers.
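
One way an external orchestration layer might look is sketched below. It assumes the application prompts the model to reply with a JSON object such as {"tool": "...", "args": {...}} when it wants a tool run, and with plain text otherwise. That convention, the `call_model` callable, and the example tool are all hypothetical; nothing here is a MiniMax or provider API.

```python
import json
from typing import Any, Callable, Dict, Optional, Tuple


def get_word_count(text: str) -> int:
    """Example local tool (hypothetical): count whitespace-separated words."""
    return len(text.split())


# Registry of tools the application is willing to run on the model's behalf.
TOOLS: Dict[str, Callable[..., Any]] = {"get_word_count": get_word_count}


def parse_tool_request(model_output: str) -> Optional[Tuple[str, Dict[str, Any]]]:
    """Return (tool_name, args) if the output is a valid JSON tool request, else None."""
    try:
        payload = json.loads(model_output)
    except json.JSONDecodeError:
        return None
    if not isinstance(payload, dict):
        return None
    name = payload.get("tool")
    args = payload.get("args", {})
    if not isinstance(name, str) or name not in TOOLS or not isinstance(args, dict):
        return None
    return name, args


def run_with_tools(call_model: Callable[[str], str], prompt: str, max_steps: int = 3) -> str:
    """Ask the model, run any tool it requests, and feed the result back as text."""
    transcript = prompt
    for _ in range(max_steps):
        output = call_model(transcript)
        request = parse_tool_request(output)
        if request is None:
            return output  # plain text: treat it as the final answer
        name, args = request
        result = TOOLS[name](**args)
        transcript += f"\n[tool {name} returned: {json.dumps(result)}]"
    raise RuntimeError("model kept requesting tools past max_steps")
```

Here `call_model` stands in for whatever client function sends text to the model and returns its reply. Because the model has no native tool-calling support, the JSON convention depends entirely on prompting, so a real implementation would also need to validate arguments and handle malformed replies.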