FlagshipMiniMax

MiniMax 01

Name: MiniMax 01
Availability: InStock
Author: MiniMax

MiniMax 01 is MiniMax's flagship multimodal model with text and image capabilities, featuring a massive 1M+ token context window for processing extensive documents and conversations.

Context 1.0M

Tier Flagship

Modalities text, image

Input from

$0.200 / 1M tokens

across 1 provider

Compare Prices

API Pricing

Provider	Input / 1M	Output / 1M	Updated
OpenRouter	$0.200	$1.10	7/13/2026

Prices updated daily. Last check: Jul 13, 2026

Model Details

General

Creator: MiniMax
Family: MiniMax
Tier: Flagship
Context Window: 1.0M
Modalities: Text, Image

Capabilities

Tool Calling: No
Open Source: No

Strengths & Limitations

Strengths

Massive 1,000,192 token context window enables processing of very long documents
Multimodal capabilities support both text and image inputs
Flagship-tier model designed for complex reasoning tasks
Large context window allows for extensive conversation history retention
Vision capabilities enable document analysis and image understanding tasks

Limitations

No tool calling or function calling capabilities
Proprietary model with no open-source availability
Limited modality support compared to models with audio or video inputs
Newer entrant with less established track record than established providers

Key Features

•1,000,192 token context window

•Text input and generation

•Image input processing

•Vision-language understanding

•Long document processing

•Multimodal conversation capabilities

•Extended context retention

About MiniMax 01

MiniMax 01 is the flagship model from MiniMax, representing the company's most advanced offering in their model family. As a tier-one model, it positions MiniMax as a competitor in the high-capability AI space alongside other major model providers. The model supports both text and image inputs with an exceptionally large context window of over 1 million tokens (1,000,192 tokens). This extensive context capacity enables processing of very long documents, extended conversations, and large-scale multimodal inputs. The model handles vision tasks alongside text generation, making it suitable for applications requiring understanding of both textual and visual information. MiniMax 01 targets enterprise and developer use cases that require sophisticated reasoning across multiple modalities and the ability to maintain context over very long interactions. While it lacks tool calling capabilities, its strength lies in direct text and image processing tasks where maintaining extensive context is crucial.

Common Use Cases

MiniMax 01 is well-suited for applications requiring extensive context retention and multimodal processing. Its massive context window makes it ideal for long document analysis, legal document review, research tasks involving large datasets, and extended technical consultations. The vision capabilities enable document OCR and analysis, image-based question answering, and multimodal content creation workflows. Organizations needing to process lengthy transcripts, analyze large codebases, or maintain context across very long conversations would benefit from this model's context capacity. However, applications requiring tool integration or function calling would need to implement those capabilities externally.

Frequently Asked Questions

How much does MiniMax 01 cost per million tokens?

MiniMax 01 pricing varies by provider and usage patterns. Check the pricing table above for current rates across all available providers offering this model.

What is MiniMax 01 best used for?

MiniMax 01 excels at tasks requiring extensive context retention and multimodal processing, such as long document analysis, research across large datasets, extended technical conversations, and vision-language tasks involving both text and images.

Does MiniMax 01 support tool calling or function calling?

No, MiniMax 01 does not have built-in tool calling capabilities. Applications requiring function calling or API integration would need to implement these features through external orchestration layers.