Devstral 2 123B
Devstral 2 123B is Mistral's flagship code-specialized model with 123 billion parameters, offering advanced code generation and chat capabilities with a 128K token context window.
API Pricing
| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.400 | $2.00 | 74.9 t/s | 752ms | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- Mistral
- Family
- Devstral
- Tier
- Flagship
- Context Window
- 128K
- Modalities
- Text
Capabilities
- Tool Calling
- Yes
- Open Source
- No
- Subtypes
- Chat Completion, Code Generation
- Aliases
- devstral-2-123b-instruct-2512
Strengths & Limitations
- 123 billion parameters provide substantial model capacity for complex coding tasks
- Tool calling support enables integration with development environments and APIs
- 128K token context window accommodates large codebases and extended conversations
- Generates 76.04 tokens per second for responsive code generation
- Time to first token of 423ms provides quick response initiation
- Specialized training for code generation and technical tasks
- Flagship tier positioning within Mistral's Devstral family
- Proprietary model with no open source weights available
- Limited to text-only interactions without image or multimodal support
- Smaller context window compared to some competing frontier models
- Higher computational requirements due to 123B parameter size
- No batch processing capabilities mentioned in available specifications
Key Features
About Devstral 2 123B
Common Use Cases
Devstral 2 123B is designed for professional software development workflows requiring sophisticated code generation and technical problem-solving. Its 123 billion parameters and code specialization make it suitable for complex programming tasks like architectural planning, code review and optimization, debugging across large codebases, and generating production-quality code in multiple programming languages. The 128K context window supports analyzing substantial code repositories, while tool calling capabilities enable integration with IDEs, version control systems, and development APIs. Organizations use this flagship model for technical documentation generation, code migration projects, and building AI-powered developer tools where accuracy and technical depth are essential.
Frequently Asked Questions
How much does Devstral 2 123B cost per million tokens?
Devstral 2 123B pricing varies by provider and may include different rates for input and output tokens. Check the pricing table above for current rates across all available providers offering this model.
What is Devstral 2 123B best used for?
Devstral 2 123B excels at complex code generation, technical problem-solving, and professional development workflows. Its 123B parameters and code specialization make it ideal for architectural planning, multi-language code generation, large codebase analysis, and building sophisticated developer tools that require high technical accuracy.
How does Devstral 2 123B compare to other coding models?
Devstral 2 123B offers 123 billion parameters specifically trained for coding tasks, with tool calling support and a 128K context window. It generates 76.04 tokens per second with 423ms time to first token, providing a balance of model capability and inference speed for professional development use cases.