Devstral Small
Devstral Small is Mistral's lightweight coding model optimized for fast code generation and completion tasks, with a 128K token context window.
API Pricing
| Provider | Input / 1M | Output / 1M | Updated |
|---|---|---|---|
| | $0.100 | $0.300 | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
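The per-million rates in the table can be turned into a per-request estimate with simple arithmetic. The sketch below is illustrative only: it hard-codes the rates listed above, and the function name and the example token counts are hypothetical.

```python
# Rates copied from the pricing table above (USD per 1M tokens, as of 4/14/2026).
INPUT_USD_PER_MILLION = 0.10
OUTPUT_USD_PER_MILLION = 0.30


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Hypothetical request: 20,000 prompt tokens and 1,500 completion tokens.
print(f"${estimate_cost_usd(20_000, 1_500):.6f}")  # prints $0.002450
```

Because output tokens cost three times as much as input tokens at these rates, long completions affect the total more than long prompts of the same size.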
Model Details
General
- Creator: Mistral
- Family: Devstral
- Tier: Lightweight
- Context Window: 128K (131,072 tokens)
- Modalities: Text
Capabilities
- Tool Calling: No
- Open Source: No
Strengths & Limitations
Strengths
- Fast token generation at approximately 206 tokens per second
- Quick response initiation with 393ms time to first token
- Large 128K token context window for substantial code analysis
- Optimized specifically for coding and development tasks
- Lightweight architecture reduces computational requirements
- Suitable for high-frequency coding assistance workflows
Limitations
- No tool calling or function execution capabilities
- Proprietary model with no open source availability
- Text-only modality without image or multimodal support
- Lightweight tier may limit complex reasoning capabilities
- Smaller model size compared to flagship coding models
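The two speed figures in the list can be combined into a rough end-to-end response time: time to first token plus the output length divided by throughput. This is a simplified sketch that assumes throughput stays constant and that time to first token does not depend on prompt length; the function name and the sample sizes are hypothetical.

```python
# Figures taken from the list above; both are approximate measurements.
TIME_TO_FIRST_TOKEN_S = 0.393  # 393 ms
TOKENS_PER_SECOND = 206.0


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate total response time under a constant-throughput assumption."""
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


# Compare a short completion, a function-sized answer, and a long file.
for n in (50, 500, 2000):
    print(f"{n:>5} tokens: ~{estimate_response_seconds(n):.2f} s")
```

For short completions the fixed time to first token dominates, while for long outputs the generation rate determines most of the wait.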
About Devstral Small
Common Use Cases
Devstral Small suits developers who need efficient code completion, debugging assistance, and code generation for routine programming tasks. Its fast generation speed and 128K context window make it effective for analyzing substantial codebases, providing real-time coding suggestions in IDEs, and handling repetitive development workflows. Because it is lightweight, it is also practical where API calls are frequent or cost efficiency matters: code completion plugins, automated code review assistance, and educational coding platforms that value quick feedback over solving the most complex algorithmic challenges.
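Before sending a large codebase in one request, it helps to check roughly whether it fits in the context window. The sketch below assumes the 128K window means 131,072 tokens and uses a crude four-characters-per-token heuristic rather than the model's actual tokenizer; all names and the reserved output budget are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 128 * 1024  # 131,072 tokens (assumed reading of "128K")
CHARS_PER_TOKEN = 4                 # rough heuristic, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Approximate the token count of text, rounding up."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_context(texts: list[str], reserved_for_output: int = 4096) -> bool:
    """Check whether the texts plus an output budget fit in the window."""
    total = sum(estimate_tokens(t) for t in texts)
    return total + reserved_for_output <= CONTEXT_WINDOW_TOKENS


# Stand-ins for source files: about 400,000 and 600,000 characters.
print(fits_in_context(["x" * 400_000]))  # True
print(fits_in_context(["x" * 600_000]))  # False
```

Real token counts for source code can differ noticeably from this heuristic, so a production check would count tokens with the provider's tokenizer instead.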
Frequently Asked Questions
How much does Devstral Small cost per million tokens?
As of 4/14/2026, the pricing table above lists $0.100 per 1M input tokens and $0.300 per 1M output tokens. Pricing can vary by provider and pricing type (standard vs. batch), so check the table for current rates.
What is Devstral Small best used for?
Devstral Small is best used for code completion, routine code generation, and debugging assistance where speed matters. Its generation rate of approximately 206 tokens per second and its 128K context window make it a good fit for IDE integrations, real-time coding suggestions, and efficient analysis of substantial codebases.
How does Devstral Small compare to larger coding models?
Devstral Small prioritizes speed and efficiency over maximum capability. Larger models may handle more complex algorithmic challenges, but Devstral Small's 393ms time to first token and throughput of approximately 206 tokens per second make it better suited to interactive coding workflows and high-frequency assistance tasks.