Qwen 3 235B
Qwen 3 235B is Alibaba's flagship open-source language model with 235 billion parameters, featuring a 128K token context window and strong performance in coding and reasoning tasks.
API Pricing
Cheapest on Deep Infra — 80% below avg| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.071 | $0.100 | 69.9 t/s | 1.1s | 4/4/2026 | |
| $0.227 | $0.906 | 69.9 t/s | 1.1s | 4/13/2026 | |
| $0.455 | $1.82 | 69.9 t/s | 1.1s | 4/14/2026 | |
| $0.650 | $3.00 | 69.9 t/s | 1.1s | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- Alibaba
- Family
- Qwen
- Tier
- Flagship
- Context Window
- 128K
- Modalities
- Text
Capabilities
- Tool Calling
- Yes
- Open Source
- Yes
- Subtypes
- Chat Completion, Code Generation
- Aliases
- qwen3-235b-a22b, qwen3-235b-a22b-instruct-2507, qwen3-235b-a22b-thinking-2507, qwen3-235b-a22b-2507
Strengths & Limitations
- Open-source model with full weights available for download and customization
- Large 235 billion parameter count for complex reasoning and generation tasks
- 128K token context window supports long-document processing
- Tool calling functionality with structured output support
- Strong code generation capabilities across multiple programming languages
- Benchmark output speed of 65.7 tokens per second for responsive interactions
- Multilingual support extending beyond English for global applications
- Significant computational requirements due to 235B parameter size
- Text-only modality without image or audio input support
- Time to first token of 1,037ms slower than smaller models
- Smaller context window compared to some competing flagship models
- Inference costs scale with model size for cloud deployment
Key Features
About Qwen 3 235B
Common Use Cases
Qwen 3 235B is designed for demanding applications that require maximum language model capability, including complex reasoning tasks, advanced code generation and debugging, long-document analysis within its 128K context window, and sophisticated chatbot implementations. Its open-source nature makes it particularly suitable for organizations requiring on-premises deployment, model customization, or fine-tuning for specialized domains. The model's substantial size makes it appropriate for use cases where response quality is prioritized over inference speed, such as research applications, content creation workflows, and enterprise assistant implementations that benefit from its tool calling capabilities.
Frequently Asked Questions
How much does Qwen 3 235B cost per million tokens?
Qwen 3 235B pricing varies by provider and may include both hosted API access and self-hosting options since it's open-source. Check the pricing table above for current rates across all providers offering this model.
What is Qwen 3 235B best used for?
Qwen 3 235B excels at complex reasoning tasks, advanced code generation and debugging, long-document processing up to 128K tokens, and applications requiring tool calling capabilities. Its 235B parameter size makes it suitable for demanding use cases where response quality is more important than speed.
Can I run Qwen 3 235B on my own infrastructure?
Yes, Qwen 3 235B is open-source with downloadable model weights, allowing self-hosting and customization. However, the 235 billion parameter size requires significant computational resources including multiple high-end GPUs and substantial memory for efficient inference.