Qwen 3.5 122B
Qwen 3.5 122B is Alibaba's flagship multimodal model supporting text, image, and video inputs with a 262K token context window.
API Pricing
| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.260 | $1.56 | 151 t/s | 1.1s | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- Alibaba
- Family
- Qwen
- Tier
- Flagship
- Context Window
- 262K
- Modalities
- Text, Image, Video
Capabilities
- Tool Calling
- No
- Open Source
- No
- Aliases
- qwen3-5-plus-02-15
Strengths & Limitations
- Supports text, image, and video input modalities
- Large 262,144 token context window for extensive content processing
- Output speed of 139.78 tokens per second for responsive applications
- 122-billion parameter scale for complex reasoning tasks
- Native multimodal processing without separate model calls
- Flagship tier positioning with comprehensive capabilities
- Video understanding capability beyond static image analysis
- No function calling or tool use capabilities
- Proprietary model with no open source weights available
- Time to first token of 1,024ms may impact real-time applications
- Limited to inference API access only
Key Features
About Qwen 3.5 122B
Common Use Cases
Qwen 3.5 122B is designed for complex multimodal applications requiring analysis of text, images, and video content within a single workflow. Its large context window makes it suitable for document analysis combined with visual elements, content moderation across multiple media types, educational applications involving multimedia materials, and research tasks requiring comprehensive understanding of mixed content formats. The model's flagship positioning and video capabilities make it appropriate for media analysis, content creation workflows, and enterprise applications where multimodal understanding is essential for business processes.
Frequently Asked Questions
How much does Qwen 3.5 122B cost per million tokens?
Qwen 3.5 122B pricing varies by provider and may include different rates for text and image tokens. Check the pricing table above for current rates across all available providers.
What is Qwen 3.5 122B best used for?
Qwen 3.5 122B excels at multimodal tasks involving text, image, and video analysis. Its large context window and video understanding capabilities make it well-suited for content analysis, document processing with visual elements, educational applications, and enterprise workflows requiring comprehensive multimedia understanding.
Does Qwen 3.5 122B support function calling?
No, Qwen 3.5 122B does not support function calling or tool use capabilities. It focuses on multimodal understanding and generation tasks rather than agentic workflows that require external tool integration.