Reka Flash 3
Reka Flash 3 is Reka's lightweight text model designed for fast inference, featuring a 65K token context window and optimized for speed-focused applications.
API Pricing
| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.100 | $0.200 | 87.0 t/s | 1.2s | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- Reka
- Family
- Reka
- Tier
- Lightweight
- Context Window
- 66K
- Modalities
- Text
Capabilities
- Tool Calling
- No
- Open Source
- No
Strengths & Limitations
- High throughput at 85.37 output tokens per second for fast text generation
- 65,536 token context window allows processing of substantial documents
- Lightweight architecture optimized for speed and efficiency
- Reasonable time to first token at 1,268ms for interactive applications
- Text-focused design without complexity of multimodal processing
- Suitable for high-volume batch processing scenarios
- No tool calling or function execution capabilities
- Text-only input - no support for images or other modalities
- Proprietary model with no open source weights available
- Lightweight tier limits advanced reasoning compared to flagship models
- Longer time to first token compared to some speed-optimized competitors
Key Features
About Reka Flash 3
Common Use Cases
Reka Flash 3 is designed for applications requiring fast, cost-effective text processing at scale. Its lightweight architecture and high throughput make it well-suited for content generation, document summarization, text classification, and customer service chatbots where speed matters more than complex reasoning. The 65K context window enables processing of longer documents while maintaining efficiency. Organizations running high-volume text processing workloads, automated content workflows, or real-time chat applications can benefit from its speed-optimized design, though users needing advanced capabilities like tool use or multimodal input should consider higher-tier alternatives.
Frequently Asked Questions
How much does Reka Flash 3 cost per million tokens?
Reka Flash 3 pricing varies by provider and usage patterns. Check the pricing table above for current rates across all available providers offering this model.
What is Reka Flash 3 best used for?
Reka Flash 3 excels at high-volume text processing tasks requiring fast response times, including content generation, document processing, text classification, and chatbot applications where speed and cost efficiency are prioritized over advanced reasoning capabilities.
Does Reka Flash 3 support tool calling or function execution?
No, Reka Flash 3 does not support tool calling or function execution. It focuses exclusively on text generation tasks. Users needing tool integration should consider higher-tier models in the Reka family or other providers offering function calling capabilities.