Phi-4
Phi-4 is Microsoft's lightweight language model designed for efficient text generation with a 16K token context window.
API Pricing
Cheapest on OpenRouter — 4% below avg| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.065 | $0.140 | 16.7 t/s | 359ms | 4/14/2026 | |
| $0.070 | $0.140 | 16.7 t/s | 359ms | 4/4/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- Microsoft
- Family
- Phi
- Tier
- Lightweight
- Context Window
- 16K
- Modalities
- Text
Capabilities
- Tool Calling
- No
- Open Source
- No
Strengths & Limitations
- Fast inference speed at 17.24 output tokens per second
- Quick response initiation with 354ms time to first token
- Lightweight architecture for efficient deployment
- 16K token context window supports moderate-length conversations
- Part of Microsoft's established Phi model family
- Optimized for text generation tasks
- No tool calling or function execution support
- Text-only modality - no image or multimodal input
- Proprietary model with no open source access
- Smaller context window than flagship models
- Limited to lightweight tier capabilities
Key Features
About Phi-4
Common Use Cases
Phi-4 is suited for applications requiring efficient text processing where speed and resource efficiency are priorities over maximum capability. This includes chatbots with moderate complexity requirements, content generation for blogs or marketing copy, text summarization of shorter documents, and educational applications where quick responses enhance user experience. The model's lightweight nature makes it appropriate for scenarios with high-volume requests or resource-constrained environments where deploying larger flagship models would be impractical or costly.
Frequently Asked Questions
How much does Phi-4 cost per million tokens?
Phi-4 pricing varies by provider and pricing type. Check the pricing table above for current rates across all providers offering this model.
What is Phi-4 best used for?
Phi-4 excels at efficient text generation tasks including chatbots, content creation, and text summarization where fast response times and resource efficiency are important. Its lightweight design makes it suitable for high-volume applications or environments with computational constraints.
Does Phi-4 support tool calling or multimodal input?
No, Phi-4 is a text-only model that does not support tool calling, function execution, or multimodal inputs like images. It is focused on efficient text processing and generation tasks.