LightweightxAI

Grok 4 Fast

Grok 4 Fast is xAI's lightweight multimodal model with a 2M token context window, optimized for high-speed text and image processing tasks.

Context 2.0M
Tier Lightweight
Modalities text, image
Input from
$0.200 / 1M tokens
across 1 provider

API Pricing

ProviderInput / 1MOutput / 1MSpeedTTFTUpdated
$0.200$1.50167 t/s3.6s4/14/2026

Prices updated daily. Last check: 4/14/2026

Model Details

General

Creator
xAI
Family
Grok
Tier
Lightweight
Context Window
2.0M
Modalities
Text, Image

Capabilities

Tool Calling
No
Open Source
No
Aliases
grok-code-fast-1, grok-4-1-fast

Strengths & Limitations

  • 2 million token context window enables processing of very long documents
  • High output speed at 148.78 tokens per second for rapid content generation
  • Multimodal support for both text and image inputs
  • Fast initial response with 3.9 second time to first token
  • Lightweight tier positioning for cost-effective deployment
  • Large context capacity suitable for extended conversations
  • Part of xAI's latest fourth-generation model architecture
  • No tool calling or function execution capabilities
  • Proprietary model with no open source weights available
  • Lightweight tier may have reduced reasoning capabilities compared to flagship models
  • Limited to text and image modalities without audio or video support

Key Features

2 million token context window
Text and image input processing
High-speed token generation at 148.78 tokens/second
Fast time to first token (3.9 seconds)
Streaming response support
Multi-provider API availability
Batch processing capabilities
JSON structured output mode

About Grok 4 Fast

Grok 4 Fast is xAI's lightweight tier model in the Grok family, positioned for applications requiring fast response times rather than maximum reasoning capability. Created by xAI, it represents the speed-optimized variant of their fourth-generation language model architecture. The model supports both text and image inputs with an exceptionally large 2 million token context window, allowing it to process extensive documents or maintain long conversations while delivering outputs at 148.78 tokens per second. With a time to first token of 3.9 seconds, it provides rapid initial response generation suitable for interactive applications. However, it does not include tool calling capabilities, focusing instead on pure text and image understanding tasks. Grok 4 Fast is designed for workloads where processing speed and large context handling are more critical than advanced reasoning or agent capabilities. Its multimodal support and massive context window make it suitable for document analysis, content summarization, and conversational applications that need to maintain context over extended interactions.

Common Use Cases

Grok 4 Fast is well-suited for applications requiring fast processing of large documents or images where speed takes priority over complex reasoning. Its 2 million token context window makes it ideal for document summarization, content analysis, and research assistance involving lengthy texts. The multimodal capabilities enable rapid image analysis and description tasks. Its lightweight tier positioning makes it cost-effective for high-volume applications like customer support, content moderation, or real-time chat applications where quick responses are essential. The large context window also supports extended conversational AI applications that need to maintain context over long interactions without losing coherence.

Frequently Asked Questions

How much does Grok 4 Fast cost per million tokens?

Grok 4 Fast pricing varies by provider and pricing type (standard vs batch). Check the pricing table above for current rates across all providers.

What is Grok 4 Fast best used for?

Grok 4 Fast excels at high-speed processing of large documents and images where rapid response times are crucial. Its 2 million token context window and 148.78 tokens/second output speed make it ideal for document analysis, content summarization, conversational AI, and image analysis tasks that prioritize speed over complex reasoning.

Does Grok 4 Fast support tool calling and function execution?

No, Grok 4 Fast does not include tool calling or function execution capabilities. It focuses on pure text and image understanding tasks with optimized speed performance. For agent-like capabilities requiring tool use, consider other models in the Grok family or alternative providers.