Lightweight · xAI

Grok 4 Fast

Name: Grok 4 Fast
Author: xAI

Grok 4 Fast is xAI's lightweight multimodal model with a 2M token context window, optimized for high-speed text and image processing tasks.

Context: 2.0M

Tier: Lightweight

Modalities: text, image

Contact providers for pricing

API Pricing

No pricing data available for this model at the moment.

Prices updated daily. Last check: Jul 13, 2026

Performance & Benchmarks

Source: Artificial Analysis

Intelligence

21.6 / 100

Math

43.3 / 100

Reasoning & Knowledge

MMLU-Pro: 79.3%
GPQA Diamond: 72.7%
Humanity's Last Exam: 7.5%

Coding

LiveCodeBench: 65.7%
SciCode: 36.2%

Math

AIME 2025: 43.3%

Agentic & Tool Use

Terminal-Bench Hard: 17.4%
τ²-bench: 75.7%

Instruction & Long Context

IFBench: 41.4%
Long-Context Reasoning: 48.3%

Benchmarks measured Jul 2026. Scores are independent evaluations, not vendor-reported.

Model Details

General

Creator: xAI
Family: Grok
Tier: Lightweight
Context Window: 2.0M
Modalities: Text, Image

Capabilities

Tool Calling: No
Open Source: No
Aliases: grok-code-fast-1, grok-4-1-fast

Strengths & Limitations

Strengths

2 million token context window enables processing of very long documents
High output speed at 148.78 tokens per second for rapid content generation
Multimodal support for both text and image inputs
Fast initial response with a 3.9-second time to first token
Lightweight tier positioning for cost-effective deployment
Large context capacity suitable for extended conversations
Part of xAI's latest fourth-generation model architecture

Limitations

No tool calling or function execution capabilities
Proprietary model with no open source weights available
Lightweight tier may have reduced reasoning capabilities compared to flagship models
Limited to text and image modalities without audio or video support

Key Features

• 2 million token context window
• Text and image input processing
• High-speed token generation at 148.78 tokens/second
• Fast time to first token (3.9 seconds)
• Streaming response support
• Multi-provider API availability
• Batch processing capabilities
• JSON structured output mode

About Grok 4 Fast

Grok 4 Fast is the lightweight-tier model in xAI's Grok family, positioned for applications that need fast responses rather than maximum reasoning capability. It is the speed-optimized variant of xAI's fourth-generation language model architecture. The model accepts both text and image inputs and has a 2 million token context window, so it can process extensive documents or maintain long conversations while generating output at 148.78 tokens per second. With a time to first token of 3.9 seconds, it is positioned for interactive applications.

Grok 4 Fast does not include tool calling; it focuses on text and image understanding tasks. It is designed for workloads where processing speed and large-context handling matter more than advanced reasoning or agent capabilities. Its multimodal support and large context window make it suitable for document analysis, content summarization, and conversational applications that need to maintain context over extended interactions.
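To make the speed figures concrete, the following is a minimal sketch of a response-time estimate built from the two numbers quoted on this page (148.78 tokens per second and a 3.9 second time to first token). The function name `estimate_response_seconds` is hypothetical, and the model of "first-token latency plus steady generation" is a simplifying assumption; real latency varies by provider, load, and prompt size.

```python
# Figures quoted on this page; actual values vary by provider and load.
TOKENS_PER_SECOND = 148.78
TIME_TO_FIRST_TOKEN_S = 3.9


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough estimate: first-token latency plus steady-state generation time.

    ASSUMPTION: generation speed is constant after the first token.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Print estimates for a few illustrative response lengths.
    for n in (100, 500, 2000):
        print(f"{n} output tokens: ~{estimate_response_seconds(n):.1f} s")
```

Under these assumptions, a 500-token reply would take roughly 7.3 seconds, with more than half of that spent waiting for the first token. For short replies the time to first token dominates, while for long outputs the generation rate matters more.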

Common Use Cases

Grok 4 Fast is well-suited for applications requiring fast processing of large documents or images where speed takes priority over complex reasoning. Its 2 million token context window makes it ideal for document summarization, content analysis, and research assistance involving lengthy texts. The multimodal capabilities enable rapid image analysis and description tasks. Its lightweight tier positioning makes it cost-effective for high-volume applications like customer support, content moderation, or real-time chat applications where quick responses are essential. The large context window also supports extended conversational AI applications that need to maintain context over long interactions without losing coherence.
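Before sending a very long document, it can help to check roughly whether it fits in the 2 million token window. The sketch below is illustrative only: the 4-characters-per-token ratio is a common rule of thumb for English text, not a property of Grok's tokenizer, and the helper names and the reserved output budget are hypothetical. It also assumes that output tokens count against the same window. An exact count would require the provider's own tokenizer.

```python
import math

CONTEXT_WINDOW_TOKENS = 2_000_000  # context size quoted on this page
CHARS_PER_TOKEN = 4.0  # ASSUMPTION: rough English-text average, not a Grok tokenizer figure


def estimate_tokens(text: str) -> int:
    """Approximate token count from character length (heuristic only)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_in_context(text: str, reserved_output_tokens: int = 4_000) -> bool:
    """Check whether the estimated prompt size plus an output budget fits the window.

    ASSUMPTION: prompt and output tokens share one context window.
    """
    return estimate_tokens(text) + reserved_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    sample = "word " * 100_000  # about 500,000 characters
    print(estimate_tokens(sample), fits_in_context(sample))
```

Because the estimate is approximate, it is safer to leave a margin and treat a borderline result as "does not fit".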

Frequently Asked Questions

How much does Grok 4 Fast cost per million tokens?

Grok 4 Fast pricing varies by provider and pricing type (standard vs. batch). No pricing data is currently listed for this model on this page, so contact providers directly for current rates.

What is Grok 4 Fast best used for?

Grok 4 Fast excels at high-speed processing of large documents and images where rapid response times are crucial. Its 2 million token context window and 148.78 tokens/second output speed make it ideal for document analysis, content summarization, conversational AI, and image analysis tasks that prioritize speed over complex reasoning.

Does Grok 4 Fast support tool calling and function execution?

No, Grok 4 Fast does not include tool calling or function execution capabilities. It focuses on pure text and image understanding tasks with optimized speed performance. For agent-like capabilities requiring tool use, consider other models in the Grok family or alternative providers.