GPT Realtime mini
GPT Realtime mini is OpenAI's lightweight real-time audio model for voice conversations, with text and audio input/output and a 128K token context window.
API Pricing
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- OpenAI
- Family
- GPT Realtime
- Tier
- Lightweight
- Context Window
- 128K
- Knowledge Cutoff
- Oct 2024
- Modalities
- Text, Audio
Capabilities
- Tool Calling
- Yes
- Open Source
- No
- Subtypes
- Chat Completion
Strengths & Limitations
- Native real-time audio input and output without separate TTS/STT systems
- 128,000 token context window for extended conversations
- Tool calling support enables API integrations during voice interactions
- Lightweight tier offers faster response times than larger family variants
- October 2024 knowledge cutoff provides relatively current information
- Text and audio modality support allows flexible interaction modes
- No image or visual input support
- Proprietary model with weights not publicly available
- Lightweight tier has reduced capabilities compared to larger GPT models
- Limited to chat completion tasks within modality constraints
Key Features
About GPT Realtime mini
Common Use Cases
GPT Realtime mini is suited for applications requiring cost-effective real-time voice interactions, such as customer service chatbots, voice assistants, interactive voice response systems, and conversational interfaces where speed and responsiveness matter more than maximum reasoning capability. Its tool calling functionality makes it appropriate for voice-activated workflows that need to integrate with external services, while the 128K context window supports extended conversations without losing context. The lightweight tier makes it practical for high-volume voice applications where the full capability of larger models isn't necessary.
Frequently Asked Questions
How much does GPT Realtime mini cost per million tokens?
GPT Realtime mini pricing varies by provider and may include both input/output token costs and real-time audio processing fees. Check the pricing table above for current rates across all providers offering this model.
What is GPT Realtime mini best used for?
GPT Realtime mini excels at real-time voice conversations, customer service chatbots, voice assistants, and interactive applications where audio input/output is needed. Its lightweight design makes it cost-effective for high-volume voice interactions while still supporting tool calling for API integrations.
How does GPT Realtime mini compare to text-only GPT models?
GPT Realtime mini offers native real-time audio processing that text-only GPT models lack, eliminating the need for separate speech-to-text and text-to-speech systems. However, it's positioned as a lightweight tier with reduced capabilities compared to flagship GPT models, and it doesn't support visual inputs that some other GPT variants offer.