LightweightOpenAI

GPT-5 mini

Name: GPT-5 mini
Availability: InStock
Author: OpenAI

GPT-5 mini is OpenAI's lightweight model offering multimodal capabilities with text and image processing in a 200K token context window.

Context 200K

Tier Lightweight

Knowledge Dec 2024

Tools Supported

Modalities text, image

Input from

$0.250 / 1M tokens

across 2 providers

Compare Prices Model Page →API Docs

API Pricing

Provider	Input / 1M	Output / 1M	Cached / 1M	Speed	TTFT	Updated
OpenRouter	$0.250	$2.00	$0.025	107 t/s	14.5s	7/13/2026
Perplexity	$0.250	$2.00	-	107 t/s	14.5s	7/6/2026

Prices updated daily. Last check: Jul 13, 2026

Performance & Benchmarks

Source: Artificial Analysis →

Intelligence

30.9 / 100

Math

85.0 / 100

Output Speed

107 t/s

Latency (TTFT)

14.5s

Reasoning & Knowledge

MMLU-Pro82.8%
GPQA Diamond80.3%
Humanity's Last Exam14.6%

Coding

LiveCodeBench69.2%
SciCode41.0%

Math

AIME 202585.0%

Agentic & Tool Use

Terminal-Bench Hard28.8%
τ²-bench71.1%

Instruction & Long Context

IFBench71.2%
Long-Context Reasoning66.0%

Benchmarks measured Jul 2026. Scores are independent evaluations, not vendor-reported.

Model Details

General

Creator: OpenAI
Family: GPT
Tier: Lightweight
Context Window: 200K
Knowledge Cutoff: Dec 2024
Modalities: Text, Image

Capabilities

Tool Calling: Yes
Open Source: No
Subtypes: Chat Completion

Strengths & Limitations

Strengths

200K token context window enables processing of lengthy documents
Multimodal capabilities support both text and image inputs
Tool calling functionality with external API integration
December 2024 knowledge cutoff provides recent training data
Lightweight design offers faster inference than flagship GPT-5
Part of GPT-5 family inheriting latest architectural improvements
Chat completion format optimized for conversational workflows

Limitations

Proprietary model with no open-source weights available
Lightweight tier likely has reduced reasoning capabilities vs flagship GPT-5
No audio or video modality support beyond text and images
Performance limitations compared to larger models in the GPT-5 family
Smaller parameter count may impact complex task performance

Key Features

•200K token context window

•Text and image input processing

•Tool calling with function execution

•Chat completion API format

•Streaming response support

•Multimodal conversation handling

•Structured output capabilities

•Batch processing compatibility

About GPT-5 mini

GPT-5 mini is OpenAI's lightweight tier model in the GPT-5 family, positioned below the flagship GPT-5 for cost-effective deployment scenarios. As part of OpenAI's latest generation, it represents the company's approach to providing capable AI at reduced computational requirements compared to the full GPT-5 model. The model features a 200,000 token context window and supports both text and image inputs for chat completion tasks. GPT-5 mini includes tool calling capabilities, allowing it to interact with external functions and APIs. With a knowledge cutoff of December 2024, it has access to relatively recent training data compared to many competing models. The multimodal design enables users to process documents, images, and text within the same conversation context. GPT-5 mini targets use cases where developers need GPT-5 family capabilities but require faster response times or higher throughput than the flagship model provides. It competes with other lightweight models like Claude Haiku and Gemini Flash variants, offering OpenAI's particular approach to reasoning and instruction following in a more efficient package.

Common Use Cases

GPT-5 mini serves applications requiring GPT-5 family capabilities with emphasis on speed and cost efficiency. Its lightweight design makes it suitable for high-volume customer service chatbots, content moderation at scale, and automated document processing where the 200K context window enables handling substantial text volumes. The multimodal capabilities support applications like visual content analysis, document understanding with embedded images, and educational tools that process both text and visual materials. With tool calling support, it can power lightweight AI agents for task automation, API integrations, and workflow orchestration where the full computational power of flagship models is unnecessary.

Frequently Asked Questions

How much does GPT-5 mini cost per million tokens?

GPT-5 mini pricing varies by provider and pricing type (standard vs batch). Check the pricing table above for current rates across all providers offering this model.

What is GPT-5 mini best used for?

GPT-5 mini excels at high-volume applications requiring GPT-5 family capabilities with faster response times. Its 200K context window and multimodal support make it ideal for document processing, customer service automation, content moderation, and lightweight AI agents where speed and cost efficiency are priorities over maximum reasoning capability.

How does GPT-5 mini compare to the full GPT-5 model?

GPT-5 mini offers the same 200K context window and multimodal capabilities as GPT-5 but with reduced model parameters for faster inference and lower costs. While it maintains tool calling and chat completion features, it likely has diminished performance on complex reasoning, advanced coding, and sophisticated analysis tasks compared to the flagship GPT-5 model.