Google
Gemini 3.5 Flash
Gemini 3.5 Flash by Google — compare inference API pricing across providers.
Input from
$0.750 / 1M tokens
across 3 providers
API Pricing
Cheapest on Google Cloud — 43% below avg| Provider | Input / 1M | Output / 1M | Cached / 1M |
|---|---|---|---|
| $0.750 | $4.50 | - | |
| $1.50 | $9.00 | $0.150 | |
| $1.50 | $9.00 | - | |
| $1.50 | $9.00 | $0.150 |
Prices updated daily. Last check: Jun 23, 2026
Performance & Benchmarks
Source: Artificial Analysis →Intelligence
50.2 / 100
Coding
70.1 / 100
Output Speed
241 t/s
Latency (TTFT)
15.1s
Reasoning & Knowledge
- GPQA Diamond92.2%
- Humanity's Last Exam41.0%
Coding
- SciCode53.1%
Agentic & Tool Use
- Terminal-Bench Hard40.9%
- Terminal-Bench v2.178.7%
- τ²-bench95.3%
- τ-bench Banking25.4%
Instruction & Long Context
- IFBench76.3%
- Long-Context Reasoning69.3%
Benchmarks measured Jun 2026. Scores are independent evaluations, not vendor-reported.
Model Details
General
- Creator
- Modalities
- Text
Capabilities
- Tool Calling
- No
- Open Source
- No