Google

Gemini 3.5 Flash

Gemini 3.5 Flash by Google — compare inference API pricing across providers.

Input from
$0.750 / 1M tokens
across 3 providers

API Pricing

Cheapest on Google Cloud: 43% below average

Provider    Input / 1M    Output / 1M    Cached / 1M
            $0.750        $4.50          -
            $1.50         $9.00          $0.150
            $1.50         $9.00          -
            $1.50         $9.00          $0.150
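
As a rough illustration of how the listed rates translate into a bill, the sketch below computes the cost of a single request and reproduces the "43% below avg" figure. It assumes that figure is the cheapest input price compared against the plain mean of the four listed input prices; the page does not state how it is calculated. The function names and the example token counts are hypothetical.

```python
# Input prices in USD per 1M tokens, copied from the four table rows above.
input_prices = [0.750, 1.50, 1.50, 1.50]


def percent_below_average(prices):
    """Percent by which the cheapest price sits below the mean price.

    Assumption: the page compares against a simple unweighted mean.
    """
    mean = sum(prices) / len(prices)
    return (mean - min(prices)) / mean * 100


def request_cost(input_tokens, output_tokens, input_per_m, output_per_m):
    """Cost in USD of one request at the given per-1M-token rates."""
    return (input_tokens / 1e6) * input_per_m + (output_tokens / 1e6) * output_per_m


if __name__ == "__main__":
    # Rounds to 43, matching the figure shown on the page.
    print(round(percent_below_average(input_prices)))
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens
    # at the cheapest listed rates ($0.750 in, $4.50 out).
    print(f"${request_cost(10_000, 2_000, 0.750, 4.50):.4f}")
```

Cached-token discounts are left out of this sketch because only some rows list a cached rate.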

Prices updated daily. Last check: Jun 23, 2026

Performance & Benchmarks

Source: Artificial Analysis
Intelligence
50.2 / 100
Coding
70.1 / 100
Output Speed
241 t/s
Latency (TTFT)
15.1s
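
The speed and latency figures above can be combined into a back-of-the-envelope estimate of total response time. This sketch assumes a simple linear model (time to first token plus output tokens divided by output speed), which ignores network variance, queueing, and any thinking tokens counted toward output; the function name and the 1,000-token example are hypothetical.

```python
def estimated_response_seconds(output_tokens, ttft_s=15.1, tokens_per_s=241.0):
    """Rough end-to-end time: wait for the first token, then stream the rest.

    Defaults are the TTFT and output speed listed above.
    Assumption: throughput is constant for the whole response.
    """
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    # Hypothetical 1,000-token answer: roughly 15.1 s of latency plus about 4 s of streaming.
    print(f"{estimated_response_seconds(1000):.1f} s")
```

Under this model the time to first token dominates for short answers, so the high output speed matters mainly for long generations.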

Reasoning & Knowledge

  • GPQA Diamond: 92.2%
  • Humanity's Last Exam: 41.0%

Coding

  • SciCode: 53.1%

Agentic & Tool Use

  • Terminal-Bench Hard: 40.9%
  • Terminal-Bench v2.1: 78.7%
  • τ²-bench: 95.3%
  • τ-bench Banking: 25.4%

Instruction & Long Context

  • IFBench: 76.3%
  • Long-Context Reasoning: 69.3%

Benchmarks measured Jun 2026. Scores are independent evaluations, not vendor-reported.

Model Details

General

Creator
Google
Modalities
Text

Capabilities

Tool Calling
No
Open Source
No