google · May 19, 2026

Google: Gemini 3.5 Flash

google/gemini-3.5-flash
Context: 1.0M
Max Output: 65.5K
Input / 1M tokens: $1.50
Output / 1M tokens: $9.00

About

Gemini 3.5 Flash is Google's high-efficiency multimodal model, delivering coding and reasoning quality at Flash-tier cost and speed. It accepts text, image, video, audio, and PDF inputs and produces text output. It supports a context window of roughly 1M tokens with up to 64K output tokens, and it is optimized for coding and parallel agentic execution loops. On benchmarks reported by Google, it scores 76.2% on Terminal-Bench 2.1 and 83.6% on MCP Atlas. It defaults to medium thinking effort and supports configurable thinking levels (minimal, low, medium, high) to trade cost against performance.

Capabilities

Context Length: 1.0M
Max Output: 65.5K
Reasoning: Yes
In: text, image, video, file, audio
Out: text
Intelligence Index: 50.2 (#11 of 133)
Coding Index: 45.0 (#21 of 118)

Reasoning & Knowledge

GPQA Diamond: 92.2%
HLE: 41.0%

Coding & Agentic

SciCode: 53.1%
Terminal-Bench Hard: 40.9%

Source: Artificial Analysis

Type            Price / 1M tokens
Input           $1.50
Output          $9.00
Cache Read      $0.15
Cache Write     $0.083333
Audio Input     $3.00
Audio Cache     $0.30
Reasoning       $9.00
Image Input     $1.50
Web Search      $0.01 / call
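
To make the per-token prices above concrete, here is a rough cost estimator in Python. It is a sketch under stated assumptions, not the provider's billing logic: the function name `estimate_cost` is hypothetical, cached tokens are assumed to be a subset of input tokens billed at the Cache Read rate instead of the Input rate, and reasoning tokens are assumed to be billed separately from output tokens at the Reasoning rate. Actual invoices may be computed differently.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": 1.50,
    "output": 9.00,
    "cache_read": 0.15,
    "reasoning": 9.00,
}


def estimate_cost(input_tokens, output_tokens, cached_tokens=0, reasoning_tokens=0):
    """Return an estimated request cost in USD.

    Assumptions (not confirmed by the pricing page):
    - cached_tokens are part of input_tokens and are billed at the
      cache-read rate instead of the input rate;
    - reasoning_tokens are billed on top of output_tokens.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
        + reasoning_tokens * PRICE_PER_MILLION["reasoning"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200K-token prompt with 150K tokens served from cache.
    cost = estimate_cost(200_000, 4_000, cached_tokens=150_000, reasoning_tokens=2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Cache Write, audio, image, and web search charges are left out to keep the sketch short; they could be added as extra entries in the same dictionary.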

OpenAI-compatible · Model ID google/gemini-3.5-flash

curl https://api.elliotgate.com/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-3.5-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
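
The same request can be sketched in Python using only the standard library. The endpoint, headers, and payload mirror the curl call above. The helper names `build_request` and `send`, and the `ELLIOTGATE_API_KEY` environment variable, are hypothetical. Reading the reply from `choices[0].message.content` assumes the response follows the usual OpenAI chat completions shape, which the page implies but does not show.

```python
import json
import os
import urllib.request

API_URL = "https://api.elliotgate.com/v1/chat/completions"
MODEL_ID = "google/gemini-3.5-flash"


def build_request(api_key, prompt):
    """Build the same POST request as the curl example, without sending it."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Hypothetical environment variable name; use whatever holds your key.
    api_key = os.environ.get("ELLIOTGATE_API_KEY")
    if api_key:
        body = send(build_request(api_key, "Hello!"))
        # Assumed OpenAI-style response shape.
        print(body["choices"][0]["message"]["content"])
    else:
        print("Set ELLIOTGATE_API_KEY to send a live request.")
```

Separating request construction from sending keeps the payload easy to inspect or log before any network call is made.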