Gemini 2.5 Flash
Gemini 2.5 Flash balances speed and quality, ideal for production workloads with 1M context window.
Text Generation API Gemini Family v2.5-flash
Parameters
-
params
Context Window
1.0M
tokens
Max Output
8K
tokens
Input Price
$0.075
per 1M tokens
Output Price
$0.3
per 1M tokens
Gemini Family 13 models
The full Gemini line by generation — pricing and capabilities vary across the family.
3.1
3
2.5
2.5 Flash Lite (8B)
Lightweight classification, unlimited quota fallback for high-volume tasks
$0.0375 / $0.15
in / out · 1M
2.5 Flash Current
Fast document classification, balanced speed/quality for ingestion pipeline
$0.075 / $0.3
in / out · 1M
2.5 Pro
High-quality classification, long-context analysis, strategic planning via Gemini CLI
$1.25 / $5
in / out · 1M
Capabilities
👁️
Vision
⚡
Function Calling
📋
JSON Mode
🌊
Streaming
💬
System Prompt
🖥️
Code Execution
🔍
Web Search
🔌
MCP Support
Extended Pricing
Input
$0.075/1M
Output
$0.3/1M
Cache Read
$0.0188/1M
Cache Write
$0.075/1M
Details
- Release Date
- April 17, 2025
- Knowledge Cutoff
- August 1, 2024
- Source
- API
- API Key Required
- Yes
- License
- Proprietary
- Model ID
- gemini-2.5-flash
Last updated: November 1, 2025