Gemini 3.1 Flash-Lite
Lightweight, high-volume Gemini model optimized for agentic tasks, translation, transcription, and data extraction. First Flash-Lite in the 3.x series.
Text Generation · API · Gemini Family · v3.1
- Parameters: not listed
- Context Window: 1.0M tokens
- Max Output: 66K tokens
- Input Price: $0.25 per 1M tokens
- Output Price: $1.5 per 1M tokens
Gemini Family (13 models)
The full Gemini line by generation. Pricing and capabilities vary across the family.
3.1
- 3.1 Flash-Lite (Current): High-volume agentic tasks, translation, summarization, routing. $0.25 / $1.5 in / out · 1M
- 3.1 Pro: Complex reasoning, code generation, agentic tasks. $2 / $12 in / out · 1M
3
2.5
- 2.5 Flash Lite (8B): Lightweight classification, unlimited quota fallback for high-volume tasks. $0.0375 / $0.15 in / out · 1M
- 2.5 Flash: Fast document classification, balanced speed/quality for ingestion pipeline. $0.075 / $0.3 in / out · 1M
- 2.5 Pro: High-quality classification, long-context analysis, strategic planning via Gemini CLI. $1.25 / $5 in / out · 1M
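To make the spread across the family concrete, here is a minimal sketch that prices one workload on each listed model. The prices are copied from the list above and read as USD per 1M tokens. The helper name `workload_costs` and the sample token counts are hypothetical, and the calculation ignores caching and any tiered or batch pricing.

```python
from decimal import Decimal

# (input, output) prices in USD per 1M tokens, copied from the family list above.
FAMILY_PRICES = {
    "3.1 Flash-Lite": (Decimal("0.25"), Decimal("1.5")),
    "3.1 Pro": (Decimal("2"), Decimal("12")),
    "2.5 Flash Lite (8B)": (Decimal("0.0375"), Decimal("0.15")),
    "2.5 Flash": (Decimal("0.075"), Decimal("0.3")),
    "2.5 Pro": (Decimal("1.25"), Decimal("5")),
}


def workload_costs(input_tokens, output_tokens):
    """Hypothetical helper: USD cost of the same workload on each listed model."""
    million = Decimal(1_000_000)
    return {
        name: (input_tokens * p_in + output_tokens * p_out) / million
        for name, (p_in, p_out) in FAMILY_PRICES.items()
    }


if __name__ == "__main__":
    # Sample workload (an assumption): 10M input tokens, 1M output tokens.
    costs = workload_costs(10_000_000, 1_000_000)
    for name, usd in sorted(costs.items(), key=lambda kv: kv[1]):
        print(f"{name:<22} ${usd}")
```

Decimal is used instead of float so that small per-token prices add up without rounding noise.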
Capabilities
- 👁️ Vision
- ⚡ Function Calling
- 📋 JSON Mode
- 🌊 Streaming
- 💬 System Prompt
- 🖥️ Code Execution
- 🔍 Web Search
- 🔌 MCP Support
Extended Pricing
- Input: $0.25/1M
- Output: $1.5/1M
- Cache Read: $0.03/1M
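The three rates above combine into a per-request estimate. The sketch below is one way to do the arithmetic. It assumes that cached tokens are a subset of the input tokens and are billed at the cache-read rate instead of the input rate; the page does not state this, and it does not mention any cache storage fee. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Rates in USD per 1M tokens, copied from the Extended Pricing list above.
INPUT_PER_M = Decimal("0.25")
OUTPUT_PER_M = Decimal("1.5")
CACHE_READ_PER_M = Decimal("0.03")
MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the portion of input_tokens served from
    cache, billed at the cache-read rate rather than the input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHE_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / MILLION


if __name__ == "__main__":
    # A 100K-token prompt with 80K tokens read from cache and a 2K-token reply.
    print(estimate_cost(100_000, 2_000, cached_tokens=80_000))
```

Under these assumptions, a request whose prompt is mostly cached costs noticeably less than the same request sent cold, because the cache-read rate is roughly an eighth of the input rate.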
Details
- Release Date: March 3, 2026
- Knowledge Cutoff: January 1, 2025
- Source: API
- API Key Required: Yes
- License: Proprietary
- Model ID: gemini-3.1-flash-lite
Last updated: March 13, 2026