Gemini 2.5 Flash Lite (8B)

Google

Gemini 2.5 Flash Lite (8B) is the most cost-effective model in the Gemini family, with a nearly unlimited free-tier quota.

Text Generation, API, Gemini Family, v2.5-flash-8b
Parameters: -
Context Window: 1.0M tokens
Max Output: 8K tokens
Input Price: $0.0375 per 1M tokens
Output Price: $0.1500 per 1M tokens

Capabilities

Vision
Function Calling
JSON Mode
Streaming
System Prompt
Code Execution
Web Search
MCP Support

Extended Pricing

Input: $0.0375 per 1M tokens
Output: $0.1500 per 1M tokens
Cache Read: $0.0100 per 1M tokens
Cache Write: $0.0375 per 1M tokens
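
To make the per-token rates concrete, here is a minimal sketch of a cost estimate in Python. It uses only the prices listed above; the function name `estimate_cost`, the category keys, and the example token counts are hypothetical and chosen for illustration, and real billing may differ (for example through free-tier quota or rounding).

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing listed above.
# The category keys are illustrative names, not official billing fields.
PRICE_PER_MILLION = {
    "input": Decimal("0.0375"),
    "output": Decimal("0.1500"),
    "cache_read": Decimal("0.0100"),
    "cache_write": Decimal("0.0375"),
}


def estimate_cost(tokens: dict[str, int]) -> Decimal:
    """Return the estimated USD cost for token counts keyed by category."""
    total = Decimal("0")
    for category, count in tokens.items():
        if category not in PRICE_PER_MILLION:
            raise KeyError(f"unknown pricing category: {category}")
        if count < 0:
            raise ValueError(f"token count must be non-negative: {count}")
        # Rate is per 1M tokens, so scale the count down accordingly.
        total += PRICE_PER_MILLION[category] * count / Decimal(1_000_000)
    return total


if __name__ == "__main__":
    # Hypothetical request: 200,000 input tokens and 8,000 output tokens.
    cost = estimate_cost({"input": 200_000, "output": 8_000})
    print(f"${cost:.6f}")
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding error.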

Details

Release Date: May 20, 2025
Knowledge Cutoff: August 1, 2024
Source: API
API Key Required: Yes
License: Proprietary
Model ID: gemini-2.5-flash-8b
Last updated: November 1, 2025