Gemini 2.5 Flash

Google

Gemini 2.5 Flash balances speed and quality and offers a 1M-token context window, which suits it to production workloads.

Text Generation API Gemini Family v2.5-flash

Parameters

params

Context Window: 1.0M tokens

Max Output

tokens

Input Price: $0.075 per 1M tokens

Output Price: $0.3 per 1M tokens

Gemini Family 13 models

The full Gemini line by generation — pricing and capabilities vary across the family.

3.5

3.5 Flash (Latest)

Agentic workflows, coding, multimodal, high-throughput

3.1

High-volume agentic tasks, translation, summarization, routing

Complex reasoning, code generation, agentic tasks

Most capable Google model

Reasoning, coding, multimodal

Fast multimodal, high volume

2.5

Lightweight classification, unlimited quota fallback for high-volume tasks

$0.0375 / $0.15 in / out · 1M · May 2025

2.5 Flash (Current)

Fast document classification, balanced speed/quality for ingestion pipeline

$0.075 / $0.3 in / out · 1M · Apr 2025

2.5 Pro

High-quality classification, long-context analysis, strategic planning via Gemini CLI

2.0

Next-gen multimodal, agents

1.5

Long context, multimodal

Fast multimodal, cost-effective

GCP native, enterprise

$1.25 / $5 in / out · 1M

Capabilities

👁️ Vision
⚡ Function Calling
📋 JSON Mode
🌊 Streaming
💬 System Prompt
🖥️ Code Execution
🔍 Web Search
🔌 MCP Support

Extended Pricing

Input: $0.075/1M
Output: $0.3/1M
Cache Read: $0.0188/1M
Cache Write: $0.075/1M
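
To make the per-1M-token rates above concrete, here is a minimal sketch of a cost estimate in Python. The rates are copied from this page; the function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes each token category (input, output, cache read, cache write) is billed separately at its listed rate, which this page does not confirm.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the Extended Pricing figures on this page.
PRICE_PER_MILLION = {
    "input": Decimal("0.075"),
    "output": Decimal("0.3"),
    "cache_read": Decimal("0.0188"),
    "cache_write": Decimal("0.075"),
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Return an estimated cost in USD (hypothetical helper).

    Assumption: every category is billed independently at its listed rate.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    total = Decimal(0)
    for category, tokens in counts.items():
        # rate is per 1,000,000 tokens, so scale the token count down
        total += PRICE_PER_MILLION[category] * tokens / 1_000_000
    return total


# Example: a 200,000-token prompt with a 5,000-token reply.
cost = estimate_cost(input_tokens=200_000, output_tokens=5_000)
print(f"${cost}")
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding noise when summed over many requests.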

Details

Release Date: April 17, 2025
Knowledge Cutoff: August 1, 2024
Source: API
API Key Required: Yes
License: Proprietary
Model ID: gemini-2.5-flash
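
As an illustration of how the Model ID and API key listed above fit together, the following sketch builds (but does not send) an HTTP request using only the Python standard library. The endpoint path, the `x-goog-api-key` header, and the request body shape are assumptions based on Google's public Generative Language REST API, not something this page states; `build_request` is a hypothetical helper name.

```python
import json
import urllib.request

MODEL_ID = "gemini-2.5-flash"  # from the Details list on this page
# Assumption: base URL of the Generative Language REST API (not given on this page).
BASE_URL = "https://generativelanguage.googleapis.com/v1beta"


def build_request(prompt, api_key):
    """Build a generateContent POST request for MODEL_ID (hypothetical helper)."""
    url = f"{BASE_URL}/models/{MODEL_ID}:generateContent"
    # Assumed body shape: a list of contents, each holding text parts.
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "x-goog-api-key": api_key,  # assumed auth header
        },
        method="POST",
    )


req = build_request("Classify this document: ...", api_key="YOUR_API_KEY")
print(req.get_method(), req.full_url)
# To actually send it, pass a real key and call urllib.request.urlopen(req).
```

Building the request separately from sending it keeps the sketch runnable without credentials and makes the URL and payload easy to inspect.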

Last updated: November 1, 2025