Gemma 2 2B

Google

Google compact model optimized for efficiency

Text Generation Local Gemma Family

Back to Models

Parameters

params

Context Window

tokens

Max Output

tokens

Input Price

per 1M tokens

Output Price

per 1M tokens

Gemma Family 7 models

The full Gemma line by generation — pricing and capabilities vary across the family.

Google

FunctionGemma

Native function calling for on-device agents. Routes complex tasks to larger models. Optimized for edge deployment.

context

Dec 2025

2 9B

General purpose, balanced

context

Jun 2024

2 2B Current

Edge devices, fast inference

context

Jun 2024

2 27B

High quality generation

context

Jun 2024

3 1B (llama.cpp)

Fast AI assistant for chat, code generation, and reasoning tasks

context

Feb 2024

3 12B

General-purpose, multimodal, coding

Complex reasoning, multimodal, research

131K

context

Mar 2025

Capabilities

👁️

Vision

⚡

Function Calling

📋

JSON Mode

🌊

Streaming

💬

System Prompt

🖥️

Code Execution

🔍

Web Search

🔌

MCP Support

Local Model Specs

Quantization

Q4_K_M

Runtime

ollama

Details

Release Date: June 27, 2024
Knowledge Cutoff: February 1, 2024
Source: Local
License: Gemma
Model ID: gemma2-2b

Last updated: November 26, 2025