Gemma 2 2B
Google compact model optimized for efficiency
Text Generation Local Gemma Family
Parameters
2B
params
Context Window
8K
tokens
Max Output
-
tokens
Input Price
-
per 1M tokens
Output Price
-
per 1M tokens
Gemma Family 7 models
The full Gemma line by generation — pricing and capabilities vary across the family.
Google
FunctionGemma
Native function calling for on-device agents. Routes complex tasks to larger models. Optimized for edge deployment.
8K
context
2 9B
General purpose, balanced
8K
context
2 2B Current
Edge devices, fast inference
8K
context
2 27B
High quality generation
8K
context
3 1B (llama.cpp)
Fast AI assistant for chat, code generation, and reasoning tasks
4K
context
Capabilities
👁️
Vision
⚡
Function Calling
📋
JSON Mode
🌊
Streaming
💬
System Prompt
🖥️
Code Execution
🔍
Web Search
🔌
MCP Support
Local Model Specs
Quantization
Q4_K_M
Runtime
ollama
Details
- Release Date
- June 27, 2024
- Knowledge Cutoff
- February 1, 2024
- Source
- Local
- License
- Gemma
- Model ID
- gemma2-2b
Last updated: November 26, 2025