Llama 3.1 8B
Meta
Meta latest Llama with 128K context window and improved reasoning
Text Generation Local Llama Family
Parameters
8B
params
Context Window
128K
tokens
Max Output
-
tokens
Input Price
-
per 1M tokens
Output Price
-
per 1M tokens
Llama Family Timeline 8 versions
Capabilities
👁️
Vision
⚡
Function Calling
📋
JSON Mode
🌊
Streaming
💬
System Prompt
🖥️
Code Execution
🔍
Web Search
🔌
MCP Support
Local Model Specs
Quantization
Q4_K_M
Runtime
ollama
Details
- Release Date
- July 23, 2024
- Knowledge Cutoff
- December 1, 2023
- Source
- Local
- License
- Llama 3.1 Community
- Model ID
- llama3.1-8b
Last updated: November 26, 2025