Llama 3.1 8B

Meta

Meta latest Llama with 128K context window and improved reasoning

Text Generation Local Llama Family
Parameters
8B
params
Context Window
128K
tokens
Max Output
-
tokens
Input Price
-
per 1M tokens
Output Price
-
per 1M tokens

Capabilities

👁️
Vision
Function Calling
📋
JSON Mode
🌊
Streaming
💬
System Prompt
🖥️
Code Execution
🔍
Web Search
🔌
MCP Support

Local Model Specs

Quantization
Q4_K_M
Runtime
ollama

Details

Release Date
July 23, 2024
Knowledge Cutoff
December 1, 2023
Source
Local
License
Llama 3.1 Community
Model ID
llama3.1-8b
Last updated: November 26, 2025