QwQ 32B (Reasoning)
Alibaba
Reasoning-focused model with extended thinking
Text Generation Local Qwen Family
Parameters
32B
params
Context Window
32K
tokens
Max Output
-
tokens
Input Price
-
per 1M tokens
Output Price
-
per 1M tokens
Qwen Family 14 models
The full Qwen line by generation — pricing and capabilities vary across the family.
3.5
3.5 35B-A3B
Efficient inference, agentic tasks, edge deployment
262K
context
3.5 27B
Reasoning, coding, analysis, multimodal
262K
context
3.5 Plus
AI agents, multilingual tasks, reasoning, multimodal
$0.4 / $2.4
in / out · 1M
3.5 9B
General-purpose, coding, reasoning, multilingual
262K
context
3
Alibaba
QwQ 32B (Reasoning) Current
Deep reasoning, chain-of-thought
32K
context
2.5 7B
Multilingual, coding, math
128K
context
2.5 14B
Complex tasks, analysis
128K
context
2.5 32B
Professional applications
128K
context
2.5 Coder 32B
Code generation, debugging
128K
context
Capabilities
👁️
Vision
⚡
Function Calling
📋
JSON Mode
🌊
Streaming
💬
System Prompt
🖥️
Code Execution
🔍
Web Search
🔌
MCP Support
Local Model Specs
Quantization
Q4_K_M
Runtime
ollama
Details
- Release Date
- November 28, 2024
- Knowledge Cutoff
- September 1, 2024
- Source
- Local
- License
- Qwen
- Model ID
- qwq-32b
Last updated: March 13, 2026