Qwen 3.5 9B

Alibaba

Most popular local Qwen 3.5 size. Hybrid architecture with Gated Delta Networks and sparse MoE. Supports 201 languages, thinking/non-thinking modes, and native multimodal (text + image + video).

Text Generation Local Qwen Family v3.5
Parameters
9B
params
Context Window
262K
tokens
Max Output
66K
tokens
Input Price
-
per 1M tokens
Output Price
-
per 1M tokens

Capabilities

👁️
Vision
Function Calling
📋
JSON Mode
🌊
Streaming
💬
System Prompt
🖥️
Code Execution
🔍
Web Search
🔌
MCP Support

Local Model Specs

Quantization
Q4_K_M
Architecture
Hybrid MoE (Gated Delta Networks)
Runtime
Ollama / llama.cpp
Disk Size
6.6 GB

Details

Release Date
February 16, 2026
Knowledge Cutoff
-
Source
Local
License
Apache 2.0
Model ID
qwen3.5-9b
Last updated: March 13, 2026