Dia 1.6B
Nari Labs
Multi-speaker dialogue TTS with non-verbal sounds (laughs, sighs, coughs). Voice cloning via audio prompt conditioning. Best model for scripted dialogue and podcast generation.
Text-to-Speech Local Latest Dia Family v1.0
Type
Text-to-Speech
Source
Local
License
Apache 2.0
Capabilities
🌊
Streaming
Local Model Specs
Architecture
Transformer + Descript Audio Codec
Runtime
Python / torch
VRAM Usage
10 GB
Disk Size
3.2 GB
Details
- Release Date
- March 1, 2025
- Knowledge Cutoff
- -
- Source
- Local
- License
- Apache 2.0
- Model ID
- dia-1.6b
Last updated: March 13, 2026