173 models across text, image, audio & embedding
DeepSeek V3.2 is a Mixture-of-Experts model with 685B total parameters (37B active per token). It features Multi-Token Prediction and Sparse Attention for efficiency.