173 models across text, image, audio & embedding
About 4.5x faster than Whisper Large V3 with minimal quality loss. Decoder reduced from 32 to 4 layers. Most-downloaded Whisper variant (4.6M+ monthly downloads). Best speed/accuracy balance for local speech-to-text (STT).