AI Models Database

173 models across text, image, audio & embedding

All 173 Text 142 Image 19 Embedding 7 TTS 3 STT 2

32 models

Qwen 3.5 27B

Dense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.

FLUX.2 Klein 4B

Latest

Black Forest Labs

Image Local

Sub-second image generation on consumer GPUs. Unified text-to-image and image editing in one checkpoint. Apache 2.0 licensed for commercial use. Best open fast image model.

GLM-4.7

Latest

Zhipu AI

Text Local

Z.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.

OLMo 3 32B

Latest

AI2

Text Local

Allen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.

DeepSeek V3.2

Latest

DeepSeek

Text Local

DeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.

Arcee Trinity Mini

Latest

Arcee

Text Local

Arcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.

IBM Granite 4.0 32B

Latest

IBM

Text Local

IBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.

Phi-4 Mini Reasoning

Latest

Microsoft

Text Local

Chain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.

Llama 4 Scout

Latest

Gemma 3 27B

Latest

Google

Text Local

Largest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.

Dia 1.6B

Latest

Nari Labs

TTS Local

Multi-speaker dialogue TTS with non-verbal sounds (laughs, sighs, coughs). Voice cloning via audio prompt conditioning. Best model for scripted dialogue and podcast generation.

Nomic Embed Text V2 MoE

Latest

Nomic AI

Embedding Local

First open-source MoE text embedding model. 475M total / 305M active parameters. Matryoshka flexible dimensions (256-768). State-of-the-art multilingual embedding at release.

Mistral Small 3

Latest

Mistral

Text Local

Mistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.

Kokoro 82M

Latest

hexgrad

TTS Local

Ultra-lightweight TTS model. Under $1 per million characters. 54 pre-built voices across 8 languages. Apache 2.0 for commercial deployment. 8.9M+ monthly HuggingFace downloads.

Devstral Small

Latest

Mistral

Text Local

Efficient local coding model. Optimized for code generation, completion, and development assistance.

Ministral 8B

Latest

Mistral

Text Local

Efficient 8B model with vision support. Good balance for local deployment with moderate resources.

Whisper Large V3 Turbo

Latest

OpenAI

STT Local

4.5x faster than Whisper Large V3 with minimal quality loss. Decoder reduced from 32 to 4 layers. Most-downloaded Whisper variant (4.6M+ monthly). Best speed/accuracy balance for local STT.