AI Models Database

173 models across text, image, audio & embedding

32 models

Qwen 3.5 27B

Latest
Alibaba
Text Local

Dense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.

Context
262K
Params
27
Cutoff
-
License
Apache 2.0
Vision Functions
13 in family

FLUX.2 Klein 4B

Latest
Black Forest Labs
Image Local

Sub-second image generation on consumer GPUs. Unified text-to-image and image editing in one checkpoint. Apache 2.0 licensed for commercial use. Best open fast image model.

Type
Image
Source
Local
Released
Jan 2026
7 in family

GLM-4.7

Latest
Zhipu AI
Text Local

Z.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.

Context
200K
Params
358
Cutoff
-
License
Apache 2.0
Vision Functions

OLMo 3 32B

Latest
AI2
Text Local

Allen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.

Context
66K
Params
32
Cutoff
-
License
Apache 2.0
Functions
2 in family

DeepSeek V3.2

Latest
DeepSeek
Text Local

DeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.

Context
128K
Params
685
Cutoff
-
License
MIT
Functions
13 in family

Arcee Trinity Mini

Latest
Arcee
Text Local

Arcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.

Context
128K
Params
26
Cutoff
-
License
Apache 2.0
Functions

IBM Granite 4.0 32B

Latest
IBM
Text Local

IBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.

Context
128K
Params
32
Cutoff
-
License
Apache 2.0
Functions
2 in family

Phi-4 Mini Reasoning

Latest
Microsoft
Text Local

Chain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.

Context
131K
Params
4
Cutoff
-
License
MIT
Functions
6 in family

Llama 4 Scout

Latest
Meta
Text Local

Efficient MoE multimodal model with industry-leading 10M token context window. 109B total / 17B active parameters across 16 experts. Native text + image understanding. Trained on 40T tokens in 12 languages.

Context
10.0M
Params
109
Cutoff
-
License
Llama 4 Community
Vision Functions
8 in family

Gemma 3 27B

Latest
Google
Text Local

Largest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.

Context
131K
Params
27
Cutoff
-
License
Gemma Terms of Use
Vision Functions
7 in family

Dia 1.6B

Latest
Nari Labs
TTS Local

Multi-speaker dialogue TTS with non-verbal sounds (laughs, sighs, coughs). Voice cloning via audio prompt conditioning. Best model for scripted dialogue and podcast generation.

Source
Local
Released
Mar 2025

Nomic Embed Text V2 MoE

Latest
Nomic AI
Embedding Local

First open-source MoE text embedding model. 475M total / 305M active parameters. Matryoshka flexible dimensions (256-768). State-of-the-art multilingual embedding at release.

Context
-
Params
475
License
Apache 2.0
2 in family

Mistral Small 3

Latest
Mistral
Text Local

Mistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.

Context
32K
Params
24
Cutoff
-
License
Apache 2.0
Vision Functions
8 in family

Kokoro 82M

Latest
hexgrad
TTS Local

Ultra-lightweight TTS model. Under $1 per million characters. 54 pre-built voices across 8 languages. Apache 2.0 for commercial deployment. 8.9M+ monthly HuggingFace downloads.

Source
Local
Released
Jan 2025

Devstral Small

Latest
Mistral
Text Local

Efficient local coding model. Optimized for code generation, completion, and development assistance.

Context
128K
Params
8
Cutoff
-
License
Apache 2.0
Functions
3 in family

Ministral 8B

Latest
Mistral
Text Local

Efficient 8B model with vision support. Good balance for local deployment with moderate resources.

Context
128K
Params
8
Cutoff
-
License
Apache 2.0
Vision
5 in family

Whisper Large V3 Turbo

Latest
OpenAI
STT Local

4.5x faster than Whisper Large V3 with minimal quality loss. Decoder reduced from 32 to 4 layers. Most-downloaded Whisper variant (4.6M+ monthly). Best speed/accuracy balance for local STT.

Source
Local
Released
Oct 2024
2 in family

Yi Coder 9B

Latest
01.AI
Text Local

01.AI coding model with 128K context

Context
128K
Params
9
Cutoff
Mar 2024
License
Apache 2.0
Functions

Dolphin 2.9 Llama3 8B

Latest
Dolphin
Text Local

Fine-tuned Llama 3 without alignment restrictions

Context
8K
Params
8
Cutoff
Sep 2023
License
Llama 3
Functions

Codestral 22B

Latest
Mistral
Text Local

Mistral dedicated coding model, supports fill-in-the-middle

Context
32K
Params
22
Cutoff
Apr 2024
License
MNPL
Functions
3 in family

mxbai-embed-large

Latest
Mixedbread AI
Embedding Local

High-quality embeddings for semantic search

Context
1K
Params
335
License
Apache 2.0

Command R 35B

Latest
Cohere
Text Local

Cohere enterprise model optimized for RAG workflows

Context
128K
Params
35
Cutoff
Jan 2024
License
CC-BY-NC-4.0
Functions
3 in family

Moondream 2

Latest
Moondream
Text Local

Tiny but capable vision model

Context
2K
Params
2
Cutoff
Sep 2023
License
Apache 2.0
Vision

StarCoder2 15B

Latest
Hugging Face
Text Local

BigCode latest coding model with improved performance

Context
16K
Params
15
Cutoff
Jan 2024
License
BigCode OpenRAIL-M
Functions

nomic-embed-text

Latest
Nomic AI
Embedding Local

768-dimensional text embedding model optimized for semantic similarity search. ~1-2s per embedding on CPU. Used for ProHive RAG system.

Context
8K
Params
137
License
Apache 2.0
2 in family

BGE Large

Latest
BAAI
Embedding Local

BAAI embedding model for RAG applications

Context
1K
Params
335
License
MIT
2 in family

LLaVA 1.6 13B

Latest
Meta
Text Local

Larger LLaVA with improved visual reasoning

Context
4K
Params
13
Cutoff
Jan 2024
License
Apache 2.0
Vision
2 in family

SQLCoder 7B

Latest
Defog
Text Local

Defog SQL generation model, beats GPT-4 on SQL

Context
8K
Params
7
Cutoff
Sep 2023
License
Apache 2.0

Mixtral 8x7B

Latest
Mistral
Text Local

Mixture of Experts model with 8 experts, uses 12B active params

Context
32K
Params
47
Cutoff
Dec 2023
License
Apache 2.0
Functions
2 in family

Orca 2 13B

Latest
Microsoft
Text Local

Microsoft Orca with improved reasoning capabilities

Context
4K
Params
13
Cutoff
Jun 2023
License
Microsoft Research
Functions

Neural Chat 7B

Latest
Intel
Text Local

Intel optimized chat model

Context
8K
Params
7
Cutoff
Jun 2023
License
Apache 2.0
Functions

all-MiniLM-L6-v2

Latest
Hugging Face
Embedding Local

Lightweight embedding model for quick inference

Context
0K
Params
23
License
Apache 2.0