AI Models Database

173 models across text, image, audio & embedding

23 models

Qwen 3.5 27B

Latest
Alibaba
Text Local

Dense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.

Context
262K
Params
27
Cutoff
-
License
Apache 2.0
Vision Functions
13 in family

GLM-4.7

Latest
Zhipu AI
Text Local

Z.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.

Context
200K
Params
358
Cutoff
-
License
Apache 2.0
Vision Functions

OLMo 3 32B

Latest
AI2
Text Local

Allen AI's OLMo 3 32B is a large-scale, fully open model, released with its training data, code, and intermediate checkpoints.

Context
66K
Params
32
Cutoff
-
License
Apache 2.0
Functions
2 in family

DeepSeek V3.2

Latest
DeepSeek
Text Local

DeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.

Context
128K
Params
685
Cutoff
-
License
MIT
Functions
13 in family
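The "total vs. active" split quoted for MoE models can be made concrete with a little arithmetic. The sketch below is illustrative only: it uses the parameter counts quoted in this listing's descriptions, and the helper name `active_fraction` is hypothetical. Note that all parameters still have to be held in memory; only per-token compute scales with the active count.

```python
# (total, active) parameter counts in billions, as quoted in this listing.
MOE_MODELS = {
    "DeepSeek V3.2": (685, 37),
    "Llama 4 Scout": (109, 17),
    "Arcee Trinity Mini": (26, 3),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters used per token (hypothetical helper)."""
    if total_b <= 0 or not 0 < active_b <= total_b:
        raise ValueError("expected 0 < active <= total")
    return active_b / total_b


for name, (total_b, active_b) in MOE_MODELS.items():
    share = active_fraction(total_b, active_b)
    print(f"{name}: {active_b}B of {total_b}B active ({share:.1%})")
```

For DeepSeek V3.2 this works out to roughly 5.4% of the weights being used for any single token.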

Arcee Trinity Mini

Latest
Arcee
Text Local

Arcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.

Context
128K
Params
26
Cutoff
-
License
Apache 2.0
Functions

IBM Granite 4.0 32B

Latest
IBM
Text Local

IBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.

Context
128K
Params
32
Cutoff
-
License
Apache 2.0
Functions
2 in family

Phi-4 Mini Reasoning

Latest
Microsoft
Text Local

Chain-of-thought reasoning variant of Phi-4 Mini (3.8B), trained with SFT on OpenAI o3-mini demonstrations followed by RL. The 128K context window suits RAG workloads. Positioned as a strong reasoning model at sub-4B scale.

Context
131K
Params
4
Cutoff
-
License
MIT
Functions
6 in family
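The description above says "128K" while the Context field shows "131K". One plausible explanation, offered here as an assumption rather than something the listing states, is that "128K" means 128 × 1024 tokens and the Context field rounds the same number to decimal thousands. A minimal sketch (the function name `context_label` is hypothetical):

```python
def context_label(tokens: int) -> str:
    """Format a token count in decimal thousands or millions.

    Assumed to mirror how the Context field is rendered; the real
    formatting logic of the site is not known.
    """
    if tokens >= 1_000_000:
        return f"{tokens / 1_000_000:.1f}M"
    return f"{round(tokens / 1000)}K"


binary_128k = 128 * 1024  # 131072 tokens
print(binary_128k, context_label(binary_128k))
```

Under this reading, both figures describe the same 131,072-token window.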

Llama 4 Scout

Latest
Meta
Text Local

Efficient MoE multimodal model with industry-leading 10M token context window. 109B total / 17B active parameters across 16 experts. Native text + image understanding. Trained on 40T tokens in 12 languages.

Context
10.0M
Params
109
Cutoff
-
License
Llama 4 Community
Vision Functions
8 in family

Gemma 3 27B

Latest
Google
Text Local

Largest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.

Context
131K
Params
27
Cutoff
-
License
Gemma Terms of Use
Vision Functions
7 in family
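To put the footprint claim in perspective, a weights-only memory estimate is just parameters times bits per weight. The sketch below assumes the QAT variants use 4-bit weights and ignores the KV cache, activations, and any quantization metadata, so treat the numbers as a rough lower bound; `weight_memory_gb` is a hypothetical helper.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Weights-only memory in GB (10^9 bytes); excludes KV cache and activations."""
    if params_billions <= 0 or bits_per_weight <= 0:
        raise ValueError("parameters and bit width must be positive")
    # billions of params * bits / 8 bits-per-byte = billions of bytes = GB
    return params_billions * bits_per_weight / 8


for bits in (16, 8, 4):
    print(f"27B at {bits}-bit: {weight_memory_gb(27, bits):.1f} GB")
```

On weights alone, going from 16-bit to 4-bit is a 4x reduction; the roughly 3x figure in the card may reflect overhead this sketch leaves out, such as quantization scales or layers kept at higher precision.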

Mistral Small 3

Latest
Mistral
Text Local

Mistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.

Context
32K
Params
24
Cutoff
-
License
Apache 2.0
Vision Functions
8 in family

Devstral Small

Latest
Mistral
Text Local

Efficient local coding model. Optimized for code generation, completion, and development assistance.

Context
128K
Params
24
Cutoff
-
License
Apache 2.0
Functions
3 in family

Ministral 8B

Latest
Mistral
Text Local

Efficient 8B model with vision support. Good balance for local deployment with moderate resources.

Context
128K
Params
8
Cutoff
-
License
Apache 2.0
Vision
5 in family

Yi Coder 9B

Latest
01.AI
Text Local

01.AI coding model with 128K context

Context
128K
Params
9
Cutoff
Mar 2024
License
Apache 2.0
Functions

Dolphin 2.9 Llama3 8B

Latest
Dolphin
Text Local

Fine-tuned Llama 3 without alignment restrictions

Context
8K
Params
8
Cutoff
Sep 2023
License
Llama 3
Functions

Codestral 22B

Latest
Mistral
Text Local

Mistral's dedicated coding model, with support for fill-in-the-middle (FIM) completion

Context
32K
Params
22
Cutoff
Apr 2024
License
MNPL
Functions
3 in family

Command R 35B

Latest
Cohere
Text Local

Cohere enterprise model optimized for RAG workflows

Context
128K
Params
35
Cutoff
Jan 2024
License
CC-BY-NC-4.0
Functions
3 in family

Moondream 2

Latest
Moondream
Text Local

Tiny but capable vision model

Context
2K
Params
2
Cutoff
Sep 2023
License
Apache 2.0
Vision

StarCoder2 15B

Latest
Hugging Face
Text Local

BigCode's second-generation coding model, with improved performance over the original StarCoder

Context
16K
Params
15
Cutoff
Jan 2024
License
BigCode OpenRAIL-M
Functions

LLaVA 1.6 13B

Latest
LLaVA Team
Text Local

Larger LLaVA with improved visual reasoning

Context
4K
Params
13
Cutoff
Jan 2024
License
Apache 2.0
Vision
2 in family

SQLCoder 7B

Latest
Defog
Text Local

Defog's SQL generation model, reported by Defog to outperform GPT-4 on its own text-to-SQL evaluation

Context
8K
Params
7
Cutoff
Sep 2023
License
Apache 2.0

Mixtral 8x7B

Latest
Mistral
Text Local

Mixture-of-Experts model with 8 experts per layer, of which 2 are routed per token, so about 13B of its 47B parameters are active

Context
32K
Params
47
Cutoff
Dec 2023
License
Apache 2.0
Functions
2 in family

Orca 2 13B

Latest
Microsoft
Text Local

Microsoft Orca with improved reasoning capabilities

Context
4K
Params
13
Cutoff
Jun 2023
License
Microsoft Research
Functions

Neural Chat 7B

Latest
Intel
Text Local

Intel optimized chat model

Context
8K
Params
7
Cutoff
Jun 2023
License
Apache 2.0
Functions