AI Models Database

173 models across text, image, audio & embedding

All 173 Text 142 Image 19 Embedding 7 TTS 3 STT 2

23 models

Qwen 3.5 27B

Dense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.

Vision Functions

GLM-4.7

Z.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.

Vision Functions

OLMo 3 32B

Allen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.

DeepSeek V3.2

DeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.

Arcee Trinity Mini

Arcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.

IBM Granite 4.0 32B

IBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.

Phi-4 Mini Reasoning

Chain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.

Llama 4 Scout

Efficient MoE multimodal model with industry-leading 10M token context window. 109B total / 17B active parameters across 16 experts. Native text + image understanding. Trained on 40T tokens in 12 languages.

Llama 4 Community

Vision Functions

Gemma 3 27B

Largest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.

Gemma Terms of Use

Vision Functions

Mistral Small 3

Mistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.

Vision Functions

Devstral Small

Efficient local coding model. Optimized for code generation, completion, and development assistance.

Ministral 8B

Efficient 8B model with vision support. Good balance for local deployment with moderate resources.

Yi Coder 9B

01.AI coding model with 128K context

Dolphin 2.9 Llama3 8B

Fine-tuned Llama 3 without alignment restrictions

Codestral 22B

Mistral dedicated coding model, supports fill-in-the-middle

Command R 35B

Cohere enterprise model optimized for RAG workflows

Moondream 2

Tiny but capable vision model

StarCoder2 15B

BigCode latest coding model with improved performance

BigCode OpenRAIL-M

LLaVA 1.6 13B

Larger LLaVA with improved visual reasoning

SQLCoder 7B

Defog SQL generation model, beats GPT-4 on SQL

Mixtral 8x7B

Mixture of Experts model with 8 experts, uses 12B active params

Orca 2 13B

Microsoft Orca with improved reasoning capabilities

Microsoft Research

Neural Chat 7B

Intel optimized chat model