AI Models Database
173 models across text, image, audio & embedding
Qwen 3.5 27B
LatestDense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.
GLM-4.7
LatestZ.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.
OLMo 3 32B
LatestAllen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.
DeepSeek V3.2
LatestDeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.
Arcee Trinity Mini
LatestArcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.
IBM Granite 4.0 32B
LatestIBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.
Phi-4 Mini Reasoning
LatestChain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.
Llama 4 Scout
LatestEfficient MoE multimodal model with industry-leading 10M token context window. 109B total / 17B active parameters across 16 experts. Native text + image understanding. Trained on 40T tokens in 12 languages.
Gemma 3 27B
LatestLargest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.
Mistral Small 3
LatestMistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.
Devstral Small
LatestEfficient local coding model. Optimized for code generation, completion, and development assistance.
Ministral 8B
LatestEfficient 8B model with vision support. Good balance for local deployment with moderate resources.
Yi Coder 9B
Latest01.AI coding model with 128K context
Dolphin 2.9 Llama3 8B
LatestFine-tuned Llama 3 without alignment restrictions
Codestral 22B
LatestMistral dedicated coding model, supports fill-in-the-middle
Command R 35B
LatestCohere enterprise model optimized for RAG workflows
Moondream 2
LatestTiny but capable vision model
StarCoder2 15B
LatestBigCode latest coding model with improved performance
LLaVA 1.6 13B
LatestLarger LLaVA with improved visual reasoning
SQLCoder 7B
LatestDefog SQL generation model, beats GPT-4 on SQL
Mixtral 8x7B
LatestMixture of Experts model with 8 experts, uses 12B active params
Orca 2 13B
LatestMicrosoft Orca with improved reasoning capabilities
Neural Chat 7B
LatestIntel optimized chat model