AI Models Database
173 models across text, image, audio & embedding
Grok 4.20 Beta
LatestLatest Grok model with 2M context window and fastest output speed of any tracked model (265 tok/s). Available in reasoning and non-reasoning variants.
GPT-5.4
LatestLatest GPT flagship model with 1M context window. Extended thinking, multimodal input, and document editing capabilities. Outperforms GPT-5.3 Codex on all benchmarks.
Qwen 3.5 27B
LatestDense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.
Gemini 3.1 Pro
LatestLatest Gemini Pro model with thinking capabilities, function calling, code execution, and search grounding. Supersedes Gemini 3 Pro.
Qwen 3.5 Plus
LatestFlagship Qwen model with 397B total parameters (17B active via MoE). Hybrid architecture with Gated Delta Networks and 512 experts. Supports 201 languages and thinking/non-thinking modes.
Claude Opus 4.6
LatestMost intelligent Claude model. Extended thinking with adaptive reasoning. Excels at complex analysis, nuanced content generation, advanced coding, and agentic tasks.
FLUX.2 Klein 4B
LatestSub-second image generation on consumer GPUs. Unified text-to-image and image editing in one checkpoint. Apache 2.0 licensed for commercial use. Best open fast image model.
GLM-4.7
LatestZ.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.
OLMo 3 32B
LatestAllen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.
DeepSeek V3.2
LatestDeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.
Arcee Trinity Mini
LatestArcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.
IBM Granite 4.0 32B
LatestIBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.
DeepSeek V3.2 Exp
LatestDeepSeek Sparse Attention architecture
Magistral Small
LatestCompact reasoning model for efficient multi-step problem solving.
Imagen 4 Fast
LatestSpeed-optimized variant of Imagen 4 for rapid image generation and high-volume tasks. Priced at $0.02 per image.
Mistral Medium 3
LatestFrontier-class multimodal model. Performs at or above 90% of Claude Sonnet 3.7. Vision capable with strong reasoning.
Codestral
LatestCutting-edge coding model with fill-in-the-middle (FIM) capability. Optimized for code generation, completion, and refactoring.
FLUX.1 Kontext Dev
LatestOpen-weight version of FLUX Kontext for research and development. Available on Hugging Face for local deployment.
Phi-4 Mini Reasoning
LatestChain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.
Llama 4 Maverick
Latest17B active params, 400B total, multimodal
Llama 4 Scout
LatestEfficient MoE multimodal model with industry-leading 10M token context window. 109B total / 17B active parameters across 16 experts. Native text + image understanding. Trained on 40T tokens in 12 languages.
Midjourney V7
LatestMidjourney's latest model with new architecture. Features improved text prompt understanding, richer textures, better anatomy rendering, and personalization by default. Includes Draft Mode (10x faster) and Omni Reference.
Ideogram 3.0
LatestMost powerful Ideogram model with major leap in visual quality, realism, and creative control. Known for exceptional text rendering in images.
Gemma 3 27B
LatestLargest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.
Dia 1.6B
LatestMulti-speaker dialogue TTS with non-verbal sounds (laughs, sighs, coughs). Voice cloning via audio prompt conditioning. Best model for scripted dialogue and podcast generation.
Nomic Embed Text V2 MoE
LatestFirst open-source MoE text embedding model. 475M total / 305M active parameters. Matryoshka flexible dimensions (256-768). State-of-the-art multilingual embedding at release.
Mistral Small 3
LatestMistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.
Kokoro 82M
LatestUltra-lightweight TTS model. Under $1 per million characters. 54 pre-built voices across 8 languages. Apache 2.0 for commercial deployment. 8.9M+ monthly HuggingFace downloads.
Devstral Small
LatestEfficient coding model for development workflows. Cost-effective option for code generation and assistance.
Devstral Small
LatestEfficient local coding model. Optimized for code generation, completion, and development assistance.
Ministral 14B
LatestMid-sized 14B parameter model with text and vision. Strong performance for diverse tasks while remaining efficient.
Ministral 8B
LatestEfficient 8B model with vision support. Good balance for local deployment with moderate resources.
Pixtral Large
LatestFrontier multimodal vision model. Excellent for complex image understanding, document analysis, and visual reasoning.
Recraft V3
LatestTop-ranked model on Hugging Face Text-to-Image Leaderboard (ELO 1172). Only model capable of generating images with long texts. Supports both raster and vector image generation.
Stable Diffusion 3.5 Medium
Latest2.5 billion parameter model with improved MMDiT-X architecture. Designed for consumer hardware, requiring only 9.9 GB VRAM.
Whisper Large V3 Turbo
Latest4.5x faster than Whisper Large V3 with minimal quality loss. Decoder reduced from 32 to 4 layers. Most-downloaded Whisper variant (4.6M+ monthly). Best speed/accuracy balance for local STT.
Yi Coder 9B
Latest01.AI coding model with 128K context
Dolphin 2.9 Llama3 8B
LatestFine-tuned Llama 3 without alignment restrictions
Codestral 22B
LatestMistral dedicated coding model, supports fill-in-the-middle
mxbai-embed-large
LatestHigh-quality embeddings for semantic search
Command R+
LatestCohere flagship for complex enterprise tasks
Command R 35B
LatestCohere enterprise model optimized for RAG workflows
Moondream 2
LatestTiny but capable vision model
StarCoder2 15B
LatestBigCode latest coding model with improved performance
nomic-embed-text
Latest768-dimensional text embedding model optimized for semantic similarity search. ~1-2s per embedding on CPU. Used for ProHive RAG system.
BGE Large
LatestBAAI embedding model for RAG applications
LLaVA 1.6 13B
LatestLarger LLaVA with improved visual reasoning
SQLCoder 7B
LatestDefog SQL generation model, beats GPT-4 on SQL
Mixtral 8x7B
LatestMixture of Experts model with 8 experts, uses 12B active params
Orca 2 13B
LatestMicrosoft Orca with improved reasoning capabilities
DALL-E 3
LatestOpenAI's text-to-image model integrated natively into ChatGPT. Significantly improved prompt understanding and text rendering compared to DALL-E 2.
Neural Chat 7B
LatestIntel optimized chat model
all-MiniLM-L6-v2
LatestLightweight embedding model for quick inference
Auto Router
LatestIntelligent routing to best model for task
ElevenLabs v2
LatestHigh-quality voice synthesis
Embed v3
LatestState-of-the-art embeddings for search
Mixtral 8x7B (Groq)
LatestMixtral on Groq LPU hardware
Sonar Reasoning
LatestChain-of-thought search model
Titan Text Premier
LatestAmazon proprietary model for AWS users
Whisper Large v3
LatestBest-in-class transcription model