AI Models Database
173 models across text, image, audio & embedding
Grok 4.20 Beta
LatestLatest Grok model with 2M context window and fastest output speed of any tracked model (265 tok/s). Available in reasoning and non-reasoning variants.
GPT-5.4
LatestLatest GPT flagship model with 1M context window. Extended thinking, multimodal input, and document editing capabilities. Outperforms GPT-5.3 Codex on all benchmarks.
Gemini 3.1 Pro
LatestLatest Gemini Pro model with thinking capabilities, function calling, code execution, and search grounding. Supersedes Gemini 3 Pro.
Qwen 3.5 Plus
LatestFlagship Qwen model with 397B total parameters (17B active via MoE). Hybrid architecture with Gated Delta Networks and 512 experts. Supports 201 languages and thinking/non-thinking modes.
Claude Opus 4.6
LatestMost intelligent Claude model. Extended thinking with adaptive reasoning. Excels at complex analysis, nuanced content generation, advanced coding, and agentic tasks.
DeepSeek V3.2 Exp
LatestDeepSeek Sparse Attention architecture
Magistral Small
LatestCompact reasoning model for efficient multi-step problem solving.
Imagen 4 Fast
LatestSpeed-optimized variant of Imagen 4 for rapid image generation and high-volume tasks. Priced at $0.02 per image.
Mistral Medium 3
LatestFrontier-class multimodal model. Performs at or above 90% of Claude Sonnet 3.7. Vision capable with strong reasoning.
Codestral
LatestCutting-edge coding model with fill-in-the-middle (FIM) capability. Optimized for code generation, completion, and refactoring.
FLUX.1 Kontext Dev
LatestOpen-weight version of FLUX Kontext for research and development. Available on Hugging Face for local deployment.
Llama 4 Maverick
Latest17B active params, 400B total, multimodal
Midjourney V7
LatestMidjourney's latest model with new architecture. Features improved text prompt understanding, richer textures, better anatomy rendering, and personalization by default. Includes Draft Mode (10x faster) and Omni Reference.
Ideogram 3.0
LatestMost powerful Ideogram model with major leap in visual quality, realism, and creative control. Known for exceptional text rendering in images.
Devstral Small
LatestEfficient coding model for development workflows. Cost-effective option for code generation and assistance.
Ministral 14B
LatestMid-sized 14B parameter model with text and vision. Strong performance for diverse tasks while remaining efficient.
Pixtral Large
LatestFrontier multimodal vision model. Excellent for complex image understanding, document analysis, and visual reasoning.
Recraft V3
LatestTop-ranked model on Hugging Face Text-to-Image Leaderboard (ELO 1172). Only model capable of generating images with long texts. Supports both raster and vector image generation.
Stable Diffusion 3.5 Medium
Latest2.5 billion parameter model with improved MMDiT-X architecture. Designed for consumer hardware, requiring only 9.9 GB VRAM.
Command R+
LatestCohere flagship for complex enterprise tasks
DALL-E 3
LatestOpenAI's text-to-image model integrated natively into ChatGPT. Significantly improved prompt understanding and text rendering compared to DALL-E 2.
Auto Router
LatestIntelligent routing to best model for task
ElevenLabs v2
LatestHigh-quality voice synthesis
Embed v3
LatestState-of-the-art embeddings for search
Mixtral 8x7B (Groq)
LatestMixtral on Groq LPU hardware
Sonar Reasoning
LatestChain-of-thought search model
Titan Text Premier
LatestAmazon proprietary model for AWS users
Whisper Large v3
LatestBest-in-class transcription model