AI Models Database

173 models across text, image, audio & embedding

28 models shown

Grok 4.20 Beta

Latest
xAI
Text API

Latest Grok model with a 2M context window and the fastest output speed of any tracked model (265 tok/s). Available in reasoning and non-reasoning variants.

Context
2.0M
Params
-
Input
$2.0000
Output
$6.0000
Vision Functions
8 in family
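
As a rough illustration of how the Input, Output, and speed figures on these cards can be combined, the sketch below estimates the cost and generation time of a single request. It assumes the listed prices are US dollars per one million tokens, which the listing itself does not state; the function names and the token counts are hypothetical.

```python
def estimate_cost_usd(input_tokens, output_tokens, input_price, output_price,
                      per_tokens=1_000_000):
    """Estimate request cost, assuming prices are quoted per `per_tokens` tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / per_tokens


def generation_seconds(output_tokens, tokens_per_second):
    """Estimate time to stream the output at a steady tokens-per-second rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Card values for Grok 4.20 Beta: Input $2.00, Output $6.00, 265 tok/s.
# The 10,000 / 2,000 token counts are made-up example values.
grok_cost = estimate_cost_usd(10_000, 2_000, input_price=2.00, output_price=6.00)
grok_time = generation_seconds(2_000, 265)

print(f"estimated cost: ${grok_cost:.4f}")
print(f"estimated generation time: {grok_time:.1f} s")
```

The same two helpers apply to any text card that lists Input and Output prices; only the per-million-token unit is an assumption that should be checked against the provider's pricing page.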

GPT-5.4

Latest
OpenAI
Text API

Latest GPT flagship model with a 1M context window. Supports extended thinking, multimodal input, and document editing. Outperforms GPT-5.3 Codex on all benchmarks.

Context
1.0M
Params
-
Input
$2.5000
Output
$15.0000
Vision Functions
15 in family

Gemini 3.1 Pro

Latest
Google
Text API

Latest Gemini Pro model with thinking capabilities, function calling, code execution, and search grounding. Supersedes Gemini 3 Pro.

Context
1.0M
Params
-
Input
$2.0000
Output
$12.0000
Vision Functions
12 in family

Qwen 3.5 Plus

Latest
Alibaba
Text API

Flagship Qwen model with 397B total parameters (17B active via MoE). Hybrid architecture with Gated Delta Networks and 512 experts. Supports 201 languages and thinking/non-thinking modes.

Context
1.0M
Params
397B
Input
$0.4000
Output
$2.4000
Vision Functions
13 in family
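
To make the mixture-of-experts figures on the Qwen 3.5 Plus card concrete, the short sketch below computes what share of the total parameters is active per token, using the 397B total and 17B active values from the description. The function name is hypothetical, and the calculation says nothing about how many of the 512 experts are selected.

```python
def active_fraction(active_params_b, total_params_b):
    """Return the share of parameters active per token in an MoE model.

    Both arguments are in billions of parameters.
    """
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# Values from the Qwen 3.5 Plus card: 17B active out of 397B total.
qwen_fraction = active_fraction(17, 397)
print(f"active share: {qwen_fraction:.1%}")  # prints "active share: 4.3%"
```

A small active share is why an MoE model's per-token compute tracks the active count rather than the total, while memory requirements still follow the total.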

Claude Opus 4.6

Latest
Anthropic
Text API

Most intelligent Claude model. Extended thinking with adaptive reasoning. Excels at complex analysis, nuanced content generation, advanced coding, and agentic tasks.

Context
1.0M
Params
-
Input
$5.0000
Output
$25.0000
Vision Functions MCP
12 in family

DeepSeek V3.2 Exp

Latest
DeepSeek
Text API

Experimental model built on the DeepSeek Sparse Attention architecture.

Context
128K
Params
-
Input
$0.2700
Output
$1.1000
Functions
13 in family

Magistral Small

Latest
Mistral
Text API

Compact reasoning model for efficient multi-step problem solving.

Context
40K
Params
-
Input
$0.5000
Output
$1.5000
Functions
2 in family

Imagen 4 Fast

Latest
Google
Image API

Speed-optimized variant of Imagen 4 for rapid image generation and high-volume tasks. Priced at $0.02 per image.

Type
Image
Source
API
Released
Aug 2025
3 in family

Mistral Medium 3

Latest
Mistral
Text API

Frontier-class multimodal model. Scores at or above 90% of Claude Sonnet 3.7 on benchmarks. Vision-capable with strong reasoning.

Context
40K
Params
-
Input
$0.4000
Output
$2.0000
Vision Functions
8 in family

Codestral

Latest
Mistral
Text API

Cutting-edge coding model with fill-in-the-middle (FIM) capability. Optimized for code generation, completion, and refactoring.

Context
128K
Params
-
Input
$0.3000
Output
$0.9000
Functions
3 in family

FLUX.1 Kontext Dev

Latest
Black Forest Labs
Image API

Open-weight version of FLUX Kontext for research and development. Available on Hugging Face for local deployment.

Type
Image
Source
API
Released
May 2025
Vision
7 in family

Llama 4 Maverick

Latest
Meta
Text API

Multimodal mixture-of-experts model with 17B active parameters out of 400B total.

Context
1.0M
Params
-
Input
$1.0000
Output
$3.0000
Vision Functions
8 in family

Midjourney V7

Latest
Midjourney
Image API

Midjourney's latest model, built on a new architecture. Features improved text prompt understanding, richer textures, better anatomy rendering, and personalization by default. Includes Draft Mode (10x faster) and Omni Reference.

Type
Image
Source
API
Released
Apr 2025

Ideogram 3.0

Latest
Ideogram
Image API

Most powerful Ideogram model, with a major leap in visual quality, realism, and creative control. Known for exceptional text rendering in images.

Type
Image
Source
API
Released
Mar 2025
2 in family

Devstral Small

Latest
Mistral
Text API

Efficient coding model for development workflows. Cost-effective option for code generation and assistance.

Context
128K
Params
-
Input
$0.1000
Output
$0.3000
Functions
3 in family

Ministral 14B

Latest
Mistral
Text API

Mid-sized 14B-parameter model with text and vision support. Strong performance on diverse tasks while remaining efficient.

Context
128K
Params
-
Input
$0.2000
Output
$0.2000
Vision
5 in family

Pixtral Large

Latest
Mistral
Text API

Frontier multimodal vision model. Excellent for complex image understanding, document analysis, and visual reasoning.

Context
128K
Params
-
Input
$2.0000
Output
$6.0000
Vision Functions
2 in family

Recraft V3

Latest
Recraft
Image API

Top-ranked model on the Hugging Face Text-to-Image Leaderboard (ELO 1172) at release. Described by Recraft as the only model capable of generating images containing long text. Supports both raster and vector image generation.

Type
Image
Source
API
Released
Oct 2024

Stable Diffusion 3.5 Medium

Latest
Stability AI
Image API

2.5-billion-parameter model with an improved MMDiT-X architecture. Designed for consumer hardware, requiring only 9.9 GB of VRAM.

Type
Image
Source
API
Released
Oct 2024
4 in family

Command R+

Latest
Cohere
Text API

Cohere's flagship model for complex enterprise tasks.

Context
128K
Params
-
Input
$3.0000
Output
$15.0000
Functions
3 in family

DALL-E 3

Latest
OpenAI
Image API

OpenAI's text-to-image model integrated natively into ChatGPT. Significantly improved prompt understanding and text rendering compared to DALL-E 2.

Type
Image
Source
API
Released
Oct 2023

Auto Router

Latest
OpenRouter
Text API

Intelligent routing to the best model for each task.

Context
128K
Params
-
Cutoff
-
License
Proprietary
Vision Functions

ElevenLabs v2

Latest
ElevenLabs
TTS API

High-quality voice synthesis

Source
API
Released
-

Embed v3

Latest
Cohere
Embedding API

State-of-the-art embeddings for search

Context
1K
Params
-
License
Proprietary

Mixtral 8x7B (Groq)

Latest
Groq
Text API

Mixtral 8x7B served on Groq LPU hardware.

Context
33K
Params
-
Input
$0.2400
Output
$0.2400
Functions
2 in family

Sonar Reasoning

Latest
Perplexity
Text API

Chain-of-thought search model

Context
128K
Params
-
Input
$5.0000
Output
$5.0000
3 in family

Titan Text Premier

Latest
AWS Bedrock
Text API

Amazon's proprietary model for AWS users.

Context
32K
Params
-
Input
$0.5000
Output
$1.5000
Functions

Whisper Large v3

Latest
OpenAI
STT API

Best-in-class transcription model

Source
API
Released
-
2 in family