AI Models Database

173 models across text, image, audio & embedding

60 models

Grok 4.20 Beta

Latest
xAI
Text API

Latest Grok model with 2M context window and fastest output speed of any tracked model (265 tok/s). Available in reasoning and non-reasoning variants.

Context
2.0M
Params
-
Input
$2.0000
Output
$6.0000
Vision Functions
8 in family

GPT-5.4

Latest
OpenAI
Text API

Latest GPT flagship model with 1M context window. Extended thinking, multimodal input, and document editing capabilities. Outperforms GPT-5.3 Codex on all benchmarks.

Context
1.0M
Params
-
Input
$2.5000
Output
$15.0000
Vision Functions
15 in family

Qwen 3.5 27B

Latest
Alibaba
Text Local

Dense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.

Context
262K
Params
27
Cutoff
-
License
Apache 2.0
Vision Functions
13 in family

Gemini 3.1 Pro

Latest
Google
Text API

Latest Gemini Pro model with thinking capabilities, function calling, code execution, and search grounding. Supersedes Gemini 3 Pro.

Context
1.0M
Params
-
Input
$2.0000
Output
$12.0000
Vision Functions
12 in family

Qwen 3.5 Plus

Latest
Alibaba
Text API

Flagship Qwen model with 397B total parameters (17B active via MoE). Hybrid architecture with Gated Delta Networks and 512 experts. Supports 201 languages and thinking/non-thinking modes.

Context
1.0M
Params
397
Input
$0.4000
Output
$2.4000
Vision Functions
13 in family

Claude Opus 4.6

Latest
Anthropic
Text API

Most intelligent Claude model. Extended thinking with adaptive reasoning. Excels at complex analysis, nuanced content generation, advanced coding, and agentic tasks.

Context
1.0M
Params
-
Input
$5.0000
Output
$25.0000
Vision Functions MCP
12 in family

FLUX.2 Klein 4B

Latest
Black Forest Labs
Image Local

Sub-second image generation on consumer GPUs. Unified text-to-image and image editing in one checkpoint. Apache 2.0 licensed for commercial use. Best open fast image model.

Type
Image
Source
Local
Released
Jan 2026
7 in family

GLM-4.7

Latest
Zhipu AI
Text Local

Z.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.

Context
200K
Params
358
Cutoff
-
License
Apache 2.0
Vision Functions

OLMo 3 32B

Latest
AI2
Text Local

Allen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.

Context
66K
Params
32
Cutoff
-
License
Apache 2.0
Functions
2 in family

DeepSeek V3.2

Latest
DeepSeek
Text Local

DeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.

Context
128K
Params
685
Cutoff
-
License
MIT
Functions
13 in family

Arcee Trinity Mini

Latest
Arcee
Text Local

Arcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.

Context
128K
Params
26
Cutoff
-
License
Apache 2.0
Functions

IBM Granite 4.0 32B

Latest
IBM
Text Local

IBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.

Context
128K
Params
32
Cutoff
-
License
Apache 2.0
Functions
2 in family

DeepSeek V3.2 Exp

Latest
DeepSeek
Text API

DeepSeek Sparse Attention architecture

Context
128K
Params
-
Input
$0.2700
Output
$1.1000
Functions
13 in family

Magistral Small

Latest
Mistral
Text API

Compact reasoning model for efficient multi-step problem solving.

Context
40K
Params
-
Input
$0.5000
Output
$1.5000
Functions
2 in family

Imagen 4 Fast

Latest
Google
Image API

Speed-optimized variant of Imagen 4 for rapid image generation and high-volume tasks. Priced at $0.02 per image.

Type
Image
Source
API
Released
Aug 2025
3 in family

Mistral Medium 3

Latest
Mistral
Text API

Frontier-class multimodal model. Performs at or above 90% of Claude Sonnet 3.7. Vision capable with strong reasoning.

Context
40K
Params
-
Input
$0.4000
Output
$2.0000
Vision Functions
8 in family

Codestral

Latest
Mistral
Text API

Cutting-edge coding model with fill-in-the-middle (FIM) capability. Optimized for code generation, completion, and refactoring.

Context
128K
Params
-
Input
$0.3000
Output
$0.9000
Functions
3 in family

FLUX.1 Kontext Dev

Latest
Black Forest Labs
Image API

Open-weight version of FLUX Kontext for research and development. Available on Hugging Face for local deployment.

Type
Image
Source
API
Released
May 2025
Vision
7 in family

Phi-4 Mini Reasoning

Latest
Microsoft
Text Local

Chain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.

Context
131K
Params
4
Cutoff
-
License
MIT
Functions
6 in family

Llama 4 Maverick

Latest
Meta
Text API

17B active params, 400B total, multimodal

Context
1.0M
Params
-
Input
$1.0000
Output
$3.0000
Vision Functions
8 in family

Llama 4 Scout

Latest
Meta
Text Local

Efficient MoE multimodal model with industry-leading 10M token context window. 109B total / 17B active parameters across 16 experts. Native text + image understanding. Trained on 40T tokens in 12 languages.

Context
10.0M
Params
109
Cutoff
-
License
Llama 4 Community
Vision Functions
8 in family

Midjourney V7

Latest
Midjourney
Image API

Midjourney's latest model with new architecture. Features improved text prompt understanding, richer textures, better anatomy rendering, and personalization by default. Includes Draft Mode (10x faster) and Omni Reference.

Type
Image
Source
API
Released
Apr 2025

Ideogram 3.0

Latest
Ideogram
Image API

Most powerful Ideogram model with major leap in visual quality, realism, and creative control. Known for exceptional text rendering in images.

Type
Image
Source
API
Released
Mar 2025
2 in family

Gemma 3 27B

Latest
Google
Text Local

Largest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.

Context
131K
Params
27
Cutoff
-
License
Gemma Terms of Use
Vision Functions
7 in family

Dia 1.6B

Latest
Nari Labs
TTS Local

Multi-speaker dialogue TTS with non-verbal sounds (laughs, sighs, coughs). Voice cloning via audio prompt conditioning. Best model for scripted dialogue and podcast generation.

Source
Local
Released
Mar 2025

Nomic Embed Text V2 MoE

Latest
Nomic AI
Embedding Local

First open-source MoE text embedding model. 475M total / 305M active parameters. Matryoshka flexible dimensions (256-768). State-of-the-art multilingual embedding at release.

Context
-
Params
475
License
Apache 2.0
2 in family

Mistral Small 3

Latest
Mistral
Text Local

Mistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.

Context
32K
Params
24
Cutoff
-
License
Apache 2.0
Vision Functions
8 in family

Kokoro 82M

Latest
hexgrad
TTS Local

Ultra-lightweight TTS model. Under $1 per million characters. 54 pre-built voices across 8 languages. Apache 2.0 for commercial deployment. 8.9M+ monthly HuggingFace downloads.

Source
Local
Released
Jan 2025

Devstral Small

Latest
Mistral
Text API

Efficient coding model for development workflows. Cost-effective option for code generation and assistance.

Context
128K
Params
-
Input
$0.1000
Output
$0.3000
Functions
3 in family

Devstral Small

Latest
Mistral
Text Local

Efficient local coding model. Optimized for code generation, completion, and development assistance.

Context
128K
Params
8
Cutoff
-
License
Apache 2.0
Functions
3 in family

Ministral 14B

Latest
Mistral
Text API

Mid-sized 14B parameter model with text and vision. Strong performance for diverse tasks while remaining efficient.

Context
128K
Params
-
Input
$0.2000
Output
$0.2000
Vision
5 in family

Ministral 8B

Latest
Mistral
Text Local

Efficient 8B model with vision support. Good balance for local deployment with moderate resources.

Context
128K
Params
8
Cutoff
-
License
Apache 2.0
Vision
5 in family

Pixtral Large

Latest
Mistral
Text API

Frontier multimodal vision model. Excellent for complex image understanding, document analysis, and visual reasoning.

Context
128K
Params
-
Input
$2.0000
Output
$6.0000
Vision Functions
2 in family

Recraft V3

Latest
Recraft
Image API

Top-ranked model on Hugging Face Text-to-Image Leaderboard (ELO 1172). Only model capable of generating images with long texts. Supports both raster and vector image generation.

Type
Image
Source
API
Released
Oct 2024

Stable Diffusion 3.5 Medium

Latest
Stability AI
Image API

2.5 billion parameter model with improved MMDiT-X architecture. Designed for consumer hardware, requiring only 9.9 GB VRAM.

Type
Image
Source
API
Released
Oct 2024
4 in family

Whisper Large V3 Turbo

Latest
OpenAI
STT Local

4.5x faster than Whisper Large V3 with minimal quality loss. Decoder reduced from 32 to 4 layers. Most-downloaded Whisper variant (4.6M+ monthly). Best speed/accuracy balance for local STT.

Source
Local
Released
Oct 2024
2 in family

Yi Coder 9B

Latest
01.AI
Text Local

01.AI coding model with 128K context

Context
128K
Params
9
Cutoff
Mar 2024
License
Apache 2.0
Functions

Dolphin 2.9 Llama3 8B

Latest
Dolphin
Text Local

Fine-tuned Llama 3 without alignment restrictions

Context
8K
Params
8
Cutoff
Sep 2023
License
Llama 3
Functions

Codestral 22B

Latest
Mistral
Text Local

Mistral dedicated coding model, supports fill-in-the-middle

Context
32K
Params
22
Cutoff
Apr 2024
License
MNPL
Functions
3 in family

mxbai-embed-large

Latest
Mixedbread AI
Embedding Local

High-quality embeddings for semantic search

Context
1K
Params
335
License
Apache 2.0

Command R+

Latest
Cohere
Text API

Cohere flagship for complex enterprise tasks

Context
128K
Params
-
Input
$3.0000
Output
$15.0000
Functions
3 in family

Command R 35B

Latest
Cohere
Text Local

Cohere enterprise model optimized for RAG workflows

Context
128K
Params
35
Cutoff
Jan 2024
License
CC-BY-NC-4.0
Functions
3 in family

Moondream 2

Latest
Moondream
Text Local

Tiny but capable vision model

Context
2K
Params
2
Cutoff
Sep 2023
License
Apache 2.0
Vision

StarCoder2 15B

Latest
Hugging Face
Text Local

BigCode latest coding model with improved performance

Context
16K
Params
15
Cutoff
Jan 2024
License
BigCode OpenRAIL-M
Functions

nomic-embed-text

Latest
Nomic AI
Embedding Local

768-dimensional text embedding model optimized for semantic similarity search. ~1-2s per embedding on CPU. Used for ProHive RAG system.

Context
8K
Params
137
License
Apache 2.0
2 in family

BGE Large

Latest
BAAI
Embedding Local

BAAI embedding model for RAG applications

Context
1K
Params
335
License
MIT
2 in family

LLaVA 1.6 13B

Latest
Meta
Text Local

Larger LLaVA with improved visual reasoning

Context
4K
Params
13
Cutoff
Jan 2024
License
Apache 2.0
Vision
2 in family

SQLCoder 7B

Latest
Defog
Text Local

Defog SQL generation model, beats GPT-4 on SQL

Context
8K
Params
7
Cutoff
Sep 2023
License
Apache 2.0

Mixtral 8x7B

Latest
Mistral
Text Local

Mixture of Experts model with 8 experts, uses 12B active params

Context
32K
Params
47
Cutoff
Dec 2023
License
Apache 2.0
Functions
2 in family

Orca 2 13B

Latest
Microsoft
Text Local

Microsoft Orca with improved reasoning capabilities

Context
4K
Params
13
Cutoff
Jun 2023
License
Microsoft Research
Functions

DALL-E 3

Latest
OpenAI
Image API

OpenAI's text-to-image model integrated natively into ChatGPT. Significantly improved prompt understanding and text rendering compared to DALL-E 2.

Type
Image
Source
API
Released
Oct 2023

Neural Chat 7B

Latest
Intel
Text Local

Intel optimized chat model

Context
8K
Params
7
Cutoff
Jun 2023
License
Apache 2.0
Functions

all-MiniLM-L6-v2

Latest
Hugging Face
Embedding Local

Lightweight embedding model for quick inference

Context
0K
Params
23
License
Apache 2.0

Auto Router

Latest
OpenRouter
Text API

Intelligent routing to best model for task

Context
128K
Params
-
Cutoff
-
License
Proprietary
Vision Functions

ElevenLabs v2

Latest
ElevenLabs
TTS API

High-quality voice synthesis

Source
API
Released
-

Embed v3

Latest
Cohere
Embedding API

State-of-the-art embeddings for search

Context
1K
Params
-
License
Proprietary

Mixtral 8x7B (Groq)

Latest
Groq
Text API

Mixtral on Groq LPU hardware

Context
33K
Params
-
Input
$0.2400
Output
$0.2400
Functions
2 in family

Sonar Reasoning

Latest
Perplexity
Text API

Chain-of-thought search model

Context
128K
Params
-
Input
$5.0000
Output
$5.0000
3 in family

Titan Text Premier

Latest
AWS Bedrock
Text API

Amazon proprietary model for AWS users

Context
32K
Params
-
Input
$0.5000
Output
$1.5000
Functions

Whisper Large v3

Latest
OpenAI
STT API

Best-in-class transcription model

Source
API
Released
-
2 in family