AI Models Database

Grok 4.20 Beta

Latest

xAI

Text API

Latest Grok model with 2M context window and fastest output speed of any tracked model (265 tok/s). Available in reasoning and non-reasoning variants.

Context

2.0M

Params

-

Input

$2.0000

Output

$6.0000

Vision Functions

8 in family
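
The Input and Output figures on the API cards can be turned into a per-request estimate. The sketch below assumes the prices are USD per 1,000,000 tokens, which the listing does not state; the function name `request_cost` and the token counts are hypothetical and chosen only for illustration.

```python
# Assumption: listed prices are USD per 1,000,000 tokens.
# The listing itself does not state the unit.
TOKENS_PER_PRICE_UNIT = 1_000_000


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Return the estimated USD cost of a single request."""
    input_cost = input_tokens / TOKENS_PER_PRICE_UNIT * input_price
    output_cost = output_tokens / TOKENS_PER_PRICE_UNIT * output_price
    return input_cost + output_cost


# Grok 4.20 Beta as listed: $2.00 input, $6.00 output.
# 10,000 prompt tokens and 2,000 completion tokens are made-up example sizes.
cost = request_cost(10_000, 2_000, input_price=2.00, output_price=6.00)
print(f"Estimated cost: ${cost:.4f}")
```

If the unit assumption is wrong, only `TOKENS_PER_PRICE_UNIT` needs to change.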

GPT-5.4

Latest

OpenAI

Text API

Latest GPT flagship model with 1M context window. Extended thinking, multimodal input, and document editing capabilities. Outperforms GPT-5.3 Codex on all benchmarks.

Context

1.0M

Params

-

Input

$2.5000

Output

$15.0000

Vision Functions

15 in family

Qwen 3.5 27B

Latest

Alibaba

Text Local

Dense hybrid Qwen 3.5 model with strong reasoning and multimodal capabilities. Best quality-to-size ratio in the family for single-GPU deployment.

Context

262K

Params

27

Cutoff

-

License

Apache 2.0

Vision Functions

13 in family
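
For local models, the Params figure gives a rough lower bound on memory. The sketch below estimates weight storage only, assuming Params is in billions and that each parameter takes a fixed number of bits; the listing does not state a precision, and the KV cache, activations, and runtime overhead are not included. The helper name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(params_billions: float, bits_per_param: int) -> float:
    """Approximate weight storage in decimal gigabytes (weights only).

    Assumes every parameter is stored at the same precision.
    """
    return params_billions * bits_per_param / 8


# Qwen 3.5 27B as listed (Params: 27), at a few common precisions.
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: {weight_memory_gb(27, bits):.1f} GB")
```

Whether a given figure fits on a single GPU depends on the card's memory and on the context length in use, so treat the result as a starting point rather than a guarantee.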

Gemini 3.1 Pro

Latest

Google

Text API

Latest Gemini Pro model with thinking capabilities, function calling, code execution, and search grounding. Supersedes Gemini 3 Pro.

Context

1.0M

Params

-

Input

$2.0000

Output

$12.0000

Vision Functions

12 in family

Qwen 3.5 Plus

Latest

Alibaba

Text API

Flagship Qwen model with 397B total parameters (17B active via MoE). Hybrid architecture with Gated Delta Networks and 512 experts. Supports 201 languages and thinking/non-thinking modes.

Context

1.0M

Params

397

Input

$0.4000

Output

$2.4000

Vision Functions

13 in family
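
Several mixture-of-experts cards in this listing quote both total and active parameter counts. As a small illustration, the sketch below computes the share of parameters active per token from those quoted figures; the helper name `active_fraction` is hypothetical, and the ratio describes parameter usage only, not measured speed or cost.

```python
def active_fraction(total_billions: float, active_billions: float) -> float:
    """Return the fraction of parameters active per token."""
    if total_billions <= 0:
        raise ValueError("total_billions must be positive")
    return active_billions / total_billions


# (total, active) in billions, as quoted in the card descriptions.
listed = {
    "Qwen 3.5 Plus": (397, 17),
    "DeepSeek V3.2": (685, 37),
    "Arcee Trinity Mini": (26, 3),
}

for name, (total, active) in listed.items():
    share = active_fraction(total, active) * 100
    print(f"{name}: {share:.1f}% of parameters active per token")
```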

Claude Opus 4.6

Latest

Anthropic

Text API

Most intelligent Claude model. Extended thinking with adaptive reasoning. Excels at complex analysis, nuanced content generation, advanced coding, and agentic tasks.

Context

1.0M

Params

-

Input

$5.0000

Output

$25.0000

Vision Functions MCP

12 in family

GLM-4.7

Latest

Zhipu AI

Text Local

Z.ai flagship open-source MoE model with 358B parameters, 200K context window, and 128K output. Excels at multilingual tasks, coding, and reasoning.

Context

200K

Params

358

Cutoff

-

License

Apache 2.0

Vision Functions

OLMo 3 32B

Latest

AI2

Text Local

Allen AI OLMo 3 32B - large scale fully open model. Complete transparency with training data, code, and intermediate checkpoints.

Context

66K

Params

32

Cutoff

-

License

Apache 2.0

Functions

2 in family

DeepSeek V3.2

Latest

DeepSeek

Text Local

DeepSeek V3.2 with 685B total parameters (37B active). Features Multi-Token Prediction and Sparse Attention for efficiency.

Context

128K

Params

685

Cutoff

-

License

MIT

Functions

13 in family

Arcee Trinity Mini

Latest

Arcee

Text Local

Arcee Trinity Mini with 26B total parameters but only 3B active (128 experts). Extremely efficient MoE architecture.

Context

128K

Params

26

Cutoff

-

License

Apache 2.0

Functions

IBM Granite 4.0 32B

Latest

IBM

Text Local

IBM Granite 4.0 32B flagship model. Hybrid Mamba-Transformer for efficient long-context processing.

Context

128K

Params

32

Cutoff

-

License

Apache 2.0

Functions

2 in family

DeepSeek V3.2 Exp

Latest

DeepSeek

Text API

Experimental DeepSeek V3.2 variant built on the DeepSeek Sparse Attention architecture.

Context

128K

Params

-

Input

$0.2700

Output

$1.1000

Functions

13 in family

Magistral Small

Latest

Mistral

Text API

Compact reasoning model for efficient multi-step problem solving.

Context

40K

Params

-

Input

$0.5000

Output

$1.5000

Functions

2 in family

Mistral Medium 3

Latest

Mistral

Text API

Frontier-class multimodal model. Scores at or above 90% of Claude Sonnet 3.7 on benchmarks. Vision capable with strong reasoning.

Context

40K

Params

-

Input

$0.4000

Output

$2.0000

Vision Functions

8 in family

Codestral

Latest

Mistral

Text API

Cutting-edge coding model with fill-in-the-middle (FIM) capability. Optimized for code generation, completion, and refactoring.

Context

128K

Params

-

Input

$0.3000

Output

$0.9000

Functions

3 in family

Phi-4 Mini Reasoning

Latest

Microsoft

Text Local

Chain-of-thought reasoning variant of Phi-4 Mini (3.8B). SFT on OpenAI o3-mini demonstrations + RL. 128K context makes it ideal for RAG. Best reasoning at sub-4B scale.

Context

131K

Params

4

Cutoff

-

License

MIT

Functions

6 in family

Llama 4 Maverick

Latest

Llama 4 Scout

Latest

Gemma 3 27B

Latest

Google

Text Local

Largest Gemma 3 model with multimodal vision capabilities. 128K context, 140+ languages. QAT variants available for 3x smaller footprint.

Context

131K

Params

27

Cutoff

-

License

Gemma Terms of Use

Vision Functions

7 in family

Mistral Small 3

Latest

Mistral

Text Local

Mistral Small 3 with 24B parameters. Supports vision, function calling, and 32K context. Apache 2.0 licensed.

Context

32K

Params

24

Cutoff

-

License

Apache 2.0

Vision Functions

8 in family

Devstral Small

Latest

Mistral

Text API

Efficient coding model for development workflows. Cost-effective option for code generation and assistance.

Context

128K

Params

-

Input

$0.1000

Output

$0.3000

Functions

3 in family

Devstral Small

Latest

Mistral

Text Local

Efficient local coding model. Optimized for code generation, completion, and development assistance.

Context

128K

Params

8

Cutoff

-

License

Apache 2.0

Functions

3 in family

Ministral 14B

Latest

Mistral

Text API

Mid-sized 14B parameter model with text and vision. Strong performance for diverse tasks while remaining efficient.

Context

128K

Params

-

Input

$0.2000

Output

$0.2000

Vision

5 in family

Ministral 8B

Latest

Mistral

Text Local

Efficient 8B model with vision support. Good balance for local deployment with moderate resources.

Context

128K

Params

8

Cutoff

-

License

Apache 2.0

Vision

5 in family

Pixtral Large

Latest

Mistral

Text API

Frontier multimodal vision model. Excellent for complex image understanding, document analysis, and visual reasoning.

Context

128K

Params

-

Input

$2.0000

Output

$6.0000

Vision Functions

2 in family

Yi Coder 9B

Latest

01.AI

Text Local

01.AI coding model with 128K context

Context

128K

Params

9

Cutoff

Mar 2024

License

Apache 2.0

Functions

Dolphin 2.9 Llama3 8B

Latest

Dolphin

Text Local

Fine-tuned Llama 3 without alignment restrictions

Context

8K

Params

8

Cutoff

Sep 2023

License

Llama 3

Functions

Codestral 22B

Latest

Mistral

Text Local

Mistral's dedicated coding model; supports fill-in-the-middle (FIM).

Context

32K

Params

22

Cutoff

Apr 2024

License

MNPL

Functions

3 in family

Command R+

Latest

Cohere

Text API

Cohere flagship for complex enterprise tasks

Context

128K

Params

-

Input

$3.0000

Output

$15.0000

Functions

3 in family

Command R 35B

Latest

Cohere

Text Local

Cohere enterprise model optimized for RAG workflows

Context

128K

Params

35

Cutoff

Jan 2024

License

CC-BY-NC-4.0

Functions

3 in family

Moondream 2

Latest

Moondream

Text Local

Tiny but capable vision model

Context

2K

Params

2

Cutoff

Sep 2023

License

Apache 2.0

Vision

StarCoder2 15B

Latest

Hugging Face

Text Local

BigCode's latest coding model, with improved performance over the original StarCoder.

Context

16K

Params

15

Cutoff

Jan 2024

License

BigCode OpenRAIL-M

Functions

LLaVA 1.6 13B

Latest

SQLCoder 7B

Latest

Defog

Text Local

Defog SQL generation model; reported by Defog to outperform GPT-4 on SQL generation.

Context

8K

Params

7

Cutoff

Sep 2023

License

Apache 2.0

Mixtral 8x7B

Latest

Mistral

Text Local

Mixture-of-experts model with 8 experts per layer; about 13B of its 47B parameters are active per token.

Context

32K

Params

47

Cutoff

Dec 2023

License

Apache 2.0

Functions

2 in family

Orca 2 13B

Latest

Microsoft

Text Local

Microsoft Orca with improved reasoning capabilities

Context

4K

Params

13

Cutoff

Jun 2023

License

Microsoft Research

Functions

Neural Chat 7B

Latest

Intel

Text Local

Intel optimized chat model

Context

8K

Params

7

Cutoff

Jun 2023

License

Apache 2.0

Functions

Auto Router

Latest

OpenRouter

Text API

Routes each request to the best-suited model for the task.

Context

128K

Params

-

Cutoff

-

License

Proprietary

Vision Functions

Mixtral 8x7B (Groq)

Latest

Groq

Text API

Mixtral on Groq LPU hardware

Context

33K

Params

-

Input

$0.2400

Output

$0.2400

Functions

2 in family

Sonar Reasoning

Latest

Perplexity

Text API

Chain-of-thought search model

Context

128K

Params

-

Input

$5.0000

Output

$5.0000

3 in family

Titan Text Premier

Latest

AWS Bedrock

Text API

Amazon proprietary model for AWS users

Context

32K

Params

-

Input

$0.5000

Output

$1.5000

Functions