Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
30.8
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 141 | Gemini 2.5 Pro gemini-2.5-pro multimodalvisionmulti-input reasoning | Google | 27.6 Value / Price | 44.2 | 62.8 | 0.0 | 25.0 | 27.6 | $1.25 in / $10 out |
| 142 | Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 multimodalvisionmulti-input reasoning | Google | 27.6 Value / Price | 51.2 | 62.8 | 0.0 | 29.3 | 27.6 | |
| 143 | GPT-5.1 Thinking gpt-5.1-thinking-2025-11-12 multimodalvisionmulti-input reasoning | OpenAI | 27.0 Value / Price | 64.9 | 55.6 | 0.0 | 56.2 | 27.0 | |
| 144 | GPT-4o gpt-4o-2024-08-06 multimodalvisionmulti-input reasoning | OpenAI | 26.8 Value / Price | 31.5 | 46.7 | 14.9 | 4.3 | 26.8 | |
| 145 | Mistral Large 2 mistral-large-2-2407 textinference | Mistral AI | 26.7 Value / Price | 0.0 | 21.4 | 0.0 | 0.0 | 26.7 | $2 in / $6 out |
| 146 | GPT-4o gpt-4o-2024-05-13 multimodalvisionmulti-input reasoning | OpenAI | 26.5 Value / Price | 22.3 | 45.4 | 0.0 | 0.0 | 26.5 | |
| 147 | GPT-5.2 gpt-5.2-2025-12-11 multimodalvisionmulti-input reasoning | OpenAI | 26.5 Value / Price | 76.6 | 72.0 | 47.0 | 71.9 | 26.5 | |
| 148 | GPT-5.3 Chat gpt-5.3-chat-latest multimodalvisionmulti-input reasoning | OpenAI | 26.5 Value / Price | 0.0 | 52.7 | 0.0 | 0.0 | 26.5 | |
| 149 | Grok-2 grok-2 multimodalvisionmulti-input reasoning | xAI | 25.4 Value / Price | 27.1 | 38.3 | 0.0 | 0.0 | 25.4 | $2 in / $10 out |
| 150 | Jamba 1.5 Large jamba-1.5-large textinference | AI21 Labs | 25.2 Value / Price | 8.1 | 33.6 | 0.0 | 0.0 | 25.2 | $2 in / $8 out |
| 151 | GPT-5.1 Codex gpt-5.1-codex multimodalvisionmulti-input reasoning | OpenAI | 25.1 Value / Price | 0.0 | 49.0 | 0.0 | 50.0 | 25.1 | |
| 152 | GPT-5.1 Codex High gpt-5.1-codex-high multimodalvisionmulti-input reasoning | OpenAI | 25.1 Value / Price | 61.0 | 49.0 | 0.0 | 0.0 | 25.1 | |
| 153 | Claude 3.5 Sonnet claude-3-5-sonnet-20240620 multimodalvisionmulti-input reasoning | Anthropic | 24.6 Value / Price | 25.4 | 68.2 | 0.0 | 0.0 | 24.6 | |
| 154 | Claude 3.5 Sonnet claude-3-5-sonnet-20241022 multimodalvisionmulti-input reasoning | Anthropic | 24.6 Value / Price | 33.7 | 68.2 | 38.7 | 12.9 | 24.6 | |
| 155 | Gemini 1.5 Pro gemini-1.5-pro multimodalvisionmulti-input reasoning | Google | 24.3 Value / Price | 27.6 | 65.2 | 0.0 | 0.0 | 24.3 | |
| 156 | Grok-3 grok-3 multimodalvisionmulti-input reasoning | xAI | 22.7 Value / Price | 59.3 | 52.7 | 0.0 | 0.0 | 22.7 | $3 in / $15 out |
| 157 | Grok-4.1 grok-4.1-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 22.7 Value / Price | 0.0 | 64.9 | 0.0 | 0.0 | 22.7 | |
| 158 | Pixtral Large pixtral-large multimodalvisionmulti-input reasoning | Mistral AI | 22.4 Value / Price | 27.8 | 7.0 | 0.0 | 0.0 | 22.4 | |
| 159 | Gemini 3.1 Pro gemini-3.1-pro-preview multimodalvisionmulti-input reasoning | Google | 22.1 Value / Price | 74.4 | 67.1 | 71.7 | 66.1 | 22.1 | |
| 160 | GPT-5.2 Codex gpt-5.2-codex multimodalvisionmulti-input reasoning | OpenAI | 19.6 Value / Price | 0.0 | 49.0 | 0.0 | 44.1 | 19.6 |
Gemini 2.5 Pro
27.6
$1.25 in / $10 out
Gemini 2.5 Pro Preview 06-05
27.6
$1.25 in / $10 out
GPT-5.1 Thinking
OpenAI
27.0
$1.25 in / $10 out
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| $1.25 in / $10 out |
| $1.25 in / $10 out |
| $2.5 in / $10 out |
| $2.5 in / $10 out |
| $1.75 in / $14 out |
| $1.75 in / $14 out |
| $1.25 in / $10 out |
| $1.25 in / $10 out |
| $3 in / $15 out |
| $3 in / $15 out |
| $2.5 in / $10 out |
| $3 in / $15 out |
| $2 in / $6 out |
| $2.5 in / $15 out |
| $1.75 in / $14 out |
GPT-4o
OpenAI
26.8
$2.5 in / $10 out
Mistral Large 2
Mistral AI
26.7
$2 in / $6 out
GPT-4o
OpenAI
26.5
$2.5 in / $10 out
GPT-5.2
OpenAI
26.5
$1.75 in / $14 out
GPT-5.3 Chat
OpenAI
26.5
$1.75 in / $14 out
Grok-2
xAI
25.4
$2 in / $10 out
Jamba 1.5 Large
AI21 Labs
25.2
$2 in / $8 out
GPT-5.1 Codex
OpenAI
25.1
$1.25 in / $10 out
GPT-5.1 Codex High
OpenAI
25.1
$1.25 in / $10 out
Claude 3.5 Sonnet
Anthropic
24.6
$3 in / $15 out
Claude 3.5 Sonnet
Anthropic
24.6
$3 in / $15 out
Gemini 1.5 Pro
24.3
$2.5 in / $10 out
Grok-3
xAI
22.7
$3 in / $15 out
Grok-4.1
xAI
22.7
$3 in / $15 out
Pixtral Large
Mistral AI
22.4
$2 in / $6 out
Gemini 3.1 Pro
22.1
$2.5 in / $15 out
GPT-5.2 Codex
OpenAI
19.6
$1.75 in / $14 out