Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
30.8
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 161 | GPT-5.3 Codex gpt-5.3-codex texttext-to-textcoding | OpenAI | 19.6 Value / Price | 0.0 | 49.0 | 0.0 | 52.2 | 19.6 | $1.75 in / $14 out |
| 162 | Claude 3 Opus claude-3-opus-20240229 multimodalvisionmulti-input reasoning | Anthropic | 19.5 Value / Price | 19.3 | 71.7 | 0.0 | 0.0 | 19.5 | |
| 163 | GPT-4 Turbo gpt-4-turbo-2024-04-09 textinference | OpenAI | 18.8 Value / Price | 16.9 | 52.7 | 0.0 | 0.0 | 18.8 | $10 in / $30 out |
| 164 | GPT-4 gpt-4-0613 multimodalvisionmulti-input reasoning | OpenAI | 18.7 Value / Price | 6.8 | 54.9 | 0.0 | 0.0 | 18.7 | $30 in / $60 out |
| 165 | GPT-5.4 gpt-5.4 texttext-to-textlanguage | OpenAI | 18.3 Value / Price | 75.9 | 51.5 | 61.8 | 63.9 | 18.3 | |
| 166 | Grok-4.1 Thinking grok-4.1-thinking-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 17.8 Value / Price | 0.0 | 48.5 | 0.0 | 0.0 | 17.8 | |
| 167 | Claude 3.7 Sonnet claude-3-7-sonnet-20250219 multimodalvisionmulti-input reasoning | Anthropic | 13.3 Value / Price | 43.5 | 30.5 | 49.0 | 39.6 | 13.3 | |
| 168 | Claude 3 Sonnet claude-3-sonnet-20240229 multimodalvisionmulti-input reasoning | Anthropic | 13.3 Value / Price | 10.0 | 30.5 | 0.0 | 0.0 | 13.3 | |
| 169 | Claude Sonnet 4.5 claude-sonnet-4-5-20250929 multimodalvisionmulti-input reasoning | Anthropic | 13.3 Value / Price | 53.0 | 30.5 | 71.8 | 74.6 | 13.3 | |
| 170 | Claude Sonnet 4.6 claude-sonnet-4-6 multimodalvisionmulti-input reasoning | Anthropic | 13.3 Value / Price | 66.1 | 30.5 | 48.5 | 68.2 | 13.3 | |
| 171 | o1-preview o1-preview codeprogrammingtool use | OpenAI | 11.8 Value / Price | 41.8 | 33.0 | 0.0 | 9.5 | 11.8 | $15 in / $60 out |
| 172 | Claude Opus 4.5 claude-opus-4-5-20251101 multimodalvisionmulti-input reasoning | Anthropic | 10.7 Value / Price | 56.1 | 30.5 | 42.5 | 74.2 | 10.7 | |
| 173 | Claude Opus 4.6 claude-opus-4-6 multimodalvisionmulti-input reasoning | Anthropic | 10.7 Value / Price | 79.5 | 43.1 | 59.3 | 73.3 | 10.7 | |
| 174 | Claude Opus 4.7 claude-opus-4-7 multimodalvisionmulti-input reasoning | Anthropic | 10.7 Value / Price | 76.8 | 43.1 | 68.6 | 81.4 | 10.7 | |
| 175 | Gemma 3n E4B Instructed gemma-3n-e4b-it multimodalvisionmulti-input reasoning | Google | 10.3 Value / Price | 1.3 | 20.3 | 0.0 | 0.0 | 10.3 | |
| 176 | Claude Opus 4.1 claude-opus-4-1-20250805 multimodalvisionmulti-input reasoning | Anthropic | 7.2 Value / Price | 47.9 | 30.5 | 66.8 | 62.1 | 7.2 | |
| 177 | GPT-4.5 gpt-4.5 multimodalvisionmulti-input reasoning | OpenAI | 7.0 Value / Price | 41.9 | 29.7 | 35.8 | 6.0 | 7.0 | $75 in / $150 out |
| 178 | GPT-5.5 gpt-5.5 multimodalvisionmulti-input reasoning | OpenAI | 6.6 Value / Price | 80.4 | 84.0 | 76.5 | 66.3 | 6.6 | $5 in / $30 out |
| 179 | o1 o1-2024-12-17 multimodalvisionmulti-input reasoning | OpenAI | 4.9 Value / Price | 42.9 | 19.4 | 44.7 | 6.5 | 4.9 | $15 in / $60 out |
| 180 | o3-pro o3-pro-2025-06-10 multimodalvisionmulti-input reasoning | OpenAI | 3.6 Value / Price | 0.0 | 21.4 | 0.0 | 0.0 | 3.6 |
GPT-5.3 Codex
OpenAI
19.6
$1.75 in / $14 out
Claude 3 Opus
Anthropic
19.5
$15 in / $75 out
GPT-4 Turbo
OpenAI
18.8
$10 in / $30 out
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| $15 in / $75 out |
| $2.5 in / $15 out |
| $3 in / $15 out |
| $3 in / $15 out |
| $3 in / $15 out |
| $3 in / $15 out |
| $3 in / $15 out |
| $5 in / $25 out |
| $5 in / $25 out |
| $5 in / $25 out |
| $20 in / $40 out |
| $15 in / $75 out |
| $20 in / $80 out |
GPT-4
OpenAI
18.7
$30 in / $60 out
Grok-4.1 Thinking
xAI
17.8
$3 in / $15 out
Claude 3.7 Sonnet
Anthropic
13.3
$3 in / $15 out
Claude 3 Sonnet
Anthropic
13.3
$3 in / $15 out
Claude Sonnet 4.5
Anthropic
13.3
$3 in / $15 out
Claude Sonnet 4.6
Anthropic
13.3
$3 in / $15 out
o1-preview
OpenAI
11.8
$15 in / $60 out
Claude Opus 4.5
Anthropic
10.7
$5 in / $25 out
Claude Opus 4.6
Anthropic
10.7
$5 in / $25 out
Claude Opus 4.7
Anthropic
10.7
$5 in / $25 out
Gemma 3n E4B Instructed
10.3
$20 in / $40 out
Claude Opus 4.1
Anthropic
7.2
$15 in / $75 out
GPT-4.5
OpenAI
7.0
$75 in / $150 out
GPT-5.5
OpenAI
6.6
$5 in / $30 out
o1
OpenAI
4.9
$15 in / $60 out
o3-pro
OpenAI
3.6
$20 in / $80 out