Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
13.4
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 161 | GPT-5.2 Pro gpt-5.2-pro-2025-12-11 multimodalvisionmulti-input reasoning | OpenAI | 0.0 Programming | 66.9 | 31.6 | 55.4 | 0.0 | 2.7 | $21 in / $168 out |
| 162 | GPT-5.3 Chat gpt-5.3-chat-latest multimodalvisionmulti-input reasoning | OpenAI | 0.0 Programming | 0.0 | 52.7 | 0.0 | 0.0 | 26.5 | |
| 163 | GPT-5 High gpt-5-high-2025-08-07 multimodalvisionmulti-input reasoning | OpenAI | 0.0 Programming | 62.9 | 0.0 | 0.0 | 0.0 | 0.0 | |
| 164 | GPT-5 Medium gpt-5-medium-2025-08-07 multimodalvisionmulti-input reasoning | OpenAI | 0.0 Programming | 56.7 | 61.9 | 0.0 | 0.0 | 28.9 | |
| 165 | GPT OSS 120B gpt-oss-120b textinference | OpenAI | 0.0 Programming | 36.1 | 34.5 | 26.8 | 0.0 | 76.4 | $0.09 in / $0.45 out |
| 166 | GPT OSS 120B High gpt-oss-120b-high multimodalvisionmulti-input reasoning | OpenAI | 0.0 Programming | 44.7 | 58.0 | 0.0 | 0.0 | 73.3 | |
| 167 | GPT OSS 20B gpt-oss-20b textinference | OpenAI | 0.0 Programming | 25.8 | 77.2 | 6.0 | 0.0 | 79.0 | $0.1 in / $0.5 out |
| 168 | GPT OSS 20B High gpt-oss-20b-high textinference | OpenAI | 0.0 Programming | 53.7 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 169 | Granite 3.3 8B Base granite-3.3-8b-base multimodalvisionmulti-input reasoning | IBM | 0.0 Programming | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 170 | Granite 3.3 8B Instruct granite-3.3-8b-instruct multimodalvisionmulti-input reasoning | IBM | 0.0 Programming | 0.0 | 29.7 | 0.0 | 0.0 | 56.7 | $0.5 in / $0.5 out |
| 171 | IBM Granite 4.0 Tiny Preview granite-4.0-tiny-preview textinference | IBM | 0.0 Programming | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 172 | Grok-1.5 grok-1.5 multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 8.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 173 | Grok-1.5V grok-1.5v multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 9.8 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 174 | Grok-2 grok-2 multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 27.1 | 38.3 | 0.0 | 0.0 | 25.4 | $2 in / $10 out |
| 175 | Grok-2 Image 1212 grok-2-image-1212 textinference | xAI | 0.0 Programming | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 176 | Grok-2 mini grok-2-mini multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 24.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 177 | Grok-3 grok-3 multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 59.3 | 52.7 | 0.0 | 0.0 | 22.7 | $3 in / $15 out |
| 178 | Grok-3 Mini grok-3-mini multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 53.1 | 52.7 | 0.0 | 0.0 | 65.6 | $0.3 in / $0.5 out |
| 179 | Grok-4 grok-4 multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 51.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 180 | Grok-4.1 grok-4.1-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 0.0 Programming | 0.0 | 64.9 | 0.0 | 0.0 | 22.7 |
GPT-5.2 Pro
OpenAI
0.0
$21 in / $168 out
GPT-5.3 Chat
OpenAI
0.0
$1.75 in / $14 out
GPT-5 High
OpenAI
0.0
N/A
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| $1.75 in / $14 out |
| N/A |
| $1.25 in / $10 out |
| $0.1 in / $0.5 out |
| $3 in / $15 out |
GPT-5 Medium
OpenAI
0.0
$1.25 in / $10 out
GPT OSS 120B
OpenAI
0.0
$0.09 in / $0.45 out
GPT OSS 120B High
OpenAI
0.0
$0.1 in / $0.5 out
GPT OSS 20B
OpenAI
0.0
$0.1 in / $0.5 out
GPT OSS 20B High
OpenAI
0.0
N/A
Granite 3.3 8B Base
IBM
0.0
N/A
Granite 3.3 8B Instruct
IBM
0.0
$0.5 in / $0.5 out
IBM Granite 4.0 Tiny Preview
IBM
0.0
N/A
Grok-1.5
xAI
0.0
N/A
Grok-1.5V
xAI
0.0
N/A
Grok-2
xAI
0.0
$2 in / $10 out
Grok-2 Image 1212
xAI
0.0
N/A
Grok-2 mini
xAI
0.0
N/A
Grok-3
xAI
0.0
$3 in / $15 out
Grok-3 Mini
xAI
0.0
$0.3 in / $0.5 out
Grok-4
xAI
0.0
N/A
Grok-4.1
xAI
0.0
$3 in / $15 out