Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
11.5
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 181 | Grok-2 grok-2 multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 27.1 | 38.3 | 0.0 | 0.0 | 25.4 | $2 in / $10 out |
| 182 | Grok-2 Image 1212 grok-2-image-1212 textinference | xAI | 0.0 Agentic | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 183 | Grok-2 mini grok-2-mini multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 24.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 184 | Grok-3 grok-3 multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 59.3 | 52.7 | 0.0 | 0.0 | 22.7 | $3 in / $15 out |
| 185 | Grok-3 Mini grok-3-mini multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 53.1 | 52.7 | 0.0 | 0.0 | 65.6 | $0.3 in / $0.5 out |
| 186 | Grok-4 grok-4 multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 51.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 187 | Grok-4.1 grok-4.1-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 64.9 | 0.0 | 0.0 | 22.7 | $3 in / $15 out |
| 188 | Grok-4.1 Fast Non-Reasoning grok-4-1-fast-non-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | |
| 189 | Grok-4.1 Fast Reasoning grok-4-1-fast-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | |
| 190 | Grok-4.1 Thinking grok-4.1-thinking-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 48.5 | 0.0 | 0.0 | 17.8 | |
| 191 | Grok-4.20 Beta Non-Reasoning grok-4.20-beta-0309-non-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 97.2 | 0.0 | 0.0 | 27.7 | |
| 192 | Grok-4.20 Beta Reasoning grok-4.20-beta-0309-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 97.2 | 0.0 | 0.0 | 27.7 | |
| 193 | Grok-4.20 Multi-Agent Beta grok-4.20-multi-agent-beta-0309 multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |
| 194 | Grok-4 Fast Non-Reasoning grok-4-fast-non-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | |
| 195 | Grok-4 Fast Reasoning grok-4-fast-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | |
| 196 | Grok-4 Heavy grok-4-heavy multimodalvisionmulti-input reasoning | xAI | 0.0 Agentic | 72.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 197 | Grok Code Fast 1 grok-code-fast-1 codeprogrammingtool use | xAI | 0.0 Agentic | 0.0 | 47.7 | 0.0 | 38.8 | 49.7 | $0.2 in / $1.5 out |
| 198 | Hermes 3 70B hermes-3-70b textinference | Nous Research | 0.0 Agentic | 30.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 199 | Jamba 1.5 Large jamba-1.5-large textinference | AI21 Labs | 0.0 Agentic | 8.1 | 33.6 | 0.0 | 0.0 | 25.2 | $2 in / $8 out |
| 200 | Jamba 1.5 Mini jamba-1.5-mini textinference | AI21 Labs | 0.0 Agentic | 4.7 | 65.8 | 0.0 | 0.0 | 72.4 | $0.2 in / $0.4 out |
Grok-2
xAI
0.0
$2 in / $10 out
Grok-2 Image 1212
xAI
0.0
N/A
Grok-2 mini
xAI
0.0
N/A
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| $0.2 in / $0.5 out |
| $0.2 in / $0.5 out |
| $3 in / $15 out |
| $2 in / $6 out |
| $2 in / $6 out |
| N/A |
| $0.2 in / $0.5 out |
| $0.2 in / $0.5 out |
Grok-3
xAI
0.0
$3 in / $15 out
Grok-3 Mini
xAI
0.0
$0.3 in / $0.5 out
Grok-4
xAI
0.0
N/A
Grok-4.1
xAI
0.0
$3 in / $15 out
Grok-4.1 Fast Non-Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4.1 Fast Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4.1 Thinking
xAI
0.0
$3 in / $15 out
Grok-4.20 Beta Non-Reasoning
xAI
0.0
$2 in / $6 out
Grok-4.20 Beta Reasoning
xAI
0.0
$2 in / $6 out
Grok-4.20 Multi-Agent Beta
xAI
0.0
N/A
Grok-4 Fast Non-Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4 Fast Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4 Heavy
xAI
0.0
N/A
Grok Code Fast 1
xAI
0.0
$0.2 in / $1.5 out
Hermes 3 70B
Nous Research
0.0
N/A
Jamba 1.5 Large
AI21 Labs
0.0
$2 in / $8 out
Jamba 1.5 Mini
AI21 Labs
0.0
$0.2 in / $0.4 out