Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
27.4
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 261 | Granite 3.3 8B Instruct granite-3.3-8b-instruct multimodalvisionmulti-input reasoning | IBM | 0.0 Benchmarks | 0.0 | 29.2 | 0.0 | 0.0 | 56.3 | $0.5 in / $0.5 out |
| 262 | IBM Granite 4.0 Tiny Preview granite-4.0-tiny-preview textinference | IBM | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 263 | Grok-2 Image 1212 grok-2-image-1212 textinference | xAI | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 264 | Grok-4.1 grok-4.1-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 64.8 | 0.0 | 0.0 | 22.7 | |
| 265 | Grok-4.1 Fast Non-Reasoning grok-4-1-fast-non-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 68.5 | 0.0 | 0.0 | 67.2 | |
| 266 | Grok-4.1 Fast Reasoning grok-4-1-fast-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 68.5 | 0.0 | 0.0 | 67.2 | |
| 267 | Grok-4.1 Thinking grok-4.1-thinking-2025-11-17 multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 47.9 | 0.0 | 0.0 | 17.6 | |
| 268 | Grok-4.20 Beta Non-Reasoning grok-4.20-beta-0309-non-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 97.3 | 0.0 | 0.0 | 27.6 | |
| 269 | Grok-4.20 Beta Reasoning grok-4.20-beta-0309-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 97.3 | 0.0 | 0.0 | 27.6 | |
| 270 | Grok-4.20 Multi-Agent Beta grok-4.20-multi-agent-beta-0309 multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |
| 271 | Grok-4 Fast Non-Reasoning grok-4-fast-non-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 68.5 | 0.0 | 0.0 | 67.2 | |
| 272 | Grok-4 Fast Reasoning grok-4-fast-reasoning multimodalvisionmulti-input reasoning | xAI | 0.0 Benchmarks | 0.0 | 68.5 | 0.0 | 0.0 | 67.2 | |
| 273 | Grok Code Fast 1 grok-code-fast-1 codeprogrammingtool use | xAI | 0.0 Benchmarks | 0.0 | 47.1 | 0.0 | 38.8 | 49.7 | $0.2 in / $1.5 out |
| 274 | Llama 3.1 Nemotron 70B Instruct llama-3.1-nemotron-70b-instruct textinference | NVIDIA | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 275 | MedGemma 4B IT medgemma-4b-it multimodalvisionmulti-input reasoning | Google | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |
| 276 | MiMo-V2-Omni mimo-v2-omni multimodalvisionmulti-input reasoning | Xiaomi | 0.0 Benchmarks | 0.0 | 58.2 | 0.0 | 54.4 | 45.1 | $0.4 in / $2 out |
| 277 | MiMo-V2-Pro mimo-v2-pro codeprogrammingtool use | Xiaomi | 0.0 Benchmarks | 0.0 | 84.1 | 0.0 | 65.1 | 36.9 | $1 in / $3 out |
| 278 | MiniMax M2.5 minimax-m2.5 codeprogrammingtool use | MiniMax | 0.0 Benchmarks | 0.0 | 74.5 | 52.2 | 57.4 | 58.1 | $0.3 in / $1.2 out |
| 279 | MiniMax M2.7 minimax-m2.7 codeprogrammingtool use | MiniMax | 0.0 Benchmarks | 0.0 | 51.9 | 44.9 | 40.1 | 55.2 | $0.3 in / $1.2 out |
| 280 | Ministral 3 (14B Base 2512) ministral-3-14b-base-2512 multimodalvisionmulti-input reasoning | Mistral AI | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Granite 3.3 8B Instruct
IBM
0.0
$0.5 in / $0.5 out
IBM Granite 4.0 Tiny Preview
IBM
0.0
N/A
Grok-2 Image 1212
xAI
0.0
N/A
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| $3 in / $15 out |
| $0.2 in / $0.5 out |
| $0.2 in / $0.5 out |
| $3 in / $15 out |
| $2 in / $6 out |
| $2 in / $6 out |
| N/A |
| $0.2 in / $0.5 out |
| $0.2 in / $0.5 out |
| N/A |
| N/A |
Grok-4.1
xAI
0.0
$3 in / $15 out
Grok-4.1 Fast Non-Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4.1 Fast Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4.1 Thinking
xAI
0.0
$3 in / $15 out
Grok-4.20 Beta Non-Reasoning
xAI
0.0
$2 in / $6 out
Grok-4.20 Beta Reasoning
xAI
0.0
$2 in / $6 out
Grok-4.20 Multi-Agent Beta
xAI
0.0
N/A
Grok-4 Fast Non-Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok-4 Fast Reasoning
xAI
0.0
$0.2 in / $0.5 out
Grok Code Fast 1
xAI
0.0
$0.2 in / $1.5 out
Llama 3.1 Nemotron 70B Instruct
NVIDIA
0.0
N/A
MedGemma 4B IT
0.0
N/A
MiMo-V2-Omni
Xiaomi
0.0
$0.4 in / $2 out
MiMo-V2-Pro
Xiaomi
0.0
$1 in / $3 out
MiniMax M2.5
MiniMax
0.0
$0.3 in / $1.2 out
MiniMax M2.7
MiniMax
0.0
$0.3 in / $1.2 out
Ministral 3 (14B Base 2512)
Mistral AI
0.0
N/A