Every major AI model, ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency. Updated continuously from published evaluation data.
| Tracked models | Providers | Benchmarked | Avg. index |
|---|---|---|---|
| 296 | 27 | 253 | 13.4 |
| Rank | Model | ID | Tags | Provider | Score (Programming) | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | Nova Pro | `nova-pro` | multimodal, vision, multi-input reasoning | Amazon | 0.0 | 20.0 | 70.5 | 0.0 | 0.0 | 43.4 | $0.8 in / $3.2 out |
| 242 | Nemotron Nano 9B v2 | `nvidia-nemotron-nano-9b-v2` | text, inference | NVIDIA | 0.0 | 24.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 243 | o1-mini | `o1-mini` | text, inference | OpenAI | 0.0 | 25.7 | 61.8 | 0.0 | 0.0 | 30.2 | $3 in / $12 out |
| 244 | o1-pro | `o1-pro` | multimodal, vision, multi-input reasoning | OpenAI | 0.0 | 47.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 245 | o3-pro | `o3-pro-2025-06-10` | multimodal, vision, multi-input reasoning | OpenAI | 0.0 | 0.0 | 21.4 | 0.0 | 0.0 | 3.6 | $20 in / $80 out |
| 246 | Phi-3.5-mini-instruct | `phi-3.5-mini-instruct` | multimodal, vision, multi-input reasoning | Microsoft | 0.0 | 2.7 | 10.8 | 0.0 | 0.0 | 77.2 | $0.1 in / $0.1 out |
| 247 | Phi-3.5-MoE-instruct | `phi-3.5-moe-instruct` | multimodal, vision, multi-input reasoning | Microsoft | 0.0 | 8.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 248 | Phi-3.5-vision-instruct | `phi-3.5-vision-instruct` | multimodal, vision, multi-input reasoning | Microsoft | 0.0 | 2.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 249 | Phi 4 | `phi-4` | text, inference | Microsoft | 0.0 | 15.6 | 9.0 | 0.0 | 0.0 | 77.2 | $0.07 in / $0.14 out |
| 250 | Phi 4 Mini | `phi-4-mini` | text, inference | Microsoft | 0.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 251 | Phi 4 Mini Reasoning | `phi-4-mini-reasoning` | text, inference | Microsoft | 0.0 | 21.7 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 252 | Phi-4-multimodal-instruct | `phi-4-multimodal-instruct` | multimodal, vision, multi-input reasoning | Microsoft | 0.0 | 8.8 | 12.3 | 0.0 | 0.0 | 79.9 | $0.05 in / $0.1 out |
| 253 | Phi 4 Reasoning | `phi-4-reasoning` | text, inference | Microsoft | 0.0 | 23.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 254 | Phi 4 Reasoning Plus | `phi-4-reasoning-plus` | text, inference | Microsoft | 0.0 | 31.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 255 | Pixtral-12B | `pixtral-12b-2409` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 8.1 | 7.0 | 0.0 | 0.0 | 73.0 | $0.15 in / $0.15 out |
| 256 | Pixtral Large | `pixtral-large` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 27.8 | 7.0 | 0.0 | 0.0 | 22.3 | $2 in / $6 out |
| 257 | QvQ-72B-Preview | `qvq-72b-preview` | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 38.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 258 | Qwen2.5 14B Instruct | `qwen-2.5-14b-instruct` | text, inference | Alibaba Cloud / Qwen Team | 0.0 | 14.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 259 | Qwen2.5 32B Instruct | `qwen-2.5-32b-instruct` | text, inference | Alibaba Cloud / Qwen Team | 0.0 | 18.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 260 | Qwen2.5 72B Instruct | `qwen-2.5-72b-instruct` | text, inference | Alibaba Cloud / Qwen Team | 0.0 | 17.8 | 15.0 | 0.0 | 0.0 | 54.6 | $0.35 in / $0.4 out |

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.