Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296 tracked models · 27 providers · 253 benchmarked · average index 11.5

| Rank | Model | Provider | Score (Agentic) | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 261 | Phi 4 `phi-4` (text, inference) | Microsoft | 0.0 | 15.6 | 9.0 | 0.0 | 0.0 | 77.2 | $0.07 in / $0.14 out |
| 262 | Phi 4 Mini `phi-4-mini` (text, inference) | Microsoft | 0.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 263 | Phi 4 Mini Reasoning `phi-4-mini-reasoning` (text, inference) | Microsoft | 0.0 | 21.7 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 264 | Phi-4-multimodal-instruct `phi-4-multimodal-instruct` (multimodal, vision, multi-input reasoning) | Microsoft | 0.0 | 8.8 | 12.3 | 0.0 | 0.0 | 79.9 | $0.05 in / $0.1 out |
| 265 | Phi 4 Reasoning `phi-4-reasoning` (text, inference) | Microsoft | 0.0 | 23.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 266 | Phi 4 Reasoning Plus `phi-4-reasoning-plus` (text, inference) | Microsoft | 0.0 | 31.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 267 | Pixtral-12B `pixtral-12b-2409` (multimodal, vision, multi-input reasoning) | Mistral AI | 0.0 | 8.1 | 7.0 | 0.0 | 0.0 | 73.0 | $0.15 in / $0.15 out |
| 268 | Pixtral Large `pixtral-large` (multimodal, vision, multi-input reasoning) | Mistral AI | 0.0 | 27.8 | 7.0 | 0.0 | 0.0 | 22.3 | $2 in / $6 out |
| 269 | QvQ-72B-Preview `qvq-72b-preview` (multimodal, vision, multi-input reasoning) | Alibaba Cloud / Qwen Team | 0.0 | 38.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 270 | Qwen2.5 14B Instruct `qwen-2.5-14b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 14.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 271 | Qwen2.5 32B Instruct `qwen-2.5-32b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 18.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 272 | Qwen2.5 72B Instruct `qwen-2.5-72b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 17.8 | 15.0 | 0.0 | 0.0 | 54.6 | $0.35 in / $0.4 out |
| 273 | Qwen2.5 7B Instruct `qwen-2.5-7b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 7.4 | 71.5 | 0.0 | 0.0 | 77.5 | $0.3 in / $0.3 out |
| 274 | Qwen2.5-Coder 32B Instruct `qwen-2.5-coder-32b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 21.1 | 0.0 | 0.0 | 81.4 | $0.09 in / $0.09 out |
| 275 | Qwen2.5-Coder 7B Instruct `qwen-2.5-coder-7b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 276 | Qwen2.5-Omni-7B `qwen2.5-omni-7b` (multimodal, vision, multi-input reasoning) | Alibaba Cloud / Qwen Team | 0.0 | 7.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 277 | Qwen2.5 VL 7B Instruct `qwen2.5-vl-7b` (multimodal, vision, multi-input reasoning) | Alibaba Cloud / Qwen Team | 0.0 | 9.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 278 | Qwen2 72B Instruct `qwen2-72b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 12.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 279 | Qwen2 7B Instruct `qwen2-7b-instruct` (text, inference) | Alibaba Cloud / Qwen Team | 0.0 | 2.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 280 | Qwen2-VL-72B-Instruct `qwen2-vl-72b` (multimodal, vision, multi-input reasoning) | Alibaba Cloud / Qwen Team | 0.0 | 9.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.