Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296 tracked models · 27 providers · 253 benchmarked · average index 13.4
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 261 | Qwen2.5 7B Instruct (`qwen-2.5-7b-instruct`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 7.4 | 71.5 | 0.0 | 0.0 | 77.5 | $0.3 in / $0.3 out |
| 262 | Qwen2.5-Coder 32B Instruct (`qwen-2.5-coder-32b-instruct`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 21.1 | 0.0 | 0.0 | 81.4 | $0.09 in / $0.09 out |
| 263 | Qwen2.5-Coder 7B Instruct (`qwen-2.5-coder-7b-instruct`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 264 | Qwen2.5-Omni-7B (`qwen2.5-omni-7b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 7.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 265 | Qwen2.5 VL 32B Instruct (`qwen2.5-vl-32b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 21.2 | 0.0 | 1.6 | 0.0 | 0.0 | N/A |
| 266 | Qwen2.5 VL 72B Instruct (`qwen2.5-vl-72b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 24.9 | 0.0 | 5.7 | 0.0 | 0.0 | N/A |
| 267 | Qwen2.5 VL 7B Instruct (`qwen2.5-vl-7b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 9.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 268 | Qwen2 72B Instruct (`qwen2-72b-instruct`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 12.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 269 | Qwen2 7B Instruct (`qwen2-7b-instruct`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 2.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 270 | Qwen2-VL-72B-Instruct (`qwen2-vl-72b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 9.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 271 | Qwen3 235B A22B (`qwen3-235b-a22b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 30.5 | 33.0 | 0.0 | 0.0 | 83.9 | $0.1 in / $0.1 out |
| 272 | Qwen3-235B-A22B-Instruct-2507 (`qwen3-235b-a22b-instruct-2507`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 42.4 | 65.9 | 0.0 | 0.0 | 62.8 | $0.15 in / $0.8 out |
| 273 | Qwen3-235B-A22B-Thinking-2507 (`qwen3-235b-a22b-thinking-2507`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 46.4 | 65.9 | 26.8 | 0.0 | 39.9 | $0.3 in / $3 out |
| 274 | Qwen3 30B A3B (`qwen3-30b-a3b`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 25.6 | 40.1 | 0.0 | 0.0 | 71.4 | $0.1 in / $0.44 out |
| 275 | Qwen3 32B (`qwen3-32b`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 21.4 | 13.3 | 0.0 | 0.0 | 69.9 | $0.1 in / $0.3 out |
| 276 | Qwen3.5-0.8B (`qwen3.5-0.8b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 277 | Qwen3.5-2B (`qwen3.5-2b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 14.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 278 | Qwen3.5-4B (`qwen3.5-4b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 32.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 279 | Qwen3.5-9B (`qwen3.5-9b`): multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 0.0 | 38.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 280 | Qwen3-Coder (`qwen3-coder`): text, inference | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 54.9 | 0.0 | 0.0 | 88.5 | $0.18 in / $0.18 out |
Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.