Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296 tracked models · 27 providers · 253 benchmarked · 11.5 avg. index
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 101 | Command R+ `command-r-plus-04-2024` (text, inference) | Cohere | 0.0 (Agentic) | 0.0 | 32.5 | 0.0 | 0.0 | 55.4 | $0.25 in / $1 out |
| 102 | DeepSeek-V3.2 (Non-thinking) `deepseek-chat` (text, inference) | DeepSeek | 0.0 (Agentic) | 0.0 | 58.0 | 0.0 | 0.0 | 70.2 | $0.28 in / $0.42 out |
| 103 | DeepSeek-R1 `deepseek-r1` (text, inference) | DeepSeek | 0.0 (Agentic) | 0.0 | 14.3 | 0.0 | 0.0 | 35.1 | $0.55 in / $2.19 out |
| 104 | DeepSeek-R1-0528 `deepseek-r1-0528` (code, programming, tool use) | DeepSeek | 0.0 (Agentic) | 50.1 | 14.3 | 0.0 | 6.6 | 35.1 | $0.55 in / $2.19 out |
| 105 | DeepSeek R1 Distill Llama 70B `deepseek-r1-distill-llama-70b` (text, inference) | DeepSeek | 0.0 (Agentic) | 28.8 | 16.6 | 0.0 | 0.0 | 66.6 | $0.1 in / $0.4 out |
| 106 | DeepSeek R1 Distill Llama 8B `deepseek-r1-distill-llama-8b` (text, inference) | DeepSeek | 0.0 (Agentic) | 17.8 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 107 | DeepSeek R1 Distill Qwen 14B `deepseek-r1-distill-qwen-14b` (text, inference) | DeepSeek | 0.0 (Agentic) | 24.7 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 108 | DeepSeek R1 Distill Qwen 1.5B `deepseek-r1-distill-qwen-1.5b` (text, inference) | DeepSeek | 0.0 (Agentic) | 6.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 109 | DeepSeek R1 Distill Qwen 32B `deepseek-r1-distill-qwen-32b` (text, inference) | DeepSeek | 0.0 (Agentic) | 26.6 | 16.6 | 0.0 | 0.0 | 75.9 | $0.12 in / $0.18 out |
| 110 | DeepSeek R1 Distill Qwen 7B `deepseek-r1-distill-qwen-7b` (text, inference) | DeepSeek | 0.0 (Agentic) | 18.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 111 | DeepSeek R1 Zero `deepseek-r1-zero` (text, inference) | DeepSeek | 0.0 (Agentic) | 39.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 112 | DeepSeek-V2.5 `deepseek-v2.5` (code, programming, tool use) | DeepSeek | 0.0 (Agentic) | 0.0 | 46.5 | 0.0 | 0.9 | 79.7 | $0.14 in / $0.28 out |
| 113 | DeepSeek-V3 `deepseek-v3` (code, programming, tool use) | DeepSeek | 0.0 (Agentic) | 27.3 | 58.0 | 0.0 | 10.4 | 60.5 | $0.27 in / $1.1 out |
| 114 | DeepSeek-V3 0324 `deepseek-v3-0324` (text, inference) | DeepSeek | 0.0 (Agentic) | 32.8 | 39.8 | 0.0 | 0.0 | 57.7 | $0.28 in / $1.14 out |
| 115 | DeepSeek VL2 `deepseek-vl2` (multimodal, vision, multi-input reasoning) | DeepSeek | 0.0 (Agentic) | 6.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 116 | DeepSeek VL2 Small `deepseek-vl2-small` (multimodal, vision, multi-input reasoning) | DeepSeek | 0.0 (Agentic) | 4.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 117 | DeepSeek VL2 Tiny `deepseek-vl2-tiny` (multimodal, vision, multi-input reasoning) | DeepSeek | 0.0 (Agentic) | 1.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 118 | Devstral Medium `devstral-medium-2507` (code, programming, tool use) | Mistral AI | 0.0 (Agentic) | 0.0 | 64.8 | 0.0 | 24.2 | 53.4 | $0.4 in / $2 out |
| 119 | Devstral Small 1.1 `devstral-small-2507` (code, programming, tool use) | Mistral AI | 0.0 (Agentic) | 0.0 | 64.8 | 0.0 | 14.7 | 85.3 | $0.1 in / $0.3 out |
| 120 | ERNIE 4.5 `ernie-4.5` (text, inference) | Baidu | 0.0 (Agentic) | 24.5 | 18.8 | 0.0 | 0.0 | 34.6 | $0.4 in / $4 out |
Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.