Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

| Tracked models | Providers | Benchmarked | Avg. index |
|---|---|---|---|
| 296 | 27 | 253 | 30.8 |

| Rank | Model | Model ID | Tags | Provider | Score (Value / Price) | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | DeepSeek-R1-0528 | `deepseek-r1-0528` | code, programming, tool use | DeepSeek | 35.1 | 50.1 | 14.3 | 0.0 | 6.6 | 35.1 | $0.55 in / $2.19 out |
| 122 | ERNIE 4.5 | `ernie-4.5` | text, inference | Baidu | 34.6 | 24.5 | 18.8 | 0.0 | 0.0 | 34.6 | $0.4 in / $4 out |
| 123 | GPT-4.1 | `gpt-4.1-2025-04-14` | multimodal, vision, multi-input reasoning | OpenAI | 34.6 | 28.7 | 75.9 | 32.8 | 17.3 | 34.6 | $2 in / $8 out |
| 124 | Kimi K2.6 | `kimi-k2.6` | multimodal, vision, multi-input reasoning | Moonshot AI | 33.5 | 68.1 | 66.0 | 44.2 | 81.2 | 33.5 | $0.95 in / $4 out |
| 125 | DeepSeek-V4-Pro-Max | `deepseek-v4-pro-max` | code, programming, tool use | DeepSeek | 33.0 | 67.7 | 92.3 | 68.6 | 58.3 | 33.0 | $1.74 in / $3.48 out |
| 126 | GPT-5.4 Mini | `gpt-5.4-mini` | text, text-to-text, language | OpenAI | 32.4 | 56.8 | 76.5 | 23.8 | 28.1 | 32.4 | $0.75 in / $4.5 out |
| 127 | ChatGPT-4o Latest | `chatgpt-4o-latest` | multimodal, vision, multi-input reasoning | OpenAI | 32.0 | 56.0 | 63.8 | 0.0 | 0.0 | 32.0 | $2.5 in / $10 out |
| 128 | GPT-5.1 | `gpt-5.1-2025-11-13` | multimodal, vision, multi-input reasoning | OpenAI | 32.0 | 64.9 | 72.0 | 0.0 | 56.2 | 32.0 | $1.25 in / $10 out |
| 129 | GPT-5.1 Instant | `gpt-5.1-instant-2025-11-12` | multimodal, vision, multi-input reasoning | OpenAI | 32.0 | 64.9 | 72.0 | 0.0 | 56.2 | 32.0 | $1.25 in / $10 out |
| 130 | Claude 3.5 Haiku | `claude-3-5-haiku-20241022` | code, programming, tool use | Anthropic | 31.8 | 10.8 | 30.5 | 3.0 | 7.8 | 31.8 | $0.8 in / $4 out |
| 131 | Qwen3 Max | `qwen3-max` | code, programming, tool use | Alibaba Cloud / Qwen Team | 31.3 | 29.8 | 55.2 | 0.0 | 35.8 | 31.3 | $0.5 in / $5 out |
| 132 | GLM-5 | `glm-5` | code, programming, tool use | Zhipu AI | 30.6 | 0.0 | 23.0 | 47.8 | 63.8 | 30.6 | $1 in / $3.2 out |
| 133 | GLM-5.1 | `glm-5.1` | code, programming, tool use | Zhipu AI | 30.2 | 66.8 | 46.1 | 51.5 | 60.2 | 30.2 | $1.4 in / $4.4 out |
| 134 | o1-mini | `o1-mini` | text, inference | OpenAI | 30.1 | 25.7 | 61.3 | 0.0 | 0.0 | 30.1 | $3 in / $12 out |
| 135 | Mistral Large 3 | `mistral-large-3-2509` | multimodal, vision, multi-input reasoning | Mistral AI | 29.1 | 9.6 | 18.8 | 0.0 | 0.0 | 29.1 | $2 in / $5 out |
| 136 | GPT-5.1 Medium | `gpt-5.1-medium-2025-11-12` | multimodal, vision, multi-input reasoning | OpenAI | 28.9 | 63.6 | 61.9 | 0.0 | 0.0 | 28.9 | $1.25 in / $10 out |
| 137 | GPT-5 Medium | `gpt-5-medium-2025-08-07` | multimodal, vision, multi-input reasoning | OpenAI | 28.9 | 56.7 | 61.9 | 0.0 | 0.0 | 28.9 | $1.25 in / $10 out |
| 138 | Grok-4.20 Beta Non-Reasoning | `grok-4.20-beta-0309-non-reasoning` | multimodal, vision, multi-input reasoning | xAI | 27.7 | 0.0 | 97.2 | 0.0 | 0.0 | 27.7 | $2 in / $6 out |
| 139 | Grok-4.20 Beta Reasoning | `grok-4.20-beta-0309-reasoning` | multimodal, vision, multi-input reasoning | xAI | 27.7 | 0.0 | 97.2 | 0.0 | 0.0 | 27.7 | $2 in / $6 out |
| 140 | o3 | `o3-2025-04-16` | multimodal, vision, multi-input reasoning | OpenAI | 27.7 | 46.0 | 38.9 | 19.6 | 30.2 | 27.7 | $2 in / $8 out |

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
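
The price strings in the table (for example `$0.55 in / $2.19 out`) can be turned into numbers to compare models on a concrete workload. The sketch below is a minimal illustration, not the leaderboard's own scoring method. It assumes prices are in USD per one million tokens, which the page does not state; the helper names `parse_price` and `workload_cost` and the workload sizes are hypothetical.

```python
import re

# Matches price strings as they appear in the table, e.g. "$0.55 in / $2.19 out".
PRICE_RE = re.compile(r"\$(\d+(?:\.\d+)?) in / \$(\d+(?:\.\d+)?) out")


def parse_price(text):
    """Return (input_price, output_price) as floats from a table price string."""
    match = PRICE_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized price string: {text!r}")
    return float(match.group(1)), float(match.group(2))


def workload_cost(price, input_tokens, output_tokens, unit=1_000_000):
    """Estimate the cost of a workload from a table price string.

    ASSUMPTION: prices are USD per `unit` tokens. The page does not state
    the unit, so adjust `unit` if the real pricing basis differs.
    """
    price_in, price_out = parse_price(price)
    return (price_in * input_tokens + price_out * output_tokens) / unit


# Three rows copied from the table: (model, Value / Price score, price).
rows = [
    ("DeepSeek-R1-0528", 35.1, "$0.55 in / $2.19 out"),
    ("ERNIE 4.5", 34.6, "$0.4 in / $4 out"),
    ("o3", 27.7, "$2 in / $8 out"),
]

# Hypothetical workload: 2M input tokens and 0.5M output tokens.
costs = {name: workload_cost(price, 2_000_000, 500_000) for name, _, price in rows}
cheapest = min(costs, key=costs.get)

for name, cost in sorted(costs.items(), key=lambda item: item[1]):
    print(f"{name}: ${cost:.2f}")
```

Because input and output prices differ, the cheapest model depends on the input-to-output ratio of the workload, so it is worth rerunning the comparison with token counts that match the intended use.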