Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
27.4
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 241 | Codestral-22B codestral-22b textinference | Mistral AI | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 242 | Command R+ command-r-plus-04-2024 textinference | Cohere | 0.0 Benchmarks | 0.0 | 32.0 | 0.0 | 0.0 | 55.4 | $0.25 in / $1 out |
| 243 | DeepSeek-V3.2 (Non-thinking) deepseek-chat textinference | DeepSeek | 0.0 Benchmarks | 0.0 | 57.9 | 0.0 | 0.0 | 70.3 | $0.28 in / $0.42 out |
| 244 | DeepSeek-R1 deepseek-r1 textinference | DeepSeek | 0.0 Benchmarks | 0.0 | 14.2 | 0.0 | 0.0 | 35.4 | $0.55 in / $2.19 out |
| 245 | DeepSeek-V2.5 deepseek-v2.5 codeprogrammingtool use | DeepSeek | 0.0 Benchmarks | 0.0 | 46.5 | 0.0 | 0.9 | 79.7 | $0.14 in / $0.28 out |
| 246 | Devstral Medium devstral-medium-2507 codeprogrammingtool use | Mistral AI | 0.0 Benchmarks | 0.0 | 65.3 | 0.0 | 24.2 | 53.8 | $0.4 in / $2 out |
| 247 | Devstral Small 1.1 devstral-small-2507 codeprogrammingtool use | Mistral AI | 0.0 Benchmarks | 0.0 | 65.3 | 0.0 | 14.7 | 85.5 | |
| 248 | Gemma 2 27B gemma-2-27b-it textinference | Google | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 249 | Gemma 2 9B gemma-2-9b-it textinference | Google | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 250 | Gemma 3n E2B gemma-3n-e2b multimodalvisionmulti-input reasoning | Google | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 251 | Gemma 3n E4B gemma-3n-e4b multimodalvisionmulti-input reasoning | Google | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 252 | GLM-4.5V glm-4.5v multimodalvisionmulti-input reasoning | Zhipu AI | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 253 | GLM-5 glm-5 codeprogrammingtool use | Zhipu AI | 0.0 Benchmarks | 0.0 | 22.9 | 47.8 | 63.8 | 30.7 | $1 in / $3.2 out |
| 254 | GLM-5V-Turbo glm-5v-turbo multimodalvisionmulti-input reasoning | Zhipu AI | 0.0 Benchmarks | 0.0 | 0.0 | 54.9 | 0.0 | 0.0 | N/A |
| 255 | GPT-5.1 Codex gpt-5.1-codex multimodalvisionmulti-input reasoning | OpenAI | 0.0 Benchmarks | 0.0 | 48.5 | 0.0 | 50.0 | 24.8 | |
| 256 | GPT-5.2 Codex gpt-5.2-codex multimodalvisionmulti-input reasoning | OpenAI | 0.0 Benchmarks | 0.0 | 48.5 | 0.0 | 44.1 | 19.4 | |
| 257 | GPT-5.3 Chat gpt-5.3-chat-latest multimodalvisionmulti-input reasoning | OpenAI | 0.0 Benchmarks | 0.0 | 52.7 | 0.0 | 0.0 | 26.4 | |
| 258 | GPT-5.3 Codex gpt-5.3-codex texttext-to-textcoding | OpenAI | 0.0 Benchmarks | 0.0 | 48.5 | 0.0 | 52.2 | 19.4 | |
| 259 | GPT-5 Codex gpt-5-codex-2025-09-15 codeprogrammingtool use | OpenAI | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 53.1 | 0.0 | N/A |
| 260 | Granite 3.3 8B Base granite-3.3-8b-base multimodalvisionmulti-input reasoning | IBM | 0.0 Benchmarks | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
Codestral-22B
Mistral AI
0.0
N/A
Command R+
Cohere
0.0
$0.25 in / $1 out
DeepSeek-V3.2 (Non-thinking)
DeepSeek
0.0
$0.28 in / $0.42 out
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| $0.1 in / $0.3 out |
| $1.25 in / $10 out |
| $1.75 in / $14 out |
| $1.75 in / $14 out |
| $1.75 in / $14 out |
DeepSeek-R1
DeepSeek
0.0
$0.55 in / $2.19 out
DeepSeek-V2.5
DeepSeek
0.0
$0.14 in / $0.28 out
Devstral Medium
Mistral AI
0.0
$0.4 in / $2 out
Devstral Small 1.1
Mistral AI
0.0
$0.1 in / $0.3 out
Gemma 2 27B
0.0
N/A
Gemma 2 9B
0.0
N/A
Gemma 3n E2B
0.0
N/A
Gemma 3n E4B
0.0
N/A
GLM-4.5V
Zhipu AI
0.0
N/A
GLM-5
Zhipu AI
0.0
$1 in / $3.2 out
GLM-5V-Turbo
Zhipu AI
0.0
N/A
GPT-5.1 Codex
OpenAI
0.0
$1.25 in / $10 out
GPT-5.2 Codex
OpenAI
0.0
$1.75 in / $14 out
GPT-5.3 Chat
OpenAI
0.0
$1.75 in / $14 out
GPT-5 Codex
OpenAI
0.0
N/A
Granite 3.3 8B Base
IBM
0.0
N/A