Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
- Tracked models: 296
- Providers: 27
- Benchmarked: 253
- Avg. index: 34.7
| Rank | Model | Model ID | Tags | Provider | Overall | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 261 | DeepSeek VL2 | `deepseek-vl2` | multimodal, vision, multi-input reasoning | DeepSeek | 6.9 | 6.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 262 | Mistral Small 3 24B Base | `mistral-small-24b-base-2501` | multimodal, vision, multi-input reasoning | Mistral AI | 6.4 | 6.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 263 | DeepSeek R1 Distill Qwen 1.5B | `deepseek-r1-distill-qwen-1.5b` | text, inference | DeepSeek | 6.1 | 6.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 264 | Qwen2.5 VL 7B Instruct | `qwen2.5-vl-7b` | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 5.2 | 9.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 265 | Gemini Diffusion | `gemini-diffusion` | code, programming, tool use | Google | 4.7 | 7.0 | 0.0 | 0.0 | 1.7 | 0.0 | N/A |
| 266 | DeepSeek VL2 Small | `deepseek-vl2-small` | multimodal, vision, multi-input reasoning | DeepSeek | 4.6 | 4.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 267 | GPT-5.1 Codex Mini | `gpt-5.1-codex-mini` | multimodal, vision, multi-input reasoning | OpenAI | 4.0 | 4.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 268 | Qwen2 7B Instruct | `qwen2-7b-instruct` | text, inference | Alibaba Cloud / Qwen Team | 2.4 | 2.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 269 | Phi-3.5-vision-instruct | `phi-3.5-vision-instruct` | multimodal, vision, multi-input reasoning | Microsoft | 2.3 | 2.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 270 | Phi 4 Mini | `phi-4-mini` | text, inference | Microsoft | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 271 | Gemma 3n E4B Instructed LiteRT Preview | `gemma-3n-e4b-it-litert-preview` | multimodal, vision, multi-input reasoning | Google | 1.3 | 1.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 272 | DeepSeek VL2 Tiny | `deepseek-vl2-tiny` | multimodal, vision, multi-input reasoning | DeepSeek | 1.2 | 1.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 273 | Gemma 3n E2B Instructed | `gemma-3n-e2b-it` | multimodal, vision, multi-input reasoning | Google | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 274 | Gemma 3n E2B Instructed LiteRT (Preview) | `gemma-3n-e2b-it-litert-preview` | multimodal, vision, multi-input reasoning | Google | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 275 | Gemma 3 1B | `gemma-3-1b-it` | text, inference | Google | 0.9 | 0.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 276 | Codestral-22B | `codestral-22b` | text, inference | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 277 | Gemma 2 27B | `gemma-2-27b-it` | text, inference | Google | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 278 | Gemma 2 9B | `gemma-2-9b-it` | text, inference | Google | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 279 | Gemma 3n E2B | `gemma-3n-e2b` | multimodal, vision, multi-input reasoning | Google | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 280 | Gemma 3n E4B | `gemma-3n-e4b` | multimodal, vision, multi-input reasoning | Google | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.