Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
| Tracked models | Providers | Benchmarked | Avg. index |
|---|---|---|---|
| 296 | 27 | 253 | 34.7 |
| Rank | Model | Model ID | Tags | Provider | Overall score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 281 | GLM-4.5V | `glm-4.5v` | multimodal, vision, multi-input, reasoning | Zhipu AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 282 | Granite 3.3 8B Base | `granite-3.3-8b-base` | multimodal, vision, multi-input, reasoning | IBM | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 283 | IBM Granite 4.0 Tiny Preview | `granite-4.0-tiny-preview` | text, inference | IBM | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 284 | Grok-2 Image 1212 | `grok-2-image-1212` | text, inference | xAI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 285 | Grok-4.20 Multi-Agent Beta | `grok-4.20-multi-agent-beta-0309` | multimodal, vision, multi-input, reasoning | xAI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 286 | Llama 3.1 Nemotron 70B Instruct | `llama-3.1-nemotron-70b-instruct` | text, inference | NVIDIA | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 287 | MedGemma 4B IT | `medgemma-4b-it` | multimodal, vision, multi-input, reasoning | Google | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 288 | Ministral 3 (14B Base 2512) | `ministral-3-14b-base-2512` | multimodal, vision, multi-input, reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 289 | Ministral 3 (14B Instruct 2512) | `ministral-3-14b-instruct-2512` | multimodal, vision, multi-input, reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 290 | Ministral 3 (3B Base 2512) | `ministral-3-3b-base-2512` | multimodal, vision, multi-input, reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 291 | Ministral 3 (3B Instruct 2512) | `ministral-3-3b-instruct-2512` | multimodal, vision, multi-input, reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 292 | Ministral 3 (8B Base 2512) | `ministral-3-8b-base-2512` | multimodal, vision, multi-input, reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 293 | Ministral 3 (8B Instruct 2512) | `ministral-3-8b-instruct-2512` | multimodal, vision, multi-input, reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 294 | Qwen2.5-Coder 7B Instruct | `qwen-2.5-coder-7b-instruct` | text, inference | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 295 | Qwen3.5-0.8B | `qwen3.5-0.8b` | multimodal, vision, multi-input, reasoning | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 296 | Qwen3-Next-80B-A3B-Base | `qwen3-next-80b-a3b-base` | text, inference | Alibaba Cloud / Qwen Team | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
Page 15 of 15 · 296 models
Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
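
As a rough illustration of how a multi-dimensional ranking score can be assembled, the sketch below combines the five per-dimension columns of the table into one overall number using a weighted mean. The leaderboard does not publish its formula, so the function name `overall_score`, the `WEIGHTS` mapping, and the equal weighting are all assumptions made for this example, not the site's actual method.

```python
# Hypothetical equal weights: the leaderboard does not state its real weighting.
WEIGHTS = {
    "benchmarks": 1,
    "inference": 1,
    "agentic": 1,
    "programming": 1,
    "value": 1,
}


def overall_score(sub_scores, weights=WEIGHTS):
    """Return the weighted mean of per-dimension scores, rounded to one decimal."""
    total_weight = sum(weights.values())
    weighted_sum = sum(weights[name] * sub_scores[name] for name in weights)
    return round(weighted_sum / total_weight, 1)


# A row with no evaluation data in any dimension, like those on this page.
unscored = {name: 0.0 for name in WEIGHTS}
print(overall_score(unscored))
```

Under any weighting of this kind, a model with 0.0 in every dimension ends up with an overall score of 0.0, which is consistent with the rows listed on this page.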