Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
Tracked models: 296
Providers: 27
Benchmarked: 253
Avg. index: 30.8
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 221 | GPT-5 nano (gpt-5-nano-2025-08-07); multimodal, vision, multi-input reasoning | OpenAI | 0.0 (Value / Price) | 26.3 | 0.0 | 0.0 | 11.8 | 0.0 | N/A |
| 222 | GPT OSS 20B High (gpt-oss-20b-high); text, inference | OpenAI | 0.0 (Value / Price) | 53.7 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 223 | Granite 3.3 8B Base (granite-3.3-8b-base); multimodal, vision, multi-input reasoning | IBM | 0.0 (Value / Price) | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 224 | IBM Granite 4.0 Tiny Preview (granite-4.0-tiny-preview); text, inference | IBM | 0.0 (Value / Price) | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 225 | Grok-1.5 (grok-1.5); multimodal, vision, multi-input reasoning | xAI | 0.0 (Value / Price) | 8.6 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 226 | Grok-1.5V (grok-1.5v); multimodal, vision, multi-input reasoning | xAI | 0.0 (Value / Price) | 9.8 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 227 | Grok-2 Image 1212 (grok-2-image-1212); text, inference | xAI | 0.0 (Value / Price) | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 228 | Grok-2 mini (grok-2-mini); multimodal, vision, multi-input reasoning | xAI | 0.0 (Value / Price) | 24.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 229 | Grok-4 (grok-4); multimodal, vision, multi-input reasoning | xAI | 0.0 (Value / Price) | 51.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 230 | Grok-4.20 Multi-Agent Beta (grok-4.20-multi-agent-beta-0309); multimodal, vision, multi-input reasoning | xAI | 0.0 (Value / Price) | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 231 | Grok-4 Heavy (grok-4-heavy); multimodal, vision, multi-input reasoning | xAI | 0.0 (Value / Price) | 72.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 232 | Hermes 3 70B (hermes-3-70b); text, inference | Nous Research | 0.0 (Value / Price) | 30.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 233 | Kimi-k1.5 (kimi-k1.5); multimodal, vision, multi-input reasoning | Moonshot AI | 0.0 (Value / Price) | 35.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 234 | Kimi K2 Base (kimi-k2-base); text, inference | Moonshot AI | 0.0 (Value / Price) | 26.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 235 | Kimi K2-Instruct-0905 (kimi-k2-instruct-0905); code, programming, tool use | Moonshot AI | 0.0 (Value / Price) | 24.4 | 0.0 | 6.6 | 19.3 | 0.0 | N/A |
| 236 | Kimi K2-Thinking-0905 (kimi-k2-thinking-0905); code, programming, tool use | Moonshot AI | 0.0 (Value / Price) | 69.2 | 0.0 | 52.7 | 62.0 | 0.0 | N/A |
| 237 | Llama 3.1 Nemotron 70B Instruct (llama-3.1-nemotron-70b-instruct); text, inference | NVIDIA | 0.0 (Value / Price) | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 238 | Llama 3.1 Nemotron Nano 8B V1 (llama-3.1-nemotron-nano-8b-v1); text, inference | NVIDIA | 0.0 (Value / Price) | 16.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 239 | Llama 3.1 Nemotron Ultra 253B v1 (llama-3.1-nemotron-ultra-253b-v1); text, inference | NVIDIA | 0.0 (Value / Price) | 35.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 240 | Llama-3.3 Nemotron Super 49B v1 (llama-3.3-nemotron-super-49b-v1); text, inference | NVIDIA | 0.0 (Value / Price) | 23.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.