Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
296
Tracked models
27
Providers
253
Benchmarked
11.5
Avg. index
296 models
| Rank | Model | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|
| 201 | K-EXAONE-236B-A23B k-exaone-236b-a23b multimodalvisionmulti-input reasoning | LG AI Research | 0.0 Agentic | 43.4 | 24.9 | 0.0 | 0.0 | 49.2 | $0.6 in / $1 out |
| 202 | Kimi-k1.5 kimi-k1.5 multimodalvisionmulti-input reasoning | Moonshot AI | 0.0 Agentic | 35.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 203 | Kimi K2 0905 kimi-k2-0905 textinference | Moonshot AI | 0.0 Agentic | 44.0 | 66.0 | 0.0 | 0.0 | 40.1 | $0.6 in / $2.5 out |
| 204 | Kimi K2 Base kimi-k2-base textinference | Moonshot AI | 0.0 Agentic | 26.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 205 | Llama 3.1 405B Instruct llama-3.1-405b-instruct textinference | Meta | 0.0 Agentic | 20.0 | 21.4 | 0.0 | 0.0 | 44.5 | $0.89 in / $0.89 out |
| 206 | Llama 3.1 70B Instruct llama-3.1-70b-instruct textinference | Meta | 0.0 Agentic | 11.2 | 21.4 | 0.0 | 0.0 | 72.2 | $0.2 in / $0.2 out |
| 207 | Llama 3.1 8B Instruct llama-3.1-8b-instruct textinference | Meta | 0.0 Agentic | 3.2 | 26.7 | 0.0 | 0.0 | 83.9 | $0.03 in / $0.03 out |
| 208 | Llama 3.1 Nemotron 70B Instruct llama-3.1-nemotron-70b-instruct textinference | NVIDIA | 0.0 Agentic | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 209 | Llama 3.1 Nemotron Nano 8B V1 llama-3.1-nemotron-nano-8b-v1 textinference | NVIDIA | 0.0 Agentic | 16.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 210 | Llama 3.1 Nemotron Ultra 253B v1 llama-3.1-nemotron-ultra-253b-v1 textinference | NVIDIA | 0.0 Agentic | 35.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 211 | Llama 3.2 11B Instruct llama-3.2-11b-instruct multimodalvisionmulti-input reasoning | Meta | 0.0 Agentic | 4.0 | 60.3 | 0.0 | 0.0 | 94.9 | $0.05 in / $0.05 out |
| 212 | Llama 3.2 3B Instruct llama-3.2-3b-instruct textinference | Meta | 0.0 Agentic | 5.2 | 68.9 | 0.0 | 0.0 | 98.8 | $0.01 in / $0.02 out |
| 213 | Llama 3.2 90B Instruct llama-3.2-90b-instruct multimodalvisionmulti-input reasoning | Meta | 0.0 Agentic | 16.3 | 11.3 | 0.0 | 0.0 | 54.9 | $0.35 in / $0.4 out |
| 214 | Llama 3.3 70B Instruct llama-3.3-70b-instruct textinference | Meta | 0.0 Agentic | 19.6 | 21.4 | 0.0 | 0.0 | 72.2 | $0.2 in / $0.2 out |
| 215 | Llama-3.3 Nemotron Super 49B v1 llama-3.3-nemotron-super-49b-v1 textinference | NVIDIA | 0.0 Agentic | 23.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 216 | Llama 4 Maverick llama-4-maverick multimodalvisionmulti-input reasoning | Meta | 0.0 Agentic | 35.4 | 55.8 | 0.0 | 0.0 | 57.1 | $0.17 in / $0.85 out |
| 217 | Llama 4 Scout llama-4-scout multimodalvisionmulti-input reasoning | Meta | 0.0 Agentic | 29.0 | 62.1 | 0.0 | 0.0 | 78.1 | $0.08 in / $0.3 out |
| 218 | LongCat-Flash-Thinking longcat-flash-thinking codeprogrammingtool use | Meituan | 0.0 Agentic | 50.2 | 0.0 | 0.0 | 21.6 | 0.0 | |
| 219 | Magistral Medium magistral-medium multimodalvisionmulti-input reasoning | Mistral AI | 0.0 Agentic | 22.2 | 0.0 | 0.0 | 0.0 | 0.0 | |
| 220 | Magistral Small 2506 magistral-small-2506 textinference | Mistral AI | 0.0 Agentic | 24.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
K-EXAONE-236B-A23B
LG AI Research
0.0
$0.6 in / $1 out
Kimi-k1.5
Moonshot AI
0.0
N/A
Kimi K2 0905
Moonshot AI
0.0
$0.6 in / $2.5 out
Want benchmark charts, model comparison, and pricing analytics?
Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.
Open full leaderboardRankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
| N/A |
| N/A |
Kimi K2 Base
Moonshot AI
0.0
N/A
Llama 3.1 405B Instruct
Meta
0.0
$0.89 in / $0.89 out
Llama 3.1 70B Instruct
Meta
0.0
$0.2 in / $0.2 out
Llama 3.1 8B Instruct
Meta
0.0
$0.03 in / $0.03 out
Llama 3.1 Nemotron 70B Instruct
NVIDIA
0.0
N/A
Llama 3.1 Nemotron Nano 8B V1
NVIDIA
0.0
N/A
Llama 3.1 Nemotron Ultra 253B v1
NVIDIA
0.0
N/A
Llama 3.2 11B Instruct
Meta
0.0
$0.05 in / $0.05 out
Llama 3.2 3B Instruct
Meta
0.0
$0.01 in / $0.02 out
Llama 3.2 90B Instruct
Meta
0.0
$0.35 in / $0.4 out
Llama 3.3 70B Instruct
Meta
0.0
$0.2 in / $0.2 out
Llama-3.3 Nemotron Super 49B v1
NVIDIA
0.0
N/A
Llama 4 Maverick
Meta
0.0
$0.17 in / $0.85 out
Llama 4 Scout
Meta
0.0
$0.08 in / $0.3 out
LongCat-Flash-Thinking
Meituan
0.0
N/A
Magistral Medium
Mistral AI
0.0
N/A
Magistral Small 2506
Mistral AI
0.0
N/A