Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.
Tracked models: 296 · Providers: 27 · Benchmarked: 253 · Avg. index: 13.4

| Rank | Model | Model ID | Tags | Provider | Score (Programming) | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 221 | Ministral 3 (8B Instruct 2512) | `ministral-3-8b-instruct-2512` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 222 | Ministral 3 (3B Reasoning 2512) | `ministral-3b-latest` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 22.0 | 79.6 | 0.0 | 0.0 | 95.8 | $0.1 in / $0.1 out |
| 223 | Ministral 8B Instruct | `ministral-8b-instruct-2410` | text, inference | Mistral AI | 0.0 | 0.0 | 7.0 | 0.0 | 0.0 | 76.1 | $0.1 in / $0.1 out |
| 224 | Ministral 3 (8B Reasoning 2512) | `ministral-8b-latest` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 31.6 | 84.5 | 0.0 | 0.0 | 92.1 | $0.15 in / $0.15 out |
| 225 | Mistral Large 2 | `mistral-large-2-2407` | text, inference | Mistral AI | 0.0 | 0.0 | 21.4 | 0.0 | 0.0 | 26.7 | $2 in / $6 out |
| 226 | Mistral Large 3 | `mistral-large-3-2509` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 9.6 | 18.8 | 0.0 | 0.0 | 29.1 | $2 in / $5 out |
| 227 | Mistral Large 3 (675B Base) | `mistral-large-3-675b-base-2512` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 22.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 228 | Mistral Large 3 (675B Instruct 2512 Eagle) | `mistral-large-3-675B-instruct-2512-eagle` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 22.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 229 | Mistral Large 3 (675B Instruct 2512 NVFP4) | `mistral-large-3-675b-instruct-2512-nvfp4` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 22.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 230 | Mistral Large 3 (675B Instruct 2512) | `mistral-large-latest` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 22.2 | 40.1 | 0.0 | 0.0 | 44.5 | $0.5 in / $1.5 out |
| 231 | Mistral NeMo Instruct | `mistral-nemo-instruct-2407` | text, inference | Mistral AI | 0.0 | 0.0 | 21.4 | 0.0 | 0.0 | 77.3 | $0.15 in / $0.15 out |
| 232 | Mistral Small | `mistral-small-2409` | text, inference | Mistral AI | 0.0 | 0.0 | 2.1 | 0.0 | 0.0 | 51.9 | $0.2 in / $0.6 out |
| 233 | Mistral Small 3 24B Base | `mistral-small-24b-base-2501` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 6.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 234 | Mistral Small 3 24B Instruct | `mistral-small-24b-instruct-2501` | text, inference | Mistral AI | 0.0 | 14.2 | 21.4 | 0.0 | 0.0 | 80.7 | $0.07 in / $0.14 out |
| 235 | Mistral Small 3.1 24B Base | `mistral-small-3.1-24b-base-2503` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 13.4 | 64.8 | 0.0 | 0.0 | 85.3 | $0.1 in / $0.3 out |
| 236 | Mistral Small 3.1 24B Instruct | `mistral-small-3.1-24b-instruct-2503` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 15.7 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 237 | Mistral Small 3.2 24B Instruct | `mistral-small-3.2-24b-instruct-2506` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 19.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 238 | Mistral Small 4 | `mistral-small-latest` | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 34.7 | 55.2 | 0.0 | 0.0 | 66.8 | $0.15 in / $0.6 out |
| 239 | Nova Lite | `nova-lite` | multimodal, vision, multi-input reasoning | Amazon | 0.0 | 13.5 | 70.5 | 0.0 | 0.0 | 86.7 | $0.06 in / $0.24 out |
| 240 | Nova Micro | `nova-micro` | text, inference | Amazon | 0.0 | 9.1 | 52.7 | 0.0 | 0.0 | 91.3 | $0.03 in / $0.14 out |

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.