Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

  • Tracked models: 296
  • Providers: 27
  • Benchmarked: 253
  • Avg. index: 13.4

| Rank | Model | ID | Tags | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 201 | Llama 3.1 Nemotron 70B Instruct | llama-3.1-nemotron-70b-instruct | text, inference | NVIDIA | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 202 | Llama 3.1 Nemotron Nano 8B V1 | llama-3.1-nemotron-nano-8b-v1 | text, inference | NVIDIA | 0.0 | 16.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 203 | Llama 3.1 Nemotron Ultra 253B v1 | llama-3.1-nemotron-ultra-253b-v1 | text, inference | NVIDIA | 0.0 | 35.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 204 | Llama 3.2 11B Instruct | llama-3.2-11b-instruct | multimodal, vision, multi-input reasoning | Meta | 0.0 | 4.0 | 60.3 | 0.0 | 0.0 | 94.9 | $0.05 in / $0.05 out |
| 205 | Llama 3.2 3B Instruct | llama-3.2-3b-instruct | text, inference | Meta | 0.0 | 5.2 | 68.9 | 0.0 | 0.0 | 98.8 | $0.01 in / $0.02 out |
| 206 | Llama 3.2 90B Instruct | llama-3.2-90b-instruct | multimodal, vision, multi-input reasoning | Meta | 0.0 | 16.3 | 11.3 | 0.0 | 0.0 | 54.9 | $0.35 in / $0.4 out |
| 207 | Llama 3.3 70B Instruct | llama-3.3-70b-instruct | text, inference | Meta | 0.0 | 19.6 | 21.4 | 0.0 | 0.0 | 72.2 | $0.2 in / $0.2 out |
| 208 | Llama-3.3 Nemotron Super 49B v1 | llama-3.3-nemotron-super-49b-v1 | text, inference | NVIDIA | 0.0 | 23.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 209 | Llama 4 Maverick | llama-4-maverick | multimodal, vision, multi-input reasoning | Meta | 0.0 | 35.4 | 55.8 | 0.0 | 0.0 | 57.1 | $0.17 in / $0.85 out |
| 210 | Llama 4 Scout | llama-4-scout | multimodal, vision, multi-input reasoning | Meta | 0.0 | 29.0 | 62.1 | 0.0 | 0.0 | 78.1 | $0.08 in / $0.3 out |
| 211 | Magistral Medium | magistral-medium | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 22.2 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 212 | Magistral Small 2506 | magistral-small-2506 | text, inference | Mistral AI | 0.0 | 24.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 213 | MedGemma 4B IT | medgemma-4b-it | multimodal, vision, multi-input reasoning | Google | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 214 | MiniCPM-SALA | minicpm-sala | text, inference | OpenBMB | 0.0 | 27.5 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 215 | Ministral 3 (14B Reasoning 2512) | ministral-14b-latest | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 37.7 | 77.0 | 0.0 | 0.0 | 84.8 | $0.2 in / $0.2 out |
| 216 | Ministral 3 (14B Base 2512) | ministral-3-14b-base-2512 | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 217 | Ministral 3 (14B Instruct 2512) | ministral-3-14b-instruct-2512 | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 218 | Ministral 3 (3B Base 2512) | ministral-3-3b-base-2512 | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 219 | Ministral 3 (3B Instruct 2512) | ministral-3-3b-instruct-2512 | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 220 | Ministral 3 (8B Base 2512) | ministral-3-8b-base-2512 | multimodal, vision, multi-input reasoning | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
Page 11 of 15 · 296 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
