Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

13.4

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
241

Nova Pro

nova-pro

multimodalvisionmulti-input reasoning
AAmazon

0.0

Programming

20.070.50.00.043.4$0.8 in / $3.2 out
242

Nemotron Nano 9B v2

nvidia-nemotron-nano-9b-v2

textinference
NNVIDIA

0.0

Programming

24.90.00.00.00.0N/A
243

o1-mini

o1-mini

textinference
OpenAI

0.0

Programming

25.761.80.00.030.2$3 in / $12 out
244

o1-pro

o1-pro

multimodalvisionmulti-input reasoning
OpenAI

0.0

Programming

47.10.00.00.00.0N/A
245

o3-pro

o3-pro-2025-06-10

multimodalvisionmulti-input reasoning
OpenAI

0.0

Programming

0.021.40.00.03.6
246

Phi-3.5-mini-instruct

phi-3.5-mini-instruct

multimodalvisionmulti-input reasoning
MMicrosoft

0.0

Programming

2.710.80.00.077.2$0.1 in / $0.1 out
247

Phi-3.5-MoE-instruct

phi-3.5-moe-instruct

multimodalvisionmulti-input reasoning
MMicrosoft

0.0

Programming

8.20.00.00.00.0N/A
248

Phi-3.5-vision-instruct

phi-3.5-vision-instruct

multimodalvisionmulti-input reasoning
MMicrosoft

0.0

Programming

2.30.00.00.00.0N/A
249

Phi 4

phi-4

textinference
MMicrosoft

0.0

Programming

15.69.00.00.077.2$0.07 in / $0.14 out
250

Phi 4 Mini

phi-4-mini

textinference
MMicrosoft

0.0

Programming

2.00.00.00.00.0N/A
251

Phi 4 Mini Reasoning

phi-4-mini-reasoning

textinference
MMicrosoft

0.0

Programming

21.70.00.00.00.0N/A
252

Phi-4-multimodal-instruct

phi-4-multimodal-instruct

multimodalvisionmulti-input reasoning
MMicrosoft

0.0

Programming

8.812.30.00.079.9
253

Phi 4 Reasoning

phi-4-reasoning

textinference
MMicrosoft

0.0

Programming

23.10.00.00.00.0N/A
254

Phi 4 Reasoning Plus

phi-4-reasoning-plus

textinference
MMicrosoft

0.0

Programming

31.50.00.00.00.0N/A
255

Pixtral-12B

pixtral-12b-2409

multimodalvisionmulti-input reasoning
Mistral AI

0.0

Programming

8.17.00.00.073.0
256

Pixtral Large

pixtral-large

multimodalvisionmulti-input reasoning
Mistral AI

0.0

Programming

27.87.00.00.022.3
257

QvQ-72B-Preview

qvq-72b-preview

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

0.0

Programming

38.20.00.00.00.0N/A
258

Qwen2.5 14B Instruct

qwen-2.5-14b-instruct

textinference
AAlibaba Cloud / Qwen Team

0.0

Programming

14.60.00.00.00.0N/A
259

Qwen2.5 32B Instruct

qwen-2.5-32b-instruct

textinference
AAlibaba Cloud / Qwen Team

0.0

Programming

18.60.00.00.00.0N/A
260

Qwen2.5 72B Instruct

qwen-2.5-72b-instruct

textinference
AAlibaba Cloud / Qwen Team

0.0

Programming

17.815.00.00.054.6$0.35 in / $0.4 out
241
A

Nova Pro

Amazon

0.0

$0.8 in / $3.2 out

242
N

Nemotron Nano 9B v2

NVIDIA

0.0

N/A

243

o1-mini

OpenAI

0.0

$3 in / $12 out

244

Page 13 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$20 in / $80 out
$0.05 in / $0.1 out
$0.15 in / $0.15 out
$2 in / $6 out

o1-pro

OpenAI

0.0

N/A

245

o3-pro

OpenAI

0.0

$20 in / $80 out

246
M

Phi-3.5-mini-instruct

Microsoft

0.0

$0.1 in / $0.1 out

247
M

Phi-3.5-MoE-instruct

Microsoft

0.0

N/A

248
M

Phi-3.5-vision-instruct

Microsoft

0.0

N/A

249
M

Phi 4

Microsoft

0.0

$0.07 in / $0.14 out

250
M

Phi 4 Mini

Microsoft

0.0

N/A

251
M

Phi 4 Mini Reasoning

Microsoft

0.0

N/A

252
M

Phi-4-multimodal-instruct

Microsoft

0.0

$0.05 in / $0.1 out

253
M

Phi 4 Reasoning

Microsoft

0.0

N/A

254
M

Phi 4 Reasoning Plus

Microsoft

0.0

N/A

255

Pixtral-12B

Mistral AI

0.0

$0.15 in / $0.15 out

256

Pixtral Large

Mistral AI

0.0

$2 in / $6 out

257
A

QvQ-72B-Preview

Alibaba Cloud / Qwen Team

0.0

N/A

258
A

Qwen2.5 14B Instruct

Alibaba Cloud / Qwen Team

0.0

N/A

259
A

Qwen2.5 32B Instruct

Alibaba Cloud / Qwen Team

0.0

N/A

260
A

Qwen2.5 72B Instruct

Alibaba Cloud / Qwen Team

0.0

$0.35 in / $0.4 out