Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

32.1

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
221

GPT-5 nano

gpt-5-nano-2025-08-07

multimodalvisionmulti-input reasoning
OpenAI

0.0

Inference

26.30.00.011.80.0N/A
222

GPT OSS 20B High

gpt-oss-20b-high

textinference
OpenAI

0.0

Inference

53.70.00.00.00.0N/A
223

Granite 3.3 8B Base

granite-3.3-8b-base

multimodalvisionmulti-input reasoning
IIBM

0.0

Inference

0.00.00.00.00.0N/A
224

IBM Granite 4.0 Tiny Preview

granite-4.0-tiny-preview

textinference
IIBM

0.0

Inference

0.00.00.00.00.0N/A
225

Grok-1.5

grok-1.5

multimodalvisionmulti-input reasoning
xAI

0.0

Inference

8.60.00.00.00.0N/A
226

Grok-1.5V

grok-1.5v

multimodalvisionmulti-input reasoning
xAI

0.0

Inference

9.80.00.00.00.0N/A
227

Grok-2 Image 1212

grok-2-image-1212

textinference
xAI

0.0

Inference

0.00.00.00.00.0N/A
228

Grok-2 mini

grok-2-mini

multimodalvisionmulti-input reasoning
xAI

0.0

Inference

24.00.00.00.00.0N/A
229

Grok-4

grok-4

multimodalvisionmulti-input reasoning
xAI

0.0

Inference

51.50.00.00.00.0N/A
230

Grok-4.20 Multi-Agent Beta

grok-4.20-multi-agent-beta-0309

multimodalvisionmulti-input reasoning
xAI

0.0

Inference

0.00.00.00.00.0
231

Grok-4 Heavy

grok-4-heavy

multimodalvisionmulti-input reasoning
xAI

0.0

Inference

72.40.00.00.00.0N/A
232

Hermes 3 70B

hermes-3-70b

textinference
NNous Research

0.0

Inference

30.10.00.00.00.0N/A
233

Kimi-k1.5

kimi-k1.5

multimodalvisionmulti-input reasoning
Moonshot AI

0.0

Inference

35.30.00.00.00.0N/A
234

Kimi K2 Base

kimi-k2-base

textinference
Moonshot AI

0.0

Inference

26.90.00.00.00.0N/A
235

Kimi K2-Instruct-0905

kimi-k2-instruct-0905

codeprogrammingtool use
Moonshot AI

0.0

Inference

24.40.06.619.30.0
236

Kimi K2-Thinking-0905

kimi-k2-thinking-0905

codeprogrammingtool use
Moonshot AI

0.0

Inference

69.20.052.762.00.0
237

Llama 3.1 Nemotron 70B Instruct

llama-3.1-nemotron-70b-instruct

textinference
NNVIDIA

0.0

Inference

0.00.00.00.00.0N/A
238

Llama 3.1 Nemotron Nano 8B V1

llama-3.1-nemotron-nano-8b-v1

textinference
NNVIDIA

0.0

Inference

16.30.00.00.00.0N/A
239

Llama 3.1 Nemotron Ultra 253B v1

llama-3.1-nemotron-ultra-253b-v1

textinference
NNVIDIA

0.0

Inference

35.40.00.00.00.0N/A
240

Llama-3.3 Nemotron Super 49B v1

llama-3.3-nemotron-super-49b-v1

textinference
NNVIDIA

0.0

Inference

23.00.00.00.00.0N/A
221

GPT-5 nano

OpenAI

0.0

N/A

222

GPT OSS 20B High

OpenAI

0.0

N/A

223
I

Granite 3.3 8B Base

IBM

0.0

N/A

224

Page 12 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

N/A
N/A
N/A
I

IBM Granite 4.0 Tiny Preview

IBM

0.0

N/A

225

Grok-1.5

xAI

0.0

N/A

226

Grok-1.5V

xAI

0.0

N/A

227

Grok-2 Image 1212

xAI

0.0

N/A

228

Grok-2 mini

xAI

0.0

N/A

229

Grok-4

xAI

0.0

N/A

230

Grok-4.20 Multi-Agent Beta

xAI

0.0

N/A

231

Grok-4 Heavy

xAI

0.0

N/A

232
N

Hermes 3 70B

Nous Research

0.0

N/A

233

Kimi-k1.5

Moonshot AI

0.0

N/A

234

Kimi K2 Base

Moonshot AI

0.0

N/A

235

Kimi K2-Instruct-0905

Moonshot AI

0.0

N/A

236

Kimi K2-Thinking-0905

Moonshot AI

0.0

N/A

237
N

Llama 3.1 Nemotron 70B Instruct

NVIDIA

0.0

N/A

238
N

Llama 3.1 Nemotron Nano 8B V1

NVIDIA

0.0

N/A

239
N

Llama 3.1 Nemotron Ultra 253B v1

NVIDIA

0.0

N/A

240
N

Llama-3.3 Nemotron Super 49B v1

NVIDIA

0.0

N/A