Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

11.5

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
181

Grok-2

grok-2

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

27.138.30.00.025.4$2 in / $10 out
182

Grok-2 Image 1212

grok-2-image-1212

textinference
xAI

0.0

Agentic

0.00.00.00.00.0N/A
183

Grok-2 mini

grok-2-mini

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

24.00.00.00.00.0N/A
184

Grok-3

grok-3

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

59.352.70.00.022.7$3 in / $15 out
185

Grok-3 Mini

grok-3-mini

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

53.152.70.00.065.6$0.3 in / $0.5 out
186

Grok-4

grok-4

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

51.50.00.00.00.0N/A
187

Grok-4.1

grok-4.1-2025-11-17

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.064.90.00.022.7$3 in / $15 out
188

Grok-4.1 Fast Non-Reasoning

grok-4-1-fast-non-reasoning

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.068.80.00.067.2
189

Grok-4.1 Fast Reasoning

grok-4-1-fast-reasoning

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.068.80.00.067.2
190

Grok-4.1 Thinking

grok-4.1-thinking-2025-11-17

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.048.50.00.017.8
191

Grok-4.20 Beta Non-Reasoning

grok-4.20-beta-0309-non-reasoning

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.097.20.00.027.7
192

Grok-4.20 Beta Reasoning

grok-4.20-beta-0309-reasoning

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.097.20.00.027.7
193

Grok-4.20 Multi-Agent Beta

grok-4.20-multi-agent-beta-0309

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.00.00.00.00.0
194

Grok-4 Fast Non-Reasoning

grok-4-fast-non-reasoning

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.068.80.00.067.2
195

Grok-4 Fast Reasoning

grok-4-fast-reasoning

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

0.068.80.00.067.2
196

Grok-4 Heavy

grok-4-heavy

multimodalvisionmulti-input reasoning
xAI

0.0

Agentic

72.40.00.00.00.0N/A
197

Grok Code Fast 1

grok-code-fast-1

codeprogrammingtool use
xAI

0.0

Agentic

0.047.70.038.849.7$0.2 in / $1.5 out
198

Hermes 3 70B

hermes-3-70b

textinference
NNous Research

0.0

Agentic

30.10.00.00.00.0N/A
199

Jamba 1.5 Large

jamba-1.5-large

textinference
AAI21 Labs

0.0

Agentic

8.133.60.00.025.2$2 in / $8 out
200

Jamba 1.5 Mini

jamba-1.5-mini

textinference
AAI21 Labs

0.0

Agentic

4.765.80.00.072.4$0.2 in / $0.4 out
181

Grok-2

xAI

0.0

$2 in / $10 out

182

Grok-2 Image 1212

xAI

0.0

N/A

183

Grok-2 mini

xAI

0.0

N/A

184

Page 10 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$0.2 in / $0.5 out
$0.2 in / $0.5 out
$3 in / $15 out
$2 in / $6 out
$2 in / $6 out
N/A
$0.2 in / $0.5 out
$0.2 in / $0.5 out

Grok-3

xAI

0.0

$3 in / $15 out

185

Grok-3 Mini

xAI

0.0

$0.3 in / $0.5 out

186

Grok-4

xAI

0.0

N/A

187

Grok-4.1

xAI

0.0

$3 in / $15 out

188

Grok-4.1 Fast Non-Reasoning

xAI

0.0

$0.2 in / $0.5 out

189

Grok-4.1 Fast Reasoning

xAI

0.0

$0.2 in / $0.5 out

190

Grok-4.1 Thinking

xAI

0.0

$3 in / $15 out

191

Grok-4.20 Beta Non-Reasoning

xAI

0.0

$2 in / $6 out

192

Grok-4.20 Beta Reasoning

xAI

0.0

$2 in / $6 out

193

Grok-4.20 Multi-Agent Beta

xAI

0.0

N/A

194

Grok-4 Fast Non-Reasoning

xAI

0.0

$0.2 in / $0.5 out

195

Grok-4 Fast Reasoning

xAI

0.0

$0.2 in / $0.5 out

196

Grok-4 Heavy

xAI

0.0

N/A

197

Grok Code Fast 1

xAI

0.0

$0.2 in / $1.5 out

198
N

Hermes 3 70B

Nous Research

0.0

N/A

199
A

Jamba 1.5 Large

AI21 Labs

0.0

$2 in / $8 out

200
A

Jamba 1.5 Mini

AI21 Labs

0.0

$0.2 in / $0.4 out