Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.


296 tracked models · 27 providers · 253 benchmarked · 13.4 avg. index

| Rank | Model | Model ID | Tags | Provider | Score (Programming) | Benchmarks | Inference | Agentic | Programming | Value | Price |
|------|-------|----------|------|----------|---------------------|------------|-----------|---------|-------------|-------|-------|
| 181 | Grok-4.1 Fast Non-Reasoning | grok-4-1-fast-non-reasoning | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 182 | Grok-4.1 Fast Reasoning | grok-4-1-fast-reasoning | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 183 | Grok-4.1 Thinking | grok-4.1-thinking-2025-11-17 | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 48.5 | 0.0 | 0.0 | 17.8 | $3 in / $15 out |
| 184 | Grok-4.20 Beta Non-Reasoning | grok-4.20-beta-0309-non-reasoning | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 97.2 | 0.0 | 0.0 | 27.7 | $2 in / $6 out |
| 185 | Grok-4.20 Beta Reasoning | grok-4.20-beta-0309-reasoning | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 97.2 | 0.0 | 0.0 | 27.7 | $2 in / $6 out |
| 186 | Grok-4.20 Multi-Agent Beta | grok-4.20-multi-agent-beta-0309 | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 187 | Grok 4 Fast | grok-4-fast | multimodal, vision, multi-input reasoning | xAI | 0.0 | 57.6 | 68.8 | 14.7 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 188 | Grok-4 Fast Non-Reasoning | grok-4-fast-non-reasoning | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 189 | Grok-4 Fast Reasoning | grok-4-fast-reasoning | multimodal, vision, multi-input reasoning | xAI | 0.0 | 0.0 | 68.8 | 0.0 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 190 | Grok-4 Heavy | grok-4-heavy | multimodal, vision, multi-input reasoning | xAI | 0.0 | 72.4 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 191 | Hermes 3 70B | hermes-3-70b | text, inference | Nous Research | 0.0 | 30.1 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 192 | Jamba 1.5 Large | jamba-1.5-large | text, inference | AI21 Labs | 0.0 | 8.1 | 33.6 | 0.0 | 0.0 | 25.2 | $2 in / $8 out |
| 193 | Jamba 1.5 Mini | jamba-1.5-mini | text, inference | AI21 Labs | 0.0 | 4.7 | 65.8 | 0.0 | 0.0 | 72.4 | $0.2 in / $0.4 out |
| 194 | K-EXAONE-236B-A23B | k-exaone-236b-a23b | multimodal, vision, multi-input reasoning | LG AI Research | 0.0 | 43.4 | 24.9 | 0.0 | 0.0 | 49.2 | $0.6 in / $1 out |
| 195 | Kimi-k1.5 | kimi-k1.5 | multimodal, vision, multi-input reasoning | Moonshot AI | 0.0 | 35.3 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 196 | Kimi K2 0905 | kimi-k2-0905 | text, inference | Moonshot AI | 0.0 | 44.0 | 66.0 | 0.0 | 0.0 | 40.1 | $0.6 in / $2.5 out |
| 197 | Kimi K2 Base | kimi-k2-base | text, inference | Moonshot AI | 0.0 | 26.9 | 0.0 | 0.0 | 0.0 | 0.0 | N/A |
| 198 | Llama 3.1 405B Instruct | llama-3.1-405b-instruct | text, inference | Meta | 0.0 | 20.0 | 21.4 | 0.0 | 0.0 | 44.5 | $0.89 in / $0.89 out |
| 199 | Llama 3.1 70B Instruct | llama-3.1-70b-instruct | text, inference | Meta | 0.0 | 11.2 | 21.4 | 0.0 | 0.0 | 72.2 | $0.2 in / $0.2 out |
| 200 | Llama 3.1 8B Instruct | llama-3.1-8b-instruct | text, inference | Meta | 0.0 | 3.2 | 26.7 | 0.0 | 0.0 | 83.9 | $0.03 in / $0.03 out |

Page 10 of 15 · 296 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.