Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

  • Tracked models: 296
  • Providers: 27
  • Benchmarked: 253
  • Avg. index: 30.8

Ranking view: Value / Price (296 models)

| Rank | Model | Model ID | Tags | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|------|-------|----------|------|----------|-------|------------|-----------|---------|-------------|-------|-------|
| 161 | GPT-5.3 Codex | gpt-5.3-codex | text, text-to-text, coding | OpenAI | 19.6 | 0.0 | 49.0 | 0.0 | 52.2 | 19.6 | $1.75 in / $14 out |
| 162 | Claude 3 Opus | claude-3-opus-20240229 | multimodal, vision, multi-input reasoning | Anthropic | 19.5 | 19.3 | 71.7 | 0.0 | 0.0 | 19.5 | $15 in / $75 out |
| 163 | GPT-4 Turbo | gpt-4-turbo-2024-04-09 | text, inference | OpenAI | 18.8 | 16.9 | 52.7 | 0.0 | 0.0 | 18.8 | $10 in / $30 out |
| 164 | GPT-4 | gpt-4-0613 | multimodal, vision, multi-input reasoning | OpenAI | 18.7 | 6.8 | 54.9 | 0.0 | 0.0 | 18.7 | $30 in / $60 out |
| 165 | GPT-5.4 | gpt-5.4 | text, text-to-text, language | OpenAI | 18.3 | 75.9 | 51.5 | 61.8 | 63.9 | 18.3 | $2.5 in / $15 out |
| 166 | Grok-4.1 Thinking | grok-4.1-thinking-2025-11-17 | multimodal, vision, multi-input reasoning | xAI | 17.8 | 0.0 | 48.5 | 0.0 | 0.0 | 17.8 | $3 in / $15 out |
| 167 | Claude 3.7 Sonnet | claude-3-7-sonnet-20250219 | multimodal, vision, multi-input reasoning | Anthropic | 13.3 | 43.5 | 30.5 | 49.0 | 39.6 | 13.3 | $3 in / $15 out |
| 168 | Claude 3 Sonnet | claude-3-sonnet-20240229 | multimodal, vision, multi-input reasoning | Anthropic | 13.3 | 10.0 | 30.5 | 0.0 | 0.0 | 13.3 | $3 in / $15 out |
| 169 | Claude Sonnet 4.5 | claude-sonnet-4-5-20250929 | multimodal, vision, multi-input reasoning | Anthropic | 13.3 | 53.0 | 30.5 | 71.8 | 74.6 | 13.3 | $3 in / $15 out |
| 170 | Claude Sonnet 4.6 | claude-sonnet-4-6 | multimodal, vision, multi-input reasoning | Anthropic | 13.3 | 66.1 | 30.5 | 48.5 | 68.2 | 13.3 | $3 in / $15 out |
| 171 | o1-preview | o1-preview | code, programming, tool use | OpenAI | 11.8 | 41.8 | 33.0 | 0.0 | 9.5 | 11.8 | $15 in / $60 out |
| 172 | Claude Opus 4.5 | claude-opus-4-5-20251101 | multimodal, vision, multi-input reasoning | Anthropic | 10.7 | 56.1 | 30.5 | 42.5 | 74.2 | 10.7 | $5 in / $25 out |
| 173 | Claude Opus 4.6 | claude-opus-4-6 | multimodal, vision, multi-input reasoning | Anthropic | 10.7 | 79.5 | 43.1 | 59.3 | 73.3 | 10.7 | $5 in / $25 out |
| 174 | Claude Opus 4.7 | claude-opus-4-7 | multimodal, vision, multi-input reasoning | Anthropic | 10.7 | 76.8 | 43.1 | 68.6 | 81.4 | 10.7 | $5 in / $25 out |
| 175 | Gemma 3n E4B Instructed | gemma-3n-e4b-it | multimodal, vision, multi-input reasoning | Google | 10.3 | 1.3 | 20.3 | 0.0 | 0.0 | 10.3 | $20 in / $40 out |
| 176 | Claude Opus 4.1 | claude-opus-4-1-20250805 | multimodal, vision, multi-input reasoning | Anthropic | 7.2 | 47.9 | 30.5 | 66.8 | 62.1 | 7.2 | $15 in / $75 out |
| 177 | GPT-4.5 | gpt-4.5 | multimodal, vision, multi-input reasoning | OpenAI | 7.0 | 41.9 | 29.7 | 35.8 | 6.0 | 7.0 | $75 in / $150 out |
| 178 | GPT-5.5 | gpt-5.5 | multimodal, vision, multi-input reasoning | OpenAI | 6.6 | 80.4 | 84.0 | 76.5 | 66.3 | 6.6 | $5 in / $30 out |
| 179 | o1 | o1-2024-12-17 | multimodal, vision, multi-input reasoning | OpenAI | 4.9 | 42.9 | 19.4 | 44.7 | 6.5 | 4.9 | $15 in / $60 out |
| 180 | o3-pro | o3-pro-2025-06-10 | multimodal, vision, multi-input reasoning | OpenAI | 3.6 | 0.0 | 21.4 | 0.0 | 0.0 | 3.6 | $20 in / $80 out |

Page 9 of 15 · 296 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.