Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.


© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.


  • Tracked models: 296
  • Providers: 27
  • Benchmarked: 253
  • Avg. index: 11.5


Rank | Model | Model ID | Tags | Provider | Score (Agentic) | Benchmarks | Inference | Agentic | Programming | Value | Price
--- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | ---
81 | Grok 4 Fast | grok-4-fast | multimodal, vision, multi-input reasoning | xAI | 14.7 | 57.6 | 68.8 | 14.7 | 0.0 | 67.2 | $0.2 in / $0.5 out
82 | o3-mini | o3-mini | code, programming, tool use | OpenAI | 11.9 | 25.6 | 70.4 | 11.9 | 12.2 | 41.6 | $1.1 in / $4.4 out
83 | GLM-4.7-Flash | glm-4.7-flash | code, programming, tool use | Zhipu AI | 11.4 | 38.2 | 29.7 | 11.4 | 20.7 | 72.1 | $0.07 in / $0.4 out
84 | GPT-5.4 nano | gpt-5.4-nano | multimodal, vision, multi-input reasoning | OpenAI | 9.7 | 45.6 | 76.5 | 9.7 | 10.0 | 57.1 | $0.2 in / $1.25 out
85 | GPT-4.1 mini | gpt-4.1-mini-2025-04-14 | multimodal, vision, multi-input reasoning | OpenAI | 8.9 | 20.7 | 90.6 | 8.9 | 2.6 | 56.8 | $0.4 in / $1.6 out
86 | Nemotron 3 Super (120B A12B) | nemotron-3-super-120b-a12b | code, programming, tool use | NVIDIA | 8.7 | 48.3 | 0.0 | 8.7 | 26.8 | 0.0 | N/A
87 | DeepSeek-V3.2-Speciale | deepseek-v3.2-speciale | code, programming, tool use | DeepSeek | 8.5 | 53.8 | 0.0 | 8.5 | 44.9 | 0.0 | N/A
88 | Sarvam-30B | sarvam-30b | code, programming, tool use | Sarvam AI | 8.2 | 46.4 | 0.0 | 8.2 | 5.2 | 0.0 | N/A
89 | Kimi K2-Instruct-0905 | kimi-k2-instruct-0905 | code, programming, tool use | Moonshot AI | 6.6 | 24.4 | 0.0 | 6.6 | 19.3 | 0.0 | N/A
90 | GPT OSS 20B | gpt-oss-20b | text, inference | OpenAI | 6.0 | 25.8 | 77.2 | 6.0 | 0.0 | 79.0 | $0.1 in / $0.5 out
91 | Qwen2.5 VL 72B Instruct | qwen2.5-vl-72b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 5.7 | 24.9 | 0.0 | 5.7 | 0.0 | 0.0 | N/A
92 | Nemotron 3 Nano (30B A3B) | nemotron-3-nano-30b-a3b | code, programming, tool use | NVIDIA | 3.3 | 45.4 | 66.0 | 3.3 | 4.4 | 90.9 | $0.06 in / $0.24 out
93 | Claude 3.5 Haiku | claude-3-5-haiku-20241022 | code, programming, tool use | Anthropic | 3.0 | 10.8 | 30.5 | 3.0 | 7.8 | 31.8 | $0.8 in / $4 out
94 | Qwen2.5 VL 32B Instruct | qwen2.5-vl-32b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 1.6 | 21.2 | 0.0 | 1.6 | 0.0 | 0.0 | N/A
95 | ChatGPT-4o Latest | chatgpt-4o-latest | multimodal, vision, multi-input reasoning | OpenAI | 0.0 | 56.0 | 63.8 | 0.0 | 0.0 | 32.0 | $2.5 in / $10 out
96 | Claude 3.5 Sonnet | claude-3-5-sonnet-20240620 | multimodal, vision, multi-input reasoning | Anthropic | 0.0 | 25.4 | 68.2 | 0.0 | 0.0 | 24.6 | $3 in / $15 out
97 | Claude 3 Haiku | claude-3-haiku-20240307 | multimodal, vision, multi-input reasoning | Anthropic | 0.0 | 5.8 | 61.8 | 0.0 | 0.0 | 57.9 | $0.25 in / $1.25 out
98 | Claude 3 Opus | claude-3-opus-20240229 | multimodal, vision, multi-input reasoning | Anthropic | 0.0 | 19.3 | 71.7 | 0.0 | 0.0 | 19.5 | $15 in / $75 out
99 | Claude 3 Sonnet | claude-3-sonnet-20240229 | multimodal, vision, multi-input reasoning | Anthropic | 0.0 | 10.0 | 30.5 | 0.0 | 0.0 | 13.3 | $3 in / $15 out
100 | Codestral-22B | codestral-22b | text, inference | Mistral AI | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | N/A

Page 5 of 15 · 296 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
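On this page the Score column matches the Agentic column in every row, which suggests the listing is simply ordered by the selected dimension. A minimal sketch of that kind of re-ranking follows, using four rows copied from the leaderboard above. The names `Row`, `ROWS`, and `rank_by` are hypothetical and only for illustration; this is not Skytells' actual ranking code, and the page does not publish how the overall index is weighted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    """One leaderboard entry (hypothetical structure for illustration)."""
    model: str
    benchmarks: float
    inference: float
    agentic: float
    programming: float
    value: float


# Scores copied from ranks 81, 82, 83, and 90 on this page.
ROWS = [
    Row("grok-4-fast", 57.6, 68.8, 14.7, 0.0, 67.2),
    Row("o3-mini", 25.6, 70.4, 11.9, 12.2, 41.6),
    Row("glm-4.7-flash", 38.2, 29.7, 11.4, 20.7, 72.1),
    Row("gpt-oss-20b", 25.8, 77.2, 6.0, 0.0, 79.0),
]


def rank_by(rows, dimension):
    """Return model IDs sorted by a single dimension, highest score first."""
    ordered = sorted(rows, key=lambda r: getattr(r, dimension), reverse=True)
    return [r.model for r in ordered]


if __name__ == "__main__":
    for dim in ("agentic", "value"):
        print(dim, rank_by(ROWS, dim))
```

Sorting the same rows by `agentic` reproduces the order shown on this page, while sorting by `value` moves `gpt-oss-20b` to the top, which illustrates why a model's rank can differ sharply between views.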