Skytells
© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

  • Tracked models: 296
  • Providers: 27
  • Benchmarked: 253
  • Avg. index: 30.8

Score shown: Value / Price

| Rank | Model | ID | Tags | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 101 | Llama 3.1 405B Instruct | llama-3.1-405b-instruct | text, inference | Meta | 44.5 | 20.0 | 21.4 | 0.0 | 0.0 | 44.5 | $0.89 in / $0.89 out |
| 102 | Mistral Large 3 (675B Instruct 2512) | mistral-large-latest | multimodal, vision, multi-input reasoning | Mistral AI | 44.5 | 22.2 | 40.1 | 0.0 | 0.0 | 44.5 | $0.5 in / $1.5 out |
| 103 | Qwen3.5-27B | qwen3.5-27b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 44.0 | 61.8 | 66.0 | 46.5 | 41.4 | 44.0 | $0.3 in / $2.4 out |
| 104 | Nova Pro | nova-pro | multimodal, vision, multi-input reasoning | Amazon | 43.2 | 20.0 | 70.5 | 0.0 | 0.0 | 43.2 | $0.8 in / $3.2 out |
| 105 | GLM-4.6 | glm-4.6 | multimodal, vision, multi-input reasoning | Zhipu AI | 42.9 | 46.5 | 34.5 | 37.3 | 45.7 | 42.9 | $0.55 in / $2.19 out |
| 106 | Gemini 2.5 Flash | gemini-2.5-flash | multimodal, vision, multi-input reasoning | Google | 42.6 | 39.6 | 62.8 | 0.0 | 22.9 | 42.6 | $0.3 in / $2.5 out |
| 107 | MiniMax M1 80K | minimax-m1-80k | code, programming, tool use | MiniMax | 41.8 | 24.2 | 84.0 | 20.9 | 19.0 | 41.8 | $0.55 in / $2.2 out |
| 108 | o3-mini | o3-mini | code, programming, tool use | OpenAI | 41.6 | 25.6 | 70.4 | 11.9 | 12.2 | 41.6 | $1.1 in / $4.4 out |
| 109 | o4-mini | o4-mini | multimodal, vision, multi-input reasoning | OpenAI | 41.6 | 48.5 | 70.4 | 37.6 | 31.9 | 41.6 | $1.1 in / $4.4 out |
| 110 | GLM-4.7 | glm-4.7 | multimodal, vision, multi-input reasoning | Zhipu AI | 40.7 | 62.4 | 52.2 | 27.6 | 43.8 | 40.7 | $0.6 in / $2.2 out |
| 111 | Kimi K2 0905 | kimi-k2-0905 | text, inference | Moonshot AI | 40.1 | 44.0 | 66.0 | 0.0 | 0.0 | 40.1 | $0.6 in / $2.5 out |
| 112 | Qwen3-235B-A22B-Thinking-2507 | qwen3-235b-a22b-thinking-2507 | text, inference | Alibaba Cloud / Qwen Team | 39.6 | 46.4 | 66.0 | 26.8 | 0.0 | 39.6 | $0.3 in / $3 out |
| 113 | Gemini 3 Flash | gemini-3-flash-preview | multimodal, vision, multi-input reasoning | Google | 39.0 | 71.1 | 84.0 | 41.2 | 65.1 | 39.0 | $0.5 in / $3 out |
| 114 | Kimi K2.5 | kimi-k2.5 | multimodal, vision, multi-input reasoning | Moonshot AI | 38.2 | 67.9 | 66.0 | 48.9 | 47.7 | 38.2 | $0.6 in / $3 out |
| 115 | Qwen3.5-122B-A10B | qwen3.5-122b-a10b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 38.2 | 64.5 | 66.0 | 50.5 | 40.5 | 38.2 | $0.4 in / $3.2 out |
| 116 | Claude Haiku 4.5 | claude-haiku-4-5-20251001 | multimodal, vision, multi-input reasoning | Anthropic | 37.7 | 32.7 | 61.8 | 54.2 | 56.6 | 37.7 | $1 in / $5 out |
| 117 | Qwen3 VL 235B A22B Thinking | qwen3-vl-235b-a22b-thinking | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 37.4 | 37.7 | 66.0 | 40.2 | 0.0 | 37.4 | $0.45 in / $3.49 out |
| 118 | MiMo-V2-Pro | mimo-v2-pro | code, programming, tool use | Xiaomi | 36.5 | 0.0 | 84.0 | 0.0 | 65.1 | 36.5 | $1 in / $3 out |
| 119 | Qwen3.5-397B-A17B | qwen3.5-397b-a17b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 35.4 | 58.0 | 66.0 | 33.0 | 59.5 | 35.4 | $0.6 in / $3.6 out |
| 120 | DeepSeek-R1 | deepseek-r1 | text, inference | DeepSeek | 35.1 | 0.0 | 14.3 | 0.0 | 0.0 | 35.1 | $0.55 in / $2.19 out |

Page 6 of 15 · 296 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
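
The prices listed above take the form "$X in / $Y out". As a rough illustration of how such a pair could be turned into a per-request cost, here is a minimal Python sketch. The leaderboard does not state the pricing unit, so the sketch assumes USD per 1,000,000 tokens. The names `Price`, `parse_price`, and `request_cost` are hypothetical helpers introduced for this example and are not part of any Skytells API.

```python
import re
from dataclasses import dataclass

# ASSUMPTION: the page does not state the pricing unit. This sketch treats
# "$X in / $Y out" as USD per 1,000,000 input / output tokens.
TOKENS_PER_UNIT = 1_000_000

# Matches price strings exactly as they appear on the page, e.g. "$0.3 in / $2.4 out".
PRICE_RE = re.compile(r"\$(\d+(?:\.\d+)?) in / \$(\d+(?:\.\d+)?) out")


@dataclass(frozen=True)
class Price:
    """Hypothetical container for one listed price pair."""
    usd_in: float
    usd_out: float


def parse_price(text: str) -> Price:
    """Parse a listed price string into a Price, or raise ValueError."""
    match = PRICE_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized price string: {text!r}")
    return Price(float(match.group(1)), float(match.group(2)))


def request_cost(price: Price, tokens_in: int, tokens_out: int) -> float:
    """Estimated USD cost of one request under the per-million-token assumption."""
    total = tokens_in * price.usd_in + tokens_out * price.usd_out
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Price strings copied from the leaderboard rows above.
    listed = {
        "qwen3.5-27b": "$0.3 in / $2.4 out",
        "o4-mini": "$1.1 in / $4.4 out",
        "claude-haiku-4-5-20251001": "$1 in / $5 out",
    }
    for model_id, text in listed.items():
        cost = request_cost(parse_price(text), tokens_in=10_000, tokens_out=2_000)
        print(f"{model_id}: ${cost:.4f}")
```

Because output tokens are priced several times higher than input tokens for most models on this page, the ratio of output to input tokens in a workload can matter more than the headline input price when comparing rows.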