Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model, ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency. Rankings are updated continuously from published evaluation data.


  • Tracked models: 296
  • Providers: 27
  • Benchmarked: 253
  • Avg. index: 30.8

Ranking views: Overall, Benchmarks, Inference, Agentic, Programming, Value / Price

| Rank | Model | Model ID | Tags | Provider | Score (Value / Price) | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | Granite 3.3 8B Instruct | granite-3.3-8b-instruct | multimodal, vision, multi-input reasoning | IBM | 56.7 | 0.0 | 29.7 | 0.0 | 0.0 | 56.7 | $0.5 in / $0.5 out |
| 82 | GPT-5 mini | gpt-5-mini-2025-08-07 | multimodal, vision, multi-input reasoning | OpenAI | 56.3 | 41.5 | 89.4 | 0.0 | 23.7 | 56.3 | $0.25 in / $2 out |
| 83 | Command R+ | command-r-plus-04-2024 | text, inference | Cohere | 55.4 | 0.0 | 32.5 | 0.0 | 0.0 | 55.4 | $0.25 in / $1 out |
| 84 | Gemini 1.0 Pro | gemini-1.0-pro | multimodal, vision, multi-input reasoning | Google | 55.4 | 3.2 | 57.2 | 0.0 | 0.0 | 55.4 | $0.5 in / $1.5 out |
| 85 | Llama 3.2 90B Instruct | llama-3.2-90b-instruct | multimodal, vision, multi-input reasoning | Meta | 54.9 | 16.3 | 11.3 | 0.0 | 0.0 | 54.9 | $0.35 in / $0.4 out |
| 86 | MiniMax M2 | minimax-m2 | code, programming, tool use | MiniMax | 54.9 | 31.9 | 84.0 | 41.1 | 42.4 | 54.9 | $0.3 in / $1.2 out |
| 87 | MiniMax M2.7 | minimax-m2.7 | code, programming, tool use | MiniMax | 54.9 | 0.0 | 52.2 | 44.9 | 40.1 | 54.9 | $0.3 in / $1.2 out |
| 88 | Qwen2.5 72B Instruct | qwen-2.5-72b-instruct | text, inference | Alibaba Cloud / Qwen Team | 54.5 | 17.8 | 15.0 | 0.0 | 0.0 | 54.5 | $0.35 in / $0.4 out |
| 89 | Devstral Medium | devstral-medium-2507 | code, programming, tool use | Mistral AI | 53.4 | 0.0 | 64.8 | 0.0 | 24.2 | 53.4 | $0.4 in / $2 out |
| 90 | Mistral Small | mistral-small-2409 | text, inference | Mistral AI | 51.9 | 0.0 | 2.1 | 0.0 | 0.0 | 51.9 | $0.2 in / $0.6 out |
| 91 | Qwen3-Next-80B-A3B-Instruct | qwen3-next-80b-a3b-instruct | text, inference | Alibaba Cloud / Qwen Team | 51.9 | 29.5 | 6.1 | 17.9 | 0.0 | 51.9 | $0.15 in / $1.5 out |
| 92 | Qwen3-Next-80B-A3B-Thinking | qwen3-next-80b-a3b-thinking | text, inference | Alibaba Cloud / Qwen Team | 51.9 | 44.7 | 6.1 | 41.7 | 0.0 | 51.9 | $0.15 in / $1.5 out |
| 93 | Gemini 3.1 Flash-Lite | gemini-3.1-flash-lite-preview | multimodal, vision, multi-input reasoning | Google | 50.5 | 56.0 | 84.0 | 0.0 | 0.0 | 50.5 | $0.25 in / $1.5 out |
| 94 | Grok Code Fast 1 | grok-code-fast-1 | code, programming, tool use | xAI | 49.7 | 0.0 | 47.7 | 0.0 | 38.8 | 49.7 | $0.2 in / $1.5 out |
| 95 | Qwen3 VL 235B A22B Instruct | qwen3-vl-235b-a22b-instruct | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 49.5 | 36.9 | 66.0 | 56.7 | 0.0 | 49.5 | $0.3 in / $1.5 out |
| 96 | GPT-3.5 Turbo | gpt-3.5-turbo-0125 | multimodal, vision, multi-input reasoning | OpenAI | 49.4 | 2.5 | 36.7 | 0.0 | 0.0 | 49.4 | $0.5 in / $1.5 out |
| 97 | K-EXAONE-236B-A23B | k-exaone-236b-a23b | multimodal, vision, multi-input reasoning | LG AI Research | 49.2 | 43.4 | 24.9 | 0.0 | 0.0 | 49.2 | $0.6 in / $1 out |
| 98 | Qwen3.5-35B-A3B | qwen3.5-35b-a3b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 46.4 | 56.9 | 66.0 | 43.3 | 33.6 | 46.4 | $0.25 in / $2 out |
| 99 | Qwen3 VL 8B Thinking | qwen3-vl-8b-thinking | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 45.6 | 35.6 | 66.0 | 23.5 | 0.0 | 45.6 | $0.18 in / $2.09 out |
| 100 | MiMo-V2-Omni | mimo-v2-omni | multimodal, vision, multi-input reasoning | Xiaomi | 44.8 | 0.0 | 58.6 | 0.0 | 54.4 | 44.8 | $0.4 in / $2 out |

Page 5 of 15 · 296 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
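
Because each model carries a separate score per dimension, the same rows reorder depending on which view is selected. The following is a minimal Python sketch of that idea, using four rows copied from this page. The `Row` class and `rank_by` helper are hypothetical names chosen for illustration, not part of any Skytells API, and the sketch only sorts by a published column; it does not reproduce how the scores themselves are computed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    """One leaderboard row (hypothetical structure, scores copied from this page)."""
    model: str
    benchmarks: float
    inference: float
    agentic: float
    programming: float
    value: float


# A small sample of rows from page 5 of the leaderboard.
ROWS = [
    Row("MiniMax M2", 31.9, 84.0, 41.1, 42.4, 54.9),
    Row("GPT-5 mini", 41.5, 89.4, 0.0, 23.7, 56.3),
    Row("Qwen3.5-35B-A3B", 56.9, 66.0, 43.3, 33.6, 46.4),
    Row("MiMo-V2-Omni", 0.0, 58.6, 0.0, 54.4, 44.8),
]


def rank_by(rows, dimension):
    """Return rows sorted by one score column, highest first."""
    return sorted(rows, key=lambda r: getattr(r, dimension), reverse=True)


if __name__ == "__main__":
    for dim in ("value", "programming"):
        print(dim, [r.model for r in rank_by(ROWS, dim)])
```

In this sample, GPT-5 mini leads when sorting by `value`, while MiMo-V2-Omni leads when sorting by `programming`, which is why a model's rank on one view says little about its rank on another.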