Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

13.4

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
81

MiniMax M1 40K

minimax-m1-40k

codeprogrammingtool use
MiniMax

18.1

Programming

22.60.026.818.10.0N/A
82

GPT-4.1

gpt-4.1-2025-04-14

multimodalvisionmulti-input reasoning
OpenAI

17.3

Programming

28.775.932.817.334.6
83

Kimi K2 Instruct

kimi-k2-instruct

codeprogrammingtool use
Moonshot AI

15.3

Programming

24.446.114.815.362.1
84

Devstral Small 1.1

devstral-small-2507

codeprogrammingtool use
Mistral AI

14.7

Programming

0.064.80.014.785.3
85

Claude 3.5 Sonnet

claude-3-5-sonnet-20241022

multimodalvisionmulti-input reasoning
Anthropic

12.9

Programming

33.768.238.712.924.6
86

o3-mini

o3-mini

codeprogrammingtool use
OpenAI

12.2

Programming

25.670.411.912.241.6$1.1 in / $4.4 out
87

Sarvam-105B

sarvam-105b

codeprogrammingtool use
SSarvam AI

12.1

Programming

42.90.017.912.10.0N/A
88

GPT-5 nano

gpt-5-nano-2025-08-07

multimodalvisionmulti-input reasoning
OpenAI

11.8

Programming

26.30.00.011.80.0
89

DeepSeek-V3

deepseek-v3

codeprogrammingtool use
DeepSeek

10.4

Programming

27.358.00.010.460.5$0.27 in / $1.1 out
90

GPT-5.4 nano

gpt-5.4-nano

multimodalvisionmulti-input reasoning
OpenAI

10.0

Programming

45.676.59.710.057.1
91

o1-preview

o1-preview

codeprogrammingtool use
OpenAI

9.5

Programming

41.833.00.09.511.8$15 in / $60 out
92

Claude 3.5 Haiku

claude-3-5-haiku-20241022

codeprogrammingtool use
Anthropic

7.8

Programming

10.830.53.07.831.8
93

DeepSeek-R1-0528

deepseek-r1-0528

codeprogrammingtool use
DeepSeek

6.6

Programming

50.114.30.06.635.1$0.55 in / $2.19 out
94

o1

o1-2024-12-17

multimodalvisionmulti-input reasoning
OpenAI

6.5

Programming

42.919.444.76.54.9$15 in / $60 out
95

GPT-4.5

gpt-4.5

multimodalvisionmulti-input reasoning
OpenAI

6.0

Programming

41.929.735.86.07.0$75 in / $150 out
96

Sarvam-30B

sarvam-30b

codeprogrammingtool use
SSarvam AI

5.2

Programming

46.40.08.25.20.0N/A
97

Nemotron 3 Nano (30B A3B)

nemotron-3-nano-30b-a3b

codeprogrammingtool use
NNVIDIA

4.4

Programming

45.466.03.34.490.9$0.06 in / $0.24 out
98

GPT-4o

gpt-4o-2024-08-06

multimodalvisionmulti-input reasoning
OpenAI

4.3

Programming

31.546.714.94.326.8
99

Gemini 2.5 Flash-Lite

gemini-2.5-flash-lite

multimodalvisionmulti-input reasoning
Google

3.5

Programming

21.432.80.03.564.1
100

GPT-4.1 mini

gpt-4.1-mini-2025-04-14

multimodalvisionmulti-input reasoning
OpenAI

2.6

Programming

20.790.68.92.656.8
81

MiniMax M1 40K

MiniMax

18.1

N/A

82

GPT-4.1

OpenAI

17.3

$2 in / $8 out

83

Kimi K2 Instruct

Moonshot AI

15.3

$0.5 in / $0.5 out

84

Page 5 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$2 in / $8 out
$0.5 in / $0.5 out
$0.1 in / $0.3 out
$3 in / $15 out
N/A
$0.2 in / $1.25 out
$0.8 in / $4 out
$2.5 in / $10 out
$0.1 in / $0.4 out
$0.4 in / $1.6 out

Devstral Small 1.1

Mistral AI

14.7

$0.1 in / $0.3 out

85

Claude 3.5 Sonnet

Anthropic

12.9

$3 in / $15 out

86

o3-mini

OpenAI

12.2

$1.1 in / $4.4 out

87
S

Sarvam-105B

Sarvam AI

12.1

N/A

88

GPT-5 nano

OpenAI

11.8

N/A

89

DeepSeek-V3

DeepSeek

10.4

$0.27 in / $1.1 out

90

GPT-5.4 nano

OpenAI

10.0

$0.2 in / $1.25 out

91

o1-preview

OpenAI

9.5

$15 in / $60 out

92

Claude 3.5 Haiku

Anthropic

7.8

$0.8 in / $4 out

93

DeepSeek-R1-0528

DeepSeek

6.6

$0.55 in / $2.19 out

94

o1

OpenAI

6.5

$15 in / $60 out

95

GPT-4.5

OpenAI

6.0

$75 in / $150 out

96
S

Sarvam-30B

Sarvam AI

5.2

N/A

97
N

Nemotron 3 Nano (30B A3B)

NVIDIA

4.4

$0.06 in / $0.24 out

98

GPT-4o

OpenAI

4.3

$2.5 in / $10 out

99

Gemini 2.5 Flash-Lite

Google

3.5

$0.1 in / $0.4 out

100

GPT-4.1 mini

OpenAI

2.6

$0.4 in / $1.6 out