Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

309

Tracked models

27

Providers

264

Benchmarked

11.8

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

309 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
81

DeepSeek-V3.2 (Thinking)

deepseek-reasoner

codeprogrammingtool use
DeepSeek

15.0

Agentic

51.80.015.043.50.0N/A
82

DeepSeek-V3.2

deepseek-v3.2

codeprogrammingtool use
DeepSeek

15.0

Agentic

56.70.015.043.50.0N/A
83

GPT-4o

gpt-4o-2024-08-06

multimodalvisionmulti-input reasoning
OpenAI

14.9

Agentic

30.439.414.94.026.8
84

DeepSeek-V3.1

deepseek-v3.1

codeprogrammingtool use
DeepSeek

14.0

Agentic

37.50.014.026.40.0N/A
85

Grok 4 Fast

grok-4-fast

multimodalvisionmulti-input reasoning
xAI

13.7

Agentic

57.162.113.70.073.7$0.2 in / $0.5 out
86

Kimi K2 Instruct

kimi-k2-instruct

codeprogrammingtool use
Moonshot AI

13.5

Agentic

23.80.013.514.00.0N/A
87

Qwen3.6-35B-A3B

qwen3.6-35b-a3b

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

13.5

Agentic

54.20.013.525.10.0N/A
88

Nova 2 Lite

nova-2-lite

multimodalvisionmulti-input reasoning
AAmazon

13.0

Agentic

42.872.213.027.050.0$0.3 in / $2.5 out
89

o3-mini

o3-mini

codeprogrammingtool use
OpenAI

11.9

Agentic

26.80.011.912.90.0N/A
90

GLM-4.7-Flash

glm-4.7-flash

codeprogrammingtool use
ZZhipu AI

10.7

Agentic

37.10.010.719.00.0N/A
91

GPT-5.4 nano

gpt-5.4-nano

multimodalvisionmulti-input reasoning
OpenAI

10.4

Agentic

46.056.310.410.770.9$0.2 in / $1.25 out
92

GPT-4.1 mini

gpt-4.1-mini-2025-04-14

multimodalvisionmulti-input reasoning
OpenAI

8.9

Agentic

20.287.88.92.465.6
93

Nemotron 3 Super (120B A12B)

nemotron-3-super-120b-a12b

codeprogrammingtool use
NNVIDIA

8.1

Agentic

47.30.08.124.50.0N/A
94

DeepSeek-V3.2-Speciale

deepseek-v3.2-speciale

codeprogrammingtool use
DeepSeek

7.6

Agentic

53.00.07.643.50.0
95

Sarvam-30B

sarvam-30b

codeprogrammingtool use
SSarvam AI

7.6

Agentic

45.70.07.64.70.0N/A
96

GPT OSS 20B

gpt-oss-20b

textinference
OpenAI

6.0

Agentic

24.80.06.00.00.0N/A
97

Kimi K2-Instruct-0905

kimi-k2-instruct-0905

codeprogrammingtool use
Moonshot AI

6.0

Agentic

23.80.06.018.10.0
98

Qwen2.5 VL 72B Instruct

qwen2.5-vl-72b

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

5.6

Agentic

24.10.05.60.00.0N/A
99

Claude 3.5 Haiku

claude-3-5-haiku-20241022

codeprogrammingtool use
Anthropic

3.0

Agentic

10.50.03.07.10.0
100

Nemotron 3 Nano (30B A3B)

nemotron-3-nano-30b-a3b

codeprogrammingtool use
NNVIDIA

3.0

Agentic

44.541.13.04.0100.0$0.06 in / $0.24 out
81

DeepSeek-V3.2 (Thinking)

DeepSeek

15.0

N/A

82

DeepSeek-V3.2

DeepSeek

15.0

N/A

83

GPT-4o

OpenAI

14.9

$2.5 in / $10 out

84

Page 5 of 16 · 309 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$2.5 in / $10 out
$0.4 in / $1.6 out
N/A
N/A
N/A

DeepSeek-V3.1

DeepSeek

14.0

N/A

85

Grok 4 Fast

xAI

13.7

$0.2 in / $0.5 out

86

Kimi K2 Instruct

Moonshot AI

13.5

N/A

87
A

Qwen3.6-35B-A3B

Alibaba Cloud / Qwen Team

13.5

N/A

88
A

Nova 2 Lite

Amazon

13.0

$0.3 in / $2.5 out

89

o3-mini

OpenAI

11.9

N/A

90
Z

GLM-4.7-Flash

Zhipu AI

10.7

N/A

91

GPT-5.4 nano

OpenAI

10.4

$0.2 in / $1.25 out

92

GPT-4.1 mini

OpenAI

8.9

$0.4 in / $1.6 out

93
N

Nemotron 3 Super (120B A12B)

NVIDIA

8.1

N/A

94

DeepSeek-V3.2-Speciale

DeepSeek

7.6

N/A

95
S

Sarvam-30B

Sarvam AI

7.6

N/A

96

GPT OSS 20B

OpenAI

6.0

N/A

97

Kimi K2-Instruct-0905

Moonshot AI

6.0

N/A

98
A

Qwen2.5 VL 72B Instruct

Alibaba Cloud / Qwen Team

5.6

N/A

99

Claude 3.5 Haiku

Anthropic

3.0

N/A

100
N

Nemotron 3 Nano (30B A3B)

NVIDIA

3.0

$0.06 in / $0.24 out