Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency, updated continuously from published evaluation data.

  • Tracked models: 294
  • Providers: 27
  • Benchmarked: 251
  • Avg. index: 13.2

Overall | Benchmarks | Inference | Agentic | Programming | Value / Price

294 models

Category shown for every row: Programming

| Rank | Model | Model ID | Tags | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 21 | GLM-5.1 | glm-5.1 | code, programming, tool use | Zhipu AI | 58.3 | 67.1 | 46.6 | 54.4 | 58.3 | 30.6 | $1.4 in / $4.4 out |
| 22 | Gemini 3 Pro | gemini-3-pro-preview | multimodal, vision, multi-input reasoning | Google | 57.4 | 73.3 | 0.0 | 63.8 | 57.4 | 0.0 | N/A |
| 23 | Claude Haiku 4.5 | claude-haiku-4-5-20251001 | multimodal, vision, multi-input reasoning | Anthropic | 57.2 | 32.9 | 61.2 | 54.2 | 57.2 | 37.7 | $1 in / $5 out |
| 24 | GPT-5.1 | gpt-5.1-2025-11-13 | multimodal, vision, multi-input reasoning | OpenAI | 57.2 | 65.0 | 71.4 | 0.0 | 57.2 | 31.9 | $1.25 in / $10 out |
| 25 | GPT-5.1 Instant | gpt-5.1-instant-2025-11-12 | multimodal, vision, multi-input reasoning | OpenAI | 57.2 | 65.0 | 71.4 | 0.0 | 57.2 | 31.9 | $1.25 in / $10 out |
| 26 | GPT-5.1 Thinking | gpt-5.1-thinking-2025-11-12 | multimodal, vision, multi-input reasoning | OpenAI | 57.2 | 65.0 | 55.1 | 0.0 | 57.2 | 27.0 | $1.25 in / $10 out |
| 27 | MiniMax M2.5 | minimax-m2.5 | code, programming, tool use | MiniMax | 56.3 | 0.0 | 73.9 | 53.0 | 56.3 | 57.7 | $0.3 in / $1.2 out |
| 28 | MiMo-V2-Omni | mimo-v2-omni | multimodal, vision, multi-input reasoning | Xiaomi | 55.6 | 0.0 | 59.2 | 0.0 | 55.6 | 44.7 | $0.4 in / $2 out |
| 29 | GPT-5 Codex | gpt-5-codex-2025-09-15 | code, programming, tool use | OpenAI | 54.3 | 0.0 | 0.0 | 0.0 | 54.3 | 0.0 | N/A |
| 30 | Step-3.5-Flash | step-3.5-flash | code, programming, tool use | StepFun | 53.0 | 62.3 | 63.2 | 45.3 | 53.0 | 82.1 | $0.1 in / $0.4 out |
| 31 | GPT-5 | gpt-5-2025-08-07 | multimodal, vision, multi-input reasoning | OpenAI | 51.7 | 64.4 | 0.0 | 29.0 | 51.7 | 0.0 | N/A |
| 32 | GPT-5.1 Codex | gpt-5.1-codex | multimodal, vision, multi-input reasoning | OpenAI | 51.2 | 0.0 | 48.6 | 0.0 | 51.2 | 25.1 | $1.25 in / $10 out |
| 33 | MiniMax M2.1 | minimax-m2.1 | code, programming, tool use | MiniMax | 50.6 | 42.7 | 73.9 | 56.6 | 50.6 | 57.7 | $0.3 in / $1.2 out |
| 34 | Seed 2.0 Lite | seed-2.0-lite | multimodal, vision, multi-input reasoning | ByteDance | 50.3 | 58.1 | 0.0 | 0.0 | 50.3 | 0.0 | N/A |
| 35 | Claude Opus 4 | claude-opus-4-20250514 | multimodal, vision, multi-input reasoning | Anthropic | 49.5 | 37.8 | 0.0 | 57.9 | 49.5 | 0.0 | N/A |
| 36 | GPT-5.3 Codex | gpt-5.3-codex | text, text-to-text, coding | OpenAI | 49.3 | 0.0 | 48.6 | 0.0 | 49.3 | 19.5 | $1.75 in / $14 out |
| 37 | Kimi K2.5 | kimi-k2.5 | multimodal, vision, multi-input reasoning | Moonshot AI | 48.5 | 68.0 | 66.8 | 49.5 | 48.5 | 38.1 | $0.6 in / $3 out |
| 38 | GLM-4.6 | glm-4.6 | multimodal, vision, multi-input reasoning | Zhipu AI | 46.1 | 47.0 | 34.9 | 37.7 | 46.1 | 42.8 | $0.55 in / $2.19 out |
| 39 | DeepSeek-V3.2 (Thinking) | deepseek-reasoner | code, programming, tool use | DeepSeek | 45.9 | 53.1 | 0.0 | 16.6 | 45.9 | 0.0 | N/A |
| 40 | DeepSeek-V3.2 | deepseek-v3.2 | code, programming, tool use | DeepSeek | 45.9 | 58.1 | 52.5 | 16.6 | 45.9 | 70.0 | $0.26 in / $0.38 out |

Page 2 of 15 · 294 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.
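
Prices on this page are listed in the form "$1.4 in / $4.4 out", with no unit stated. As a rough sketch only, assuming those figures are USD per one million input and output tokens (an assumption, since the page does not say), a price string could be parsed and turned into a workload cost estimate as follows. The helper names parse_price and estimate_cost are hypothetical and not part of any Skytells API.

```python
import re
from typing import Optional, Tuple

# Matches price strings in the form shown on the page, e.g. "$1.4 in / $4.4 out".
_PRICE_RE = re.compile(r"^\$(\d+(?:\.\d+)?) in / \$(\d+(?:\.\d+)?) out$")


def parse_price(text: str) -> Optional[Tuple[float, float]]:
    """Return (input_price, output_price), or None when the page shows "N/A"."""
    text = text.strip()
    if text == "N/A":
        return None
    match = _PRICE_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized price format: {text!r}")
    return float(match.group(1)), float(match.group(2))


def estimate_cost(price: str, input_tokens: int, output_tokens: int) -> Optional[float]:
    """Estimate cost in USD, ASSUMING prices are per one million tokens."""
    parsed = parse_price(price)
    if parsed is None:
        return None  # no published price, so no estimate
    per_million_in, per_million_out = parsed
    return (input_tokens * per_million_in + output_tokens * per_million_out) / 1_000_000
```

Under that per-million-token assumption, estimate_cost("$1 in / $5 out", 2_000_000, 1_000_000) evaluates to 7.0, and any row priced "N/A" yields None instead of a number.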
