Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

309

Tracked models

27

Providers

264

Benchmarked

11.8

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

309 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
61

GLM-4.7

glm-4.7

multimodalvisionmulti-input reasoning
ZZhipu AI

28.0

Agentic

62.30.028.043.60.0N/A
62

GPT-5

gpt-5-2025-08-07

multimodalvisionmulti-input reasoning
OpenAI

27.5

Agentic

63.80.027.550.60.0N/A
63

Qwen3 VL 32B Instruct

qwen3-vl-32b-instruct

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

27.2

Agentic

28.60.027.20.00.0
64

GPT OSS 120B

gpt-oss-120b

textinference
OpenAI

26.8

Agentic

34.914.626.80.090.5$0.09 in / $0.45 out
65

MiniMax M1 40K

minimax-m1-40k

codeprogrammingtool use
MiniMax

26.8

Agentic

21.80.026.816.60.0N/A
66

Qwen3-235B-A22B-Thinking-2507

qwen3-235b-a22b-thinking-2507

textinference
AAlibaba Cloud / Qwen Team

26.8

Agentic

45.80.026.80.00.0N/A
67

Qwen3 VL 8B Instruct

qwen3-vl-8b-instruct

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

26.4

Agentic

8.841.126.40.087.3$0.08 in / $0.5 out
68

MiMo-V2-Flash

mimo-v2-flash

codeprogrammingtool use
Xiaomi

25.3

Agentic

52.40.025.336.80.0N/A
69

GLM-4.5-Air

glm-4.5-air

codeprogrammingtool use
ZZhipu AI

24.3

Agentic

26.70.024.318.30.0N/A
70

Qwen3 VL 8B Thinking

qwen3-vl-8b-thinking

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

23.3

Agentic

34.141.123.30.054.4
71

Qwen3 VL 30B A3B Instruct

qwen3-vl-30b-a3b-instruct

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

22.9

Agentic

27.00.022.90.00.0
72

GPT-5.4 Mini

gpt-5.4-mini

texttext-to-textlanguage
OpenAI

22.4

Agentic

56.956.322.427.432.9
73

MiniMax M1 80K

minimax-m1-80k

codeprogrammingtool use
MiniMax

20.9

Agentic

23.40.020.917.40.0N/A
74

Qwen3 VL 30B A3B Thinking

qwen3-vl-30b-a3b-thinking

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

20.7

Agentic

34.10.020.70.00.0
75

o3

o3-2025-04-16

multimodalvisionmulti-input reasoning
OpenAI

19.8

Agentic

45.70.019.829.90.0N/A
76

Qwen3 VL 4B Instruct

qwen3-vl-4b-instruct

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

18.8

Agentic

18.941.118.80.081.0
77

Qwen3 VL 4B Thinking

qwen3-vl-4b-thinking

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

18.6

Agentic

21.641.118.60.073.4
78

Sarvam-105B

sarvam-105b

codeprogrammingtool use
SSarvam AI

18.3

Agentic

42.10.018.311.10.0N/A
79

Qwen3-Next-80B-A3B-Instruct

qwen3-next-80b-a3b-instruct

textinference
AAlibaba Cloud / Qwen Team

17.9

Agentic

28.40.017.90.00.0N/A
80

Mistral Medium 3.5

mistral-medium-3-5

multimodalvisionmulti-input reasoning
Mistral AI

16.8

Agentic

34.928.516.861.729.1
61
Z

GLM-4.7

Zhipu AI

28.0

N/A

62

GPT-5

OpenAI

27.5

N/A

63
A

Qwen3 VL 32B Instruct

Alibaba Cloud / Qwen Team

27.2

N/A

64

Page 4 of 16 · 309 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

N/A
$0.18 in / $2.09 out
N/A
$0.75 in / $4.5 out
N/A
$0.1 in / $0.6 out
$0.1 in / $1 out
$1.5 in / $7.5 out

GPT OSS 120B

OpenAI

26.8

$0.09 in / $0.45 out

65

MiniMax M1 40K

MiniMax

26.8

N/A

66
A

Qwen3-235B-A22B-Thinking-2507

Alibaba Cloud / Qwen Team

26.8

N/A

67
A

Qwen3 VL 8B Instruct

Alibaba Cloud / Qwen Team

26.4

$0.08 in / $0.5 out

68

MiMo-V2-Flash

Xiaomi

25.3

N/A

69
Z

GLM-4.5-Air

Zhipu AI

24.3

N/A

70
A

Qwen3 VL 8B Thinking

Alibaba Cloud / Qwen Team

23.3

$0.18 in / $2.09 out

71
A

Qwen3 VL 30B A3B Instruct

Alibaba Cloud / Qwen Team

22.9

N/A

72

GPT-5.4 Mini

OpenAI

22.4

$0.75 in / $4.5 out

73

MiniMax M1 80K

MiniMax

20.9

N/A

74
A

Qwen3 VL 30B A3B Thinking

Alibaba Cloud / Qwen Team

20.7

N/A

75

o3

OpenAI

19.8

N/A

76
A

Qwen3 VL 4B Instruct

Alibaba Cloud / Qwen Team

18.8

$0.1 in / $0.6 out

77
A

Qwen3 VL 4B Thinking

Alibaba Cloud / Qwen Team

18.6

$0.1 in / $1 out

78
S

Sarvam-105B

Sarvam AI

18.3

N/A

79
A

Qwen3-Next-80B-A3B-Instruct

Alibaba Cloud / Qwen Team

17.9

N/A

80

Mistral Medium 3.5

Mistral AI

16.8

$1.5 in / $7.5 out