Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

294

Tracked models

27

Providers

251

Benchmarked

30.7

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

294 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
61

Gemini 2.5 Flash-Lite

gemini-2.5-flash-lite

multimodalvisionmulti-input reasoning
Google

64.4

Value / Price

21.632.90.03.564.4$0.1 in / $0.4 out
62

Qwen3 32B

qwen3-32b

textinference
AAlibaba Cloud / Qwen Team

63.4

Value / Price

21.413.30.00.063.4$0.1 in / $0.44 out
63

Qwen3 VL 30B A3B Instruct

qwen3-vl-30b-a3b-instruct

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

63.3

Value / Price

28.766.823.60.063.3
64

Qwen3-235B-A22B-Instruct-2507

qwen3-235b-a22b-instruct-2507

textinference
AAlibaba Cloud / Qwen Team

62.8

Value / Price

42.966.80.00.062.8$0.15 in / $0.8 out
65

QwQ-32B-Preview

qwq-32b-preview

textinference
AAlibaba Cloud / Qwen Team

62.0

Value / Price

29.029.50.00.062.0$0.15 in / $0.6 out
66

Kimi K2 Instruct

kimi-k2-instruct

codeprogrammingtool use
Moonshot AI

61.7

Value / Price

24.946.614.815.361.7
67

Llama 4 Maverick

llama-4-maverick

multimodalvisionmulti-input reasoning
MMeta

61.2

Value / Price

35.558.90.00.061.2$0.17 in / $0.6 out
68

Qwen3 VL 4B Thinking

qwen3-vl-4b-thinking

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

60.6

Value / Price

23.166.818.90.060.6
69

DeepSeek-V3

deepseek-v3

codeprogrammingtool use
DeepSeek

60.4

Value / Price

28.057.30.010.660.4$0.27 in / $1.1 out
70

Qwen3 VL 30B A3B Thinking

qwen3-vl-30b-a3b-thinking

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

60.0

Value / Price

35.566.821.30.060.0
71

Claude 3 Haiku

claude-3-haiku-20240307

multimodalvisionmulti-input reasoning
Anthropic

59.9

Value / Price

5.868.70.00.059.9
72

DeepSeek-V3.1

deepseek-v3.1

codeprogrammingtool use
DeepSeek

58.9

Value / Price

38.740.215.328.758.9$0.27 in / $1 out
73

DeepSeek-V3 0324

deepseek-v3-0324

textinference
DeepSeek

57.8

Value / Price

33.140.20.00.057.8$0.28 in / $1.14 out
74

LongCat-Flash-Chat

longcat-flash-chat

codeprogrammingtool use
Meituan

57.7

Value / Price

28.151.949.239.457.7
75

LongCat-Flash-Thinking-2601

longcat-flash-thinking-2601

codeprogrammingtool use
Meituan

57.7

Value / Price

56.351.930.838.057.7
76

MiniMax M2.1

minimax-m2.1

codeprogrammingtool use
MiniMax

57.7

Value / Price

42.773.956.650.657.7$0.3 in / $1.2 out
77

MiniMax M2.5

minimax-m2.5

codeprogrammingtool use
MiniMax

57.7

Value / Price

0.073.953.056.357.7$0.3 in / $1.2 out
78

GPT-5.4 nano

gpt-5.4-nano

multimodalvisionmulti-input reasoning
OpenAI

57.2

Value / Price

46.177.411.011.257.2
79

GPT-4.1 mini

gpt-4.1-mini-2025-04-14

multimodalvisionmulti-input reasoning
OpenAI

56.8

Value / Price

20.890.98.92.656.8
80

GPT-5 mini

gpt-5-mini-2025-08-07

multimodalvisionmulti-input reasoning
OpenAI

56.3

Value / Price

41.989.70.023.756.3
61

Gemini 2.5 Flash-Lite

Google

64.4

$0.1 in / $0.4 out

62
A

Qwen3 32B

Alibaba Cloud / Qwen Team

63.4

$0.1 in / $0.44 out

63
A

Qwen3 VL 30B A3B Instruct

Alibaba Cloud / Qwen Team

63.3

$0.2 in / $0.7 out

64

Page 4 of 15 · 294 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$0.2 in / $0.7 out
$0.5 in / $0.5 out
$0.1 in / $1 out
$0.2 in / $1 out
$0.25 in / $1.25 out
$0.3 in / $1.2 out
$0.3 in / $1.2 out
$0.2 in / $1.25 out
$0.4 in / $1.6 out
$0.25 in / $2 out
A

Qwen3-235B-A22B-Instruct-2507

Alibaba Cloud / Qwen Team

62.8

$0.15 in / $0.8 out

65
A

QwQ-32B-Preview

Alibaba Cloud / Qwen Team

62.0

$0.15 in / $0.6 out

66

Kimi K2 Instruct

Moonshot AI

61.7

$0.5 in / $0.5 out

67
M

Llama 4 Maverick

Meta

61.2

$0.17 in / $0.6 out

68
A

Qwen3 VL 4B Thinking

Alibaba Cloud / Qwen Team

60.6

$0.1 in / $1 out

69

DeepSeek-V3

DeepSeek

60.4

$0.27 in / $1.1 out

70
A

Qwen3 VL 30B A3B Thinking

Alibaba Cloud / Qwen Team

60.0

$0.2 in / $1 out

71

Claude 3 Haiku

Anthropic

59.9

$0.25 in / $1.25 out

72

DeepSeek-V3.1

DeepSeek

58.9

$0.27 in / $1 out

73

DeepSeek-V3 0324

DeepSeek

57.8

$0.28 in / $1.14 out

74

LongCat-Flash-Chat

Meituan

57.7

$0.3 in / $1.2 out

75

LongCat-Flash-Thinking-2601

Meituan

57.7

$0.3 in / $1.2 out

76

MiniMax M2.1

MiniMax

57.7

$0.3 in / $1.2 out

77

MiniMax M2.5

MiniMax

57.7

$0.3 in / $1.2 out

78

GPT-5.4 nano

OpenAI

57.2

$0.2 in / $1.25 out

79

GPT-4.1 mini

OpenAI

56.8

$0.4 in / $1.6 out

80

GPT-5 mini

OpenAI

56.3

$0.25 in / $2 out