Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.


  • Tracked models: 294
  • Providers: 27
  • Benchmarked: 251
  • Avg. index: 31.8


| Rank | Model | Model ID | Tags | Provider | Score | Benchmarks | Inference | Agentic | Programming | Value | Price |
|------|-------|----------|------|----------|-------|------------|-----------|---------|-------------|-------|-------|
| 41 | Grok 4 Fast | grok-4-fast | multimodal, vision, multi-input reasoning | xAI | 68.2 (Inference) | 58.0 | 68.2 | 15.4 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 42 | Grok-4 Fast Non-Reasoning | grok-4-fast-non-reasoning | multimodal, vision, multi-input reasoning | xAI | 68.2 (Inference) | 0.0 | 68.2 | 0.0 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 43 | Grok-4 Fast Reasoning | grok-4-fast-reasoning | multimodal, vision, multi-input reasoning | xAI | 68.2 (Inference) | 0.0 | 68.2 | 0.0 | 0.0 | 67.2 | $0.2 in / $0.5 out |
| 44 | Claude 3.5 Sonnet | claude-3-5-sonnet-20240620 | multimodal, vision, multi-input reasoning | Anthropic | 67.4 (Inference) | 25.6 | 67.4 | 0.0 | 0.0 | 24.5 | $3 in / $15 out |
| 45 | Claude 3.5 Sonnet | claude-3-5-sonnet-20241022 | multimodal, vision, multi-input reasoning | Anthropic | 67.4 (Inference) | 33.9 | 67.4 | 38.7 | 13.2 | 24.5 | $3 in / $15 out |
| 46 | Gemini 3.1 Pro | gemini-3.1-pro-preview | multimodal, vision, multi-input reasoning | Google | 66.8 (Inference) | 74.3 | 66.8 | 72.3 | 65.5 | 22.1 | $2.5 in / $15 out |
| 47 | Gemma 4 26B-A4B | gemma-4-26b-a4b-it | multimodal, vision, multi-input reasoning | Google | 66.8 (Inference) | 43.7 | 66.8 | 0.0 | 0.0 | 77.8 | $0.13 in / $0.4 out |
| 48 | Gemma 4 31B | gemma-4-31b-it | multimodal, vision, multi-input reasoning | Google | 66.8 (Inference) | 56.5 | 66.8 | 0.0 | 0.0 | 76.7 | $0.14 in / $0.4 out |
| 49 | Kimi K2 0905 | kimi-k2-0905 | text, inference | Moonshot AI | 66.8 (Inference) | 44.4 | 66.8 | 0.0 | 0.0 | 40.0 | $0.6 in / $2.5 out |
| 50 | Kimi K2.5 | kimi-k2.5 | multimodal, vision, multi-input reasoning | Moonshot AI | 66.8 (Inference) | 68.0 | 66.8 | 49.5 | 48.5 | 38.1 | $0.6 in / $3 out |
| 51 | Kimi K2.6 | kimi-k2.6 | multimodal, vision, multi-input reasoning | Moonshot AI | 66.8 (Inference) | 68.5 | 66.8 | 45.3 | 81.0 | 33.3 | $0.95 in / $4 out |
| 52 | Nemotron 3 Nano (30B A3B) | nemotron-3-nano-30b-a3b | code, programming, tool use | NVIDIA | 66.8 (Inference) | 45.8 | 66.8 | 3.3 | 4.4 | 90.8 | $0.06 in / $0.24 out |
| 53 | Qwen3-235B-A22B-Instruct-2507 | qwen3-235b-a22b-instruct-2507 | text, inference | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 42.9 | 66.8 | 0.0 | 0.0 | 62.8 | $0.15 in / $0.8 out |
| 54 | Qwen3-235B-A22B-Thinking-2507 | qwen3-235b-a22b-thinking-2507 | text, inference | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 46.9 | 66.8 | 26.8 | 0.0 | 39.4 | $0.3 in / $3 out |
| 55 | Qwen3.5-122B-A10B | qwen3.5-122b-a10b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 64.8 | 66.8 | 51.6 | 41.5 | 38.1 | $0.4 in / $3.2 out |
| 56 | Qwen3.5-27B | qwen3.5-27b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 61.9 | 66.8 | 47.5 | 42.4 | 43.9 | $0.3 in / $2.4 out |
| 57 | Qwen3.5-35B-A3B | qwen3.5-35b-a3b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 57.2 | 66.8 | 44.3 | 34.4 | 46.4 | $0.25 in / $2 out |
| 58 | Qwen3.5-397B-A17B | qwen3.5-397b-a17b | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 58.6 | 66.8 | 35.6 | 60.9 | 35.3 | $0.6 in / $3.6 out |
| 59 | Qwen3 VL 235B A22B Instruct | qwen3-vl-235b-a22b-instruct | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 37.1 | 66.8 | 56.7 | 0.0 | 49.4 | $0.3 in / $1.5 out |
| 60 | Qwen3 VL 235B A22B Thinking | qwen3-vl-235b-a22b-thinking | multimodal, vision, multi-input reasoning | Alibaba Cloud / Qwen Team | 66.8 (Inference) | 37.9 | 66.8 | 40.2 | 0.0 | 37.2 | $0.45 in / $3.49 out |

Page 3 of 15 · 294 models

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

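
For readers who want to work with the Price column programmatically, the following is a minimal sketch of parsing strings such as "$0.2 in / $0.5 out" and ordering a few rows by output price. The names `Price`, `parse_price`, and `cheapest_by_output` are hypothetical helpers, not part of any Skytells API, and the page does not state the pricing unit, so the values are treated as plain numbers.

```python
import re
from typing import NamedTuple, Optional

# Pattern for price strings as they appear on the page, e.g. "$0.2 in / $0.5 out".
# Assumption: every price follows this exact shape; the unit is not stated on the page.
PRICE_RE = re.compile(r"^\$(\d+(?:\.\d+)?) in / \$(\d+(?:\.\d+)?) out$")


class Price(NamedTuple):
    """Hypothetical container for one Price cell."""
    input_price: float
    output_price: float


def parse_price(text: str) -> Optional[Price]:
    """Return the parsed price, or None if the text does not match the pattern."""
    match = PRICE_RE.match(text.strip())
    if match is None:
        return None
    return Price(float(match.group(1)), float(match.group(2)))


def cheapest_by_output(rows: dict[str, str]) -> list[str]:
    """Sort model IDs by ascending output price, skipping unparseable prices."""
    parsed = {}
    for model_id, text in rows.items():
        price = parse_price(text)
        if price is not None:
            parsed[model_id] = price
    return sorted(parsed, key=lambda model_id: parsed[model_id].output_price)


# Three rows copied from the leaderboard page (model ID -> Price cell).
ROWS = {
    "gemini-3.1-pro-preview": "$2.5 in / $15 out",
    "kimi-k2.5": "$0.6 in / $3 out",
    "nemotron-3-nano-30b-a3b": "$0.06 in / $0.24 out",
}

if __name__ == "__main__":
    for model_id in cheapest_by_output(ROWS):
        print(model_id, parse_price(ROWS[model_id]))
```

The sketch only orders by listed price; it does not attempt to reproduce the Value score, whose formula the page does not publish.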