Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

309

Tracked models

27

Providers

264

Benchmarked

29.3

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

309 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
41

Gemma 4 31B

gemma-4-31b-it

multimodalvisionmulti-input reasoning
Google

57.4

overall

54.941.10.00.090.5$0.14 in / $0.4 out
42

MiniMax M3

minimax-m3

multimodalvisionmulti-input reasoning
MiniMax

57.3

overall

54.672.238.774.348.1$0.6 in / $2.4 out
43

Claude Opus 4.5

claude-opus-4-5-20251101

multimodalvisionmulti-input reasoning
Anthropic

56.2

overall

55.30.041.473.50.0
44

GPT-5 Medium

gpt-5-medium-2025-08-07

multimodalvisionmulti-input reasoning
OpenAI

56.0

overall

56.00.00.00.00.0
45

GPT-5.4

gpt-5.4

texttext-to-textlanguage
OpenAI

55.5

overall

75.338.956.260.614.1
46

ChatGPT-4o Latest

chatgpt-4o-latest

multimodalvisionmulti-input reasoning
OpenAI

54.9

overall

54.90.00.00.00.0
47

GLM-5V-Turbo

glm-5v-turbo

multimodalvisionmulti-input reasoning
ZZhipu AI

54.9

overall

0.00.054.90.00.0N/A
48

GPT OSS 120B High

gpt-oss-120b-high

multimodalvisionmulti-input reasoning
OpenAI

54.1

overall

44.253.00.00.083.3
49

Kimi K2.5

kimi-k2.5

multimodalvisionmulti-input reasoning
Moonshot AI

54.0

overall

67.20.047.344.60.0N/A
50

Seed 2.0 Lite

seed-2.0-lite

multimodalvisionmulti-input reasoning
BByteDance

53.3

overall

57.60.00.047.80.0N/A
51

GPT OSS 20B High

gpt-oss-20b-high

textinference
OpenAI

53.1

overall

53.10.00.00.00.0N/A
52

MiniMax M2.1

minimax-m2.1

codeprogrammingtool use
MiniMax

53.1

overall

40.872.252.148.768.6$0.3 in / $1.2 out
53

MiMo-V2-Omni

mimo-v2-omni

multimodalvisionmulti-input reasoning
Xiaomi

53.0

overall

0.00.00.053.00.0N/A
54

GPT-5.1 Medium

gpt-5.1-medium-2025-11-12

multimodalvisionmulti-input reasoning
OpenAI

52.7

overall

64.048.40.00.027.6
55

Grok-3 Mini

grok-3-mini

multimodalvisionmulti-input reasoning
xAI

52.0

overall

52.00.00.00.00.0N/A
56

GPT-5 Codex

gpt-5-codex-2025-09-15

codeprogrammingtool use
OpenAI

51.8

overall

0.00.00.051.80.0N/A
57

Claude Sonnet 4.5

claude-sonnet-4-5-20250929

multimodalvisionmulti-input reasoning
Anthropic

51.5

overall

51.914.671.874.69.3
58

Gemma 4 26B-A4B

gemma-4-26b-a4b-it

multimodalvisionmulti-input reasoning
Google

51.5

overall

42.341.10.00.093.7
59

Nova 2 Pro

nova-2-pro

multimodalvisionmulti-input reasoning
AAmazon

51.3

overall

46.80.057.250.60.0N/A
60

Grok-4

grok-4

multimodalvisionmulti-input reasoning
xAI

50.5

overall

50.50.00.00.00.0N/A
41

Gemma 4 31B

Google

57.4

$0.14 in / $0.4 out

42

MiniMax M3

MiniMax

57.3

$0.6 in / $2.4 out

43

Claude Opus 4.5

Anthropic

56.2

N/A

44

Page 3 of 16 · 309 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

N/A
N/A
$2.5 in / $15 out
N/A
$0.1 in / $0.5 out
$1.25 in / $10 out
$3 in / $15 out
$0.13 in / $0.4 out

GPT-5 Medium

OpenAI

56.0

N/A

45

GPT-5.4

OpenAI

55.5

$2.5 in / $15 out

46

ChatGPT-4o Latest

OpenAI

54.9

N/A

47
Z

GLM-5V-Turbo

Zhipu AI

54.9

N/A

48

GPT OSS 120B High

OpenAI

54.1

$0.1 in / $0.5 out

49

Kimi K2.5

Moonshot AI

54.0

N/A

50
B

Seed 2.0 Lite

ByteDance

53.3

N/A

51

GPT OSS 20B High

OpenAI

53.1

N/A

52

MiniMax M2.1

MiniMax

53.1

$0.3 in / $1.2 out

53

MiMo-V2-Omni

Xiaomi

53.0

N/A

54

GPT-5.1 Medium

OpenAI

52.7

$1.25 in / $10 out

55

Grok-3 Mini

xAI

52.0

N/A

56

GPT-5 Codex

OpenAI

51.8

N/A

57

Claude Sonnet 4.5

Anthropic

51.5

$3 in / $15 out

58

Gemma 4 26B-A4B

Google

51.5

$0.13 in / $0.4 out

59
A

Nova 2 Pro

Amazon

51.3

N/A

60

Grok-4

xAI

50.5

N/A