Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

30.8

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
141

Gemini 2.5 Pro

gemini-2.5-pro

multimodalvisionmulti-input reasoning
Google

27.6

Value / Price

44.262.80.025.027.6$1.25 in / $10 out
142

Gemini 2.5 Pro Preview 06-05

gemini-2.5-pro-preview-06-05

multimodalvisionmulti-input reasoning
Google

27.6

Value / Price

51.262.80.029.327.6
143

GPT-5.1 Thinking

gpt-5.1-thinking-2025-11-12

multimodalvisionmulti-input reasoning
OpenAI

27.0

Value / Price

64.955.60.056.227.0
144

GPT-4o

gpt-4o-2024-08-06

multimodalvisionmulti-input reasoning
OpenAI

26.8

Value / Price

31.546.714.94.326.8
145

Mistral Large 2

mistral-large-2-2407

textinference
Mistral AI

26.7

Value / Price

0.021.40.00.026.7$2 in / $6 out
146

GPT-4o

gpt-4o-2024-05-13

multimodalvisionmulti-input reasoning
OpenAI

26.5

Value / Price

22.345.40.00.026.5
147

GPT-5.2

gpt-5.2-2025-12-11

multimodalvisionmulti-input reasoning
OpenAI

26.5

Value / Price

76.672.047.071.926.5
148

GPT-5.3 Chat

gpt-5.3-chat-latest

multimodalvisionmulti-input reasoning
OpenAI

26.5

Value / Price

0.052.70.00.026.5
149

Grok-2

grok-2

multimodalvisionmulti-input reasoning
xAI

25.4

Value / Price

27.138.30.00.025.4$2 in / $10 out
150

Jamba 1.5 Large

jamba-1.5-large

textinference
AAI21 Labs

25.2

Value / Price

8.133.60.00.025.2$2 in / $8 out
151

GPT-5.1 Codex

gpt-5.1-codex

multimodalvisionmulti-input reasoning
OpenAI

25.1

Value / Price

0.049.00.050.025.1
152

GPT-5.1 Codex High

gpt-5.1-codex-high

multimodalvisionmulti-input reasoning
OpenAI

25.1

Value / Price

61.049.00.00.025.1
153

Claude 3.5 Sonnet

claude-3-5-sonnet-20240620

multimodalvisionmulti-input reasoning
Anthropic

24.6

Value / Price

25.468.20.00.024.6
154

Claude 3.5 Sonnet

claude-3-5-sonnet-20241022

multimodalvisionmulti-input reasoning
Anthropic

24.6

Value / Price

33.768.238.712.924.6
155

Gemini 1.5 Pro

gemini-1.5-pro

multimodalvisionmulti-input reasoning
Google

24.3

Value / Price

27.665.20.00.024.3
156

Grok-3

grok-3

multimodalvisionmulti-input reasoning
xAI

22.7

Value / Price

59.352.70.00.022.7$3 in / $15 out
157

Grok-4.1

grok-4.1-2025-11-17

multimodalvisionmulti-input reasoning
xAI

22.7

Value / Price

0.064.90.00.022.7
158

Pixtral Large

pixtral-large

multimodalvisionmulti-input reasoning
Mistral AI

22.4

Value / Price

27.87.00.00.022.4
159

Gemini 3.1 Pro

gemini-3.1-pro-preview

multimodalvisionmulti-input reasoning
Google

22.1

Value / Price

74.467.171.766.122.1
160

GPT-5.2 Codex

gpt-5.2-codex

multimodalvisionmulti-input reasoning
OpenAI

19.6

Value / Price

0.049.00.044.119.6
141

Gemini 2.5 Pro

Google

27.6

$1.25 in / $10 out

142

Gemini 2.5 Pro Preview 06-05

Google

27.6

$1.25 in / $10 out

143

GPT-5.1 Thinking

OpenAI

27.0

$1.25 in / $10 out

144

Page 8 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$1.25 in / $10 out
$1.25 in / $10 out
$2.5 in / $10 out
$2.5 in / $10 out
$1.75 in / $14 out
$1.75 in / $14 out
$1.25 in / $10 out
$1.25 in / $10 out
$3 in / $15 out
$3 in / $15 out
$2.5 in / $10 out
$3 in / $15 out
$2 in / $6 out
$2.5 in / $15 out
$1.75 in / $14 out

GPT-4o

OpenAI

26.8

$2.5 in / $10 out

145

Mistral Large 2

Mistral AI

26.7

$2 in / $6 out

146

GPT-4o

OpenAI

26.5

$2.5 in / $10 out

147

GPT-5.2

OpenAI

26.5

$1.75 in / $14 out

148

GPT-5.3 Chat

OpenAI

26.5

$1.75 in / $14 out

149

Grok-2

xAI

25.4

$2 in / $10 out

150
A

Jamba 1.5 Large

AI21 Labs

25.2

$2 in / $8 out

151

GPT-5.1 Codex

OpenAI

25.1

$1.25 in / $10 out

152

GPT-5.1 Codex High

OpenAI

25.1

$1.25 in / $10 out

153

Claude 3.5 Sonnet

Anthropic

24.6

$3 in / $15 out

154

Claude 3.5 Sonnet

Anthropic

24.6

$3 in / $15 out

155

Gemini 1.5 Pro

Google

24.3

$2.5 in / $10 out

156

Grok-3

xAI

22.7

$3 in / $15 out

157

Grok-4.1

xAI

22.7

$3 in / $15 out

158

Pixtral Large

Mistral AI

22.4

$2 in / $6 out

159

Gemini 3.1 Pro

Google

22.1

$2.5 in / $15 out

160

GPT-5.2 Codex

OpenAI

19.6

$1.75 in / $14 out