Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

27.4

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
241

Codestral-22B

codestral-22b

textinference
Mistral AI

0.0

Benchmarks

0.00.00.00.00.0N/A
242

Command R+

command-r-plus-04-2024

textinference
Cohere

0.0

Benchmarks

0.032.00.00.055.4$0.25 in / $1 out
243

DeepSeek-V3.2 (Non-thinking)

deepseek-chat

textinference
DeepSeek

0.0

Benchmarks

0.057.90.00.070.3$0.28 in / $0.42 out
244

DeepSeek-R1

deepseek-r1

textinference
DeepSeek

0.0

Benchmarks

0.014.20.00.035.4$0.55 in / $2.19 out
245

DeepSeek-V2.5

deepseek-v2.5

codeprogrammingtool use
DeepSeek

0.0

Benchmarks

0.046.50.00.979.7$0.14 in / $0.28 out
246

Devstral Medium

devstral-medium-2507

codeprogrammingtool use
Mistral AI

0.0

Benchmarks

0.065.30.024.253.8$0.4 in / $2 out
247

Devstral Small 1.1

devstral-small-2507

codeprogrammingtool use
Mistral AI

0.0

Benchmarks

0.065.30.014.785.5
248

Gemma 2 27B

gemma-2-27b-it

textinference
Google

0.0

Benchmarks

0.00.00.00.00.0N/A
249

Gemma 2 9B

gemma-2-9b-it

textinference
Google

0.0

Benchmarks

0.00.00.00.00.0N/A
250

Gemma 3n E2B

gemma-3n-e2b

multimodalvisionmulti-input reasoning
Google

0.0

Benchmarks

0.00.00.00.00.0N/A
251

Gemma 3n E4B

gemma-3n-e4b

multimodalvisionmulti-input reasoning
Google

0.0

Benchmarks

0.00.00.00.00.0N/A
252

GLM-4.5V

glm-4.5v

multimodalvisionmulti-input reasoning
ZZhipu AI

0.0

Benchmarks

0.00.00.00.00.0N/A
253

GLM-5

glm-5

codeprogrammingtool use
ZZhipu AI

0.0

Benchmarks

0.022.947.863.830.7$1 in / $3.2 out
254

GLM-5V-Turbo

glm-5v-turbo

multimodalvisionmulti-input reasoning
ZZhipu AI

0.0

Benchmarks

0.00.054.90.00.0N/A
255

GPT-5.1 Codex

gpt-5.1-codex

multimodalvisionmulti-input reasoning
OpenAI

0.0

Benchmarks

0.048.50.050.024.8
256

GPT-5.2 Codex

gpt-5.2-codex

multimodalvisionmulti-input reasoning
OpenAI

0.0

Benchmarks

0.048.50.044.119.4
257

GPT-5.3 Chat

gpt-5.3-chat-latest

multimodalvisionmulti-input reasoning
OpenAI

0.0

Benchmarks

0.052.70.00.026.4
258

GPT-5.3 Codex

gpt-5.3-codex

texttext-to-textcoding
OpenAI

0.0

Benchmarks

0.048.50.052.219.4
259

GPT-5 Codex

gpt-5-codex-2025-09-15

codeprogrammingtool use
OpenAI

0.0

Benchmarks

0.00.00.053.10.0N/A
260

Granite 3.3 8B Base

granite-3.3-8b-base

multimodalvisionmulti-input reasoning
IIBM

0.0

Benchmarks

0.00.00.00.00.0N/A
241

Codestral-22B

Mistral AI

0.0

N/A

242

Command R+

Cohere

0.0

$0.25 in / $1 out

243

DeepSeek-V3.2 (Non-thinking)

DeepSeek

0.0

$0.28 in / $0.42 out

244

Page 13 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$0.1 in / $0.3 out
$1.25 in / $10 out
$1.75 in / $14 out
$1.75 in / $14 out
$1.75 in / $14 out

DeepSeek-R1

DeepSeek

0.0

$0.55 in / $2.19 out

245

DeepSeek-V2.5

DeepSeek

0.0

$0.14 in / $0.28 out

246

Devstral Medium

Mistral AI

0.0

$0.4 in / $2 out

247

Devstral Small 1.1

Mistral AI

0.0

$0.1 in / $0.3 out

248

Gemma 2 27B

Google

0.0

N/A

249

Gemma 2 9B

Google

0.0

N/A

250

Gemma 3n E2B

Google

0.0

N/A

251

Gemma 3n E4B

Google

0.0

N/A

252
Z

GLM-4.5V

Zhipu AI

0.0

N/A

253
Z

GLM-5

Zhipu AI

0.0

$1 in / $3.2 out

254
Z

GLM-5V-Turbo

Zhipu AI

0.0

N/A

255

GPT-5.1 Codex

OpenAI

0.0

$1.25 in / $10 out

256

GPT-5.2 Codex

OpenAI

0.0

$1.75 in / $14 out

257

GPT-5.3 Chat

OpenAI

0.0

$1.75 in / $14 out

258

GPT-5.3 Codex

OpenAI

0.0

$1.75 in / $14 out

259

GPT-5 Codex

OpenAI

0.0

N/A

260
I

Granite 3.3 8B Base

IBM

0.0

N/A