Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

296

Tracked models

27

Providers

253

Benchmarked

34.7

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

296 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
261

DeepSeek VL2

deepseek-vl2

multimodalvisionmulti-input reasoning
DeepSeek

6.9

overall

6.90.00.00.00.0N/A
262

Mistral Small 3 24B Base

mistral-small-24b-base-2501

multimodalvisionmulti-input reasoning
Mistral AI

6.4

overall

6.40.00.00.00.0
263

DeepSeek R1 Distill Qwen 1.5B

deepseek-r1-distill-qwen-1.5b

textinference
DeepSeek

6.1

overall

6.10.00.00.00.0N/A
264

Qwen2.5 VL 7B Instruct

qwen2.5-vl-7b

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

5.2

overall

9.60.00.00.00.0N/A
265

Gemini Diffusion

gemini-diffusion

codeprogrammingtool use
Google

4.7

overall

7.00.00.01.70.0N/A
266

DeepSeek VL2 Small

deepseek-vl2-small

multimodalvisionmulti-input reasoning
DeepSeek

4.6

overall

4.60.00.00.00.0
267

GPT-5.1 Codex Mini

gpt-5.1-codex-mini

multimodalvisionmulti-input reasoning
OpenAI

4.0

overall

4.00.00.00.00.0
268

Qwen2 7B Instruct

qwen2-7b-instruct

textinference
AAlibaba Cloud / Qwen Team

2.4

overall

2.40.00.00.00.0N/A
269

Phi-3.5-vision-instruct

phi-3.5-vision-instruct

multimodalvisionmulti-input reasoning
MMicrosoft

2.3

overall

2.30.00.00.00.0N/A
270

Phi 4 Mini

phi-4-mini

textinference
MMicrosoft

2.0

overall

2.00.00.00.00.0N/A
271

Gemma 3n E4B Instructed LiteRT Preview

gemma-3n-e4b-it-litert-preview

multimodalvisionmulti-input reasoning
Google

1.3

overall

1.30.00.00.00.0
272

DeepSeek VL2 Tiny

deepseek-vl2-tiny

multimodalvisionmulti-input reasoning
DeepSeek

1.2

overall

1.20.00.00.00.0
273

Gemma 3n E2B Instructed

gemma-3n-e2b-it

multimodalvisionmulti-input reasoning
Google

1.0

overall

1.00.00.00.00.0
274

Gemma 3n E2B Instructed LiteRT (Preview)

gemma-3n-e2b-it-litert-preview

multimodalvisionmulti-input reasoning
Google

1.0

overall

1.00.00.00.00.0
275

Gemma 3 1B

gemma-3-1b-it

textinference
Google

0.9

overall

0.90.00.00.00.0N/A
276

Codestral-22B

codestral-22b

textinference
Mistral AI

0.0

overall

0.00.00.00.00.0N/A
277

Gemma 2 27B

gemma-2-27b-it

textinference
Google

0.0

overall

0.00.00.00.00.0N/A
278

Gemma 2 9B

gemma-2-9b-it

textinference
Google

0.0

overall

0.00.00.00.00.0N/A
279

Gemma 3n E2B

gemma-3n-e2b

multimodalvisionmulti-input reasoning
Google

0.0

overall

0.00.00.00.00.0N/A
280

Gemma 3n E4B

gemma-3n-e4b

multimodalvisionmulti-input reasoning
Google

0.0

overall

0.00.00.00.00.0N/A
261

DeepSeek VL2

DeepSeek

6.9

N/A

262

Mistral Small 3 24B Base

Mistral AI

6.4

N/A

263

DeepSeek R1 Distill Qwen 1.5B

DeepSeek

6.1

N/A

264

Page 14 of 15 · 296 models

PreviousNext

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

N/A
N/A
N/A
N/A
N/A
N/A
N/A
A

Qwen2.5 VL 7B Instruct

Alibaba Cloud / Qwen Team

5.2

N/A

265

Gemini Diffusion

Google

4.7

N/A

266

DeepSeek VL2 Small

DeepSeek

4.6

N/A

267

GPT-5.1 Codex Mini

OpenAI

4.0

N/A

268
A

Qwen2 7B Instruct

Alibaba Cloud / Qwen Team

2.4

N/A

269
M

Phi-3.5-vision-instruct

Microsoft

2.3

N/A

270
M

Phi 4 Mini

Microsoft

2.0

N/A

271

Gemma 3n E4B Instructed LiteRT Preview

Google

1.3

N/A

272

DeepSeek VL2 Tiny

DeepSeek

1.2

N/A

273

Gemma 3n E2B Instructed

Google

1.0

N/A

274

Gemma 3n E2B Instructed LiteRT (Preview)

Google

1.0

N/A

275

Gemma 3 1B

Google

0.9

N/A

276

Codestral-22B

Mistral AI

0.0

N/A

277

Gemma 2 27B

Google

0.0

N/A

278

Gemma 2 9B

Google

0.0

N/A

279

Gemma 3n E2B

Google

0.0

N/A

280

Gemma 3n E4B

Google

0.0

N/A