Skytells
HomeModelsCLIChangelog
  • Home
  • Models
  • CLI
  • Changelog
Skytells

Addressing the world's greatest challenges with AI. Enterprise research, foundation models, and infrastructure trusted by organizations worldwide since 2012.

Get Started

  • Console
  • Learn
  • Documentation
  • API Reference
  • Pricing
  • ModelsNew

Platform

  • Cloud AgentsNew
  • AI Solutions
  • Infrastructure
  • Edge Network
  • Trust Center
  • CLI

Resources

  • Blog
  • Changelog
  • AI Leaderboard
  • Research
  • Status

Company

  • About
  • Careers
  • Legal
  • Privacy Policy

© 2012–2026 Skytells, Inc. All rights reserved.

Live rankings

AI Model Leaderboard

Every major AI model ranked across benchmark quality, inference speed, agentic capability, programming aptitude, and cost efficiency — updated continuously from published evaluation data.

Explore full leaderboardBrowse model catalog

294

Tracked models

27

Providers

251

Benchmarked

30.7

Avg. index

OverallBenchmarksInferenceAgenticProgrammingValue / Price

294 models

RankModelProviderScoreBenchmarksInferenceAgenticProgrammingValuePrice
1

Llama 3.2 3B Instruct

llama-3.2-3b-instruct

textinference
MMeta

98.8

Value / Price

5.369.00.00.098.8$0.01 in / $0.02 out
2

Min istral 3 (3B Reasoning 2512)

ministral-3b-latest

multimodalvisionmulti-input reasoning
Mistral AI

95.8

Value / Price

22.179.70.00.095.8
3

Llama 3.2 11B Instruct

llama-3.2-11b-instruct

multimodalvisionmulti-input reasoning
MMeta

94.9

Value / Price

4.160.50.00.094.9
4

Ministral 3 (8B Reasoning 2512)

ministral-8b-latest

multimodalvisionmulti-input reasoning
Mistral AI

92.1

Value / Price

31.884.80.00.092.1
5

Nova Micro

nova-micro

textinference
AAmazon

91.0

Value / Price

9.251.90.00.091.0$0.03 in / $0.14 out
6

Nemotron 3 Nano (30B A3B)

nemotron-3-nano-30b-a3b

codeprogrammingtool use
NNVIDIA

90.8

Value / Price

45.866.83.34.490.8$0.06 in / $0.24 out
7

Gemini 1.5 Flash 8B

gemini-1.5-flash-8b

multimodalvisionmulti-input reasoning
Google

88.3

Value / Price

10.492.10.00.088.3
8

Qwen3-Coder

qwen3-coder

textinference
AAlibaba Cloud / Qwen Team

88.3

Value / Price

0.055.90.00.088.3$0.18 in / $0.18 out
9

Llama 4 Scout

llama-4-scout

multimodalvisionmulti-input reasoning
MMeta

87.2

Value / Price

29.293.00.00.087.2$0.08 in / $0.3 out
10

Nova Lite

nova-lite

multimodalvisionmulti-input reasoning
AAmazon

86.4

Value / Price

13.669.90.00.086.4$0.06 in / $0.24 out
11

MiMo-V2-Flash

mimo-v2-flash

codeprogrammingtool use
Xiaomi

85.9

Value / Price

53.779.827.239.385.9$0.1 in / $0.3 out
12

Devstral Small 1.1

devstral-small-2507

codeprogrammingtool use
Mistral AI

85.0

Value / Price

0.064.50.015.085.0
13

Mistral Small 3.1 24B Base

mistral-small-3.1-24b-base-2503

multimodalvisionmulti-input reasoning
Mistral AI

85.0

Value / Price

13.564.50.00.085.0
14

Ministral 3 (14B Reasoning 2512)

ministral-14b-latest

multimodalvisionmulti-input reasoning
Mistral AI

84.5

Value / Price

37.976.80.00.084.5
15

Qwen3 235B A22B

qwen3-235b-a22b

multimodalvisionmulti-input reasoning
AAlibaba Cloud / Qwen Team

83.9

Value / Price

30.533.40.00.083.9$0.1 in / $0.1 out
16

Llama 3.1 8B Instruct

llama-3.1-8b-instruct

textinference
MMeta

83.7

Value / Price

3.226.20.00.083.7$0.03 in / $0.03 out
17

LongCat-Flash-Lite

longcat-flash-lite

codeprogrammingtool use
Meituan

83.3

Value / Price

24.783.829.525.383.3
18

GPT-4.1 nano

gpt-4.1-nano-2025-04-14

multimodalvisionmulti-input reasoning
OpenAI

83.0

Value / Price

12.693.70.00.083.0
19

Gemini 2.0 Flash

gemini-2.0-flash

multimodalvisionmulti-input reasoning
Google

82.7

Value / Price

33.494.10.00.082.7
20

Step-3.5-Flash

step-3.5-flash

codeprogrammingtool use
SStepFun

82.1

Value / Price

62.363.245.353.082.1$0.1 in / $0.4 out
1
M

Llama 3.2 3B Instruct

Meta

98.8

$0.01 in / $0.02 out

2

Min istral 3 (3B Reasoning 2512)

Mistral AI

95.8

$0.1 in / $0.1 out

3
M

Llama 3.2 11B Instruct

Meta

94.9

$0.05 in / $0.05 out

Page 1 of 15 · 294 models

Next

Want benchmark charts, model comparison, and pricing analytics?

Sign in to access the full interactive leaderboard with deep benchmark breakdowns and model comparison tools.

Open full leaderboard

Rankings are based on multi-dimensional evaluation across benchmark quality, inference efficiency, and cost-per-output. Scores are updated continuously and may differ from individual third-party benchmarks.

$0.1 in / $0.1 out
$0.05 in / $0.05 out
$0.15 in / $0.15 out
$0.07 in / $0.3 out
$0.1 in / $0.3 out
$0.1 in / $0.3 out
$0.2 in / $0.2 out
$0.1 in / $0.4 out
$0.1 in / $0.4 out
$0.1 in / $0.4 out
4

Ministral 3 (8B Reasoning 2512)

Mistral AI

92.1

$0.15 in / $0.15 out

5
A

Nova Micro

Amazon

91.0

$0.03 in / $0.14 out

6
N

Nemotron 3 Nano (30B A3B)

NVIDIA

90.8

$0.06 in / $0.24 out

7

Gemini 1.5 Flash 8B

Google

88.3

$0.07 in / $0.3 out

8
A

Qwen3-Coder

Alibaba Cloud / Qwen Team

88.3

$0.18 in / $0.18 out

9
M

Llama 4 Scout

Meta

87.2

$0.08 in / $0.3 out

10
A

Nova Lite

Amazon

86.4

$0.06 in / $0.24 out

11

MiMo-V2-Flash

Xiaomi

85.9

$0.1 in / $0.3 out

12

Devstral Small 1.1

Mistral AI

85.0

$0.1 in / $0.3 out

13

Mistral Small 3.1 24B Base

Mistral AI

85.0

$0.1 in / $0.3 out

14

Ministral 3 (14B Reasoning 2512)

Mistral AI

84.5

$0.2 in / $0.2 out

15
A

Qwen3 235B A22B

Alibaba Cloud / Qwen Team

83.9

$0.1 in / $0.1 out

16
M

Llama 3.1 8B Instruct

Meta

83.7

$0.03 in / $0.03 out

17

LongCat-Flash-Lite

Meituan

83.3

$0.1 in / $0.4 out

18

GPT-4.1 nano

OpenAI

83.0

$0.1 in / $0.4 out

19

Gemini 2.0 Flash

Google

82.7

$0.1 in / $0.4 out

20
S

Step-3.5-Flash

StepFun

82.1

$0.1 in / $0.4 out