A governed registry of 42+ inference models spanning text, image, audio, and video. Access OpenAI, Anthropic, Google, Mistral, Meta, and Skytells-native models through a single authenticated endpoint — with unified rate limits, cost attribution, and SLA-backed availability.
Covered providers
Total Models: 42
Vendors: 8
Official Models: 20

20 models
TrueFusion
[image] TrueFusion Standard
TrueFusion Pro
[image] TrueFusion Pro
TrueFusion Max
[image] TrueFusion Max
TrueFusion Pano
[image] TrueFusion Pano
TrueFusion Video Pro
[video] TrueFusion Video Pro
TrueFusion Video
[video] TrueFusion Video Standard
Lumo
[video] Lumo is an image-to-video model built for motion, animation, and general use cases.
TrueFusion Variant
[image] TrueFusion Variant
TrueFusion Edge
[image] TrueFusion Edge. Ultra-fast, lightweight AI model delivering stunningly realistic image-to-video results with minimal resource usage, optimized for mobile and real-time applications.
TrueFusion Standard
[image] TrueFusion Standard
TrueFusion Ultra
[image] Our flagship and most advanced text-to-image model yet. TrueFusion Ultra delivers stunning photorealism, artistic creativity, and unmatched consistency across styles. From intricate details to vivid storytelling, it redefines what's possible with generative visuals.
LipFusion
[video] LipFusion is a cutting-edge AI model engineered by Skytells to deliver ultra-realistic lip-syncing across a wide range of content, from videos and animations to avatars and live streams. With advanced deep learning architectures and real-time inference optimization, LipFusion aligns speech with visual output, creating an immersive, human-like experience that brings characters, avatars, and digital personas to life.
Mera
[video] Our latest video generation model is more physically accurate, more realistic, and more controllable than prior systems.
TrueFusion X
[image] Ultra-fast, ultra-high-resolution. More pixels in every image.
TrueFusion 2.0
[image] TrueFusion 2.0 Image lets you attach up to three images as ground truth and reference them by tags in your prompt. It preserves identity, style, and materials while giving you control over angle, composition, lighting, and fine details, so the final image matches exactly what you envisioned.
Flux.1 Edge
[image] Super-fast version of the Flux model, optimized by Skytells for instant image generation.
TrueFusion Optima
[image] Expert-coordinated realism at production scale. TrueFusion 2.0 Optima is a next-generation MoE architecture delivering unmatched realism, lifelike lighting, and film-grade image precision.
BeatFusion 2.0
[audio] Skytells' flagship music generation model. Generates full-length songs with vocals, lyrics, and rich instrumentation from a text prompt.
BeatFusion 1.0
[audio] Skytells' first music generation model. Generates full-length songs with vocals, lyrics, and rich instrumentation from a text prompt.
DeepBrain Router
[text] DeepBrain Router is Skytells' advanced model orchestration layer, built to choose the right model for the right task. Optimized for coding, writing, reasoning, and complex multi-domain workloads, it dynamically routes requests across a curated set of flagship models from leading providers. The result is stronger output quality, improved cost-performance balance, and a more reliable AI experience at scale.
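
The routing idea described for DeepBrain Router can be illustrated with a minimal sketch. It assumes a hypothetical keyword-based task classifier and a hypothetical routing table; the model names are taken from this catalog for illustration only, and nothing here reflects how Skytells actually implements routing.

```python
import re
from typing import Optional

# Hypothetical routing table: task category -> model name (illustrative only).
ROUTES = {
    "coding": "GPT-5.3 Codex",
    "writing": "GPT-5",
    "reasoning": "GPT-5.4",
}
# Hypothetical fallback when no category matches.
DEFAULT_MODEL = "GPT-5.4 Mini"

# Hypothetical keyword hints used to guess the task category.
KEYWORDS = {
    "coding": ("code", "function", "bug", "compile"),
    "writing": ("essay", "draft", "rewrite", "email"),
    "reasoning": ("prove", "why", "analyze", "plan"),
}


def classify(prompt: str) -> Optional[str]:
    """Return the category with the most keyword hits, or None if nothing matches."""
    words = set(re.findall(r"[a-z]+", prompt.lower()))
    scores = {task: len(words.intersection(hints)) for task, hints in KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def route(prompt: str) -> str:
    """Pick a model name for the prompt, falling back to the default model."""
    return ROUTES.get(classify(prompt), DEFAULT_MODEL)
```

A production router would likely weigh cost, latency, and measured output quality rather than keywords; the sketch only shows the shape of the decision, mapping a request to one model from a curated set.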

1 model

4 models
FLUX-1.1 Pro
[image] Ultra-fast, ultra-high-resolution. More pixels in every image.
Flux 2 Pro Legacy
[image] High-quality image generation and editing with support for eight reference images.
Flux 2 Flex
[image] Max-quality image generation and editing with support for ten reference images.
FLUX.2 Pro
[image] Ultra-fast, ultra-high-resolution. More pixels in every image.

6 models
Imagen 3
[image] Google's highest quality text-to-image model, capable of generating images with detail, rich lighting, and beauty.
Imagen 4
[image] Google's flagship text-to-image model, capable of generating images with detail, rich lighting, and beauty.
Veo 3.1
[video] New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support.
Veo 3.1 Fast
[video] New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio, and last frame support.
Nano Banana
[image] Google's latest image editing model in Gemini 2.5.
Veo 3.1 (Preview)
[video] New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support.

1 model

1 model

8 models
GPT-Image-1
[image] A multimodal image generation model that creates high-quality images.
Sora 2
[video] OpenAI's most advanced video generation model with synced audio.
Sora 2 Pro
[video] OpenAI's most advanced video generation model with synced audio.
GPT-5
[text] OpenAI's new model excelling at coding, writing, and reasoning.
GPT-5.3 Codex
[text] GPT-5.3-Codex achieves state-of-the-art performance on SWE-Bench Pro, a rigorous evaluation of real-world software engineering. Where SWE-bench Verified only tests Python, SWE-Bench Pro spans four languages and is more contamination-resistant, challenging, diverse, and industry-relevant. It also far exceeds the previous state-of-the-art performance on Terminal-Bench 2.0, which measures the terminal skills a coding agent like Codex needs. Notably, GPT-5.3-Codex does so with fewer tokens than any prior model, letting users build more.
GPT-5.4
[text] GPT-5.4 is OpenAI's latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following. It is designed as a strong default for both general-purpose tasks and software engineering, capable of generating production-quality code, synthesizing information across multiple sources, and executing complex multi-step workflows with fewer iterations and greater token efficiency.
GPT-5.4 Mini
[text] GPT-5.4 Mini brings the strengths of GPT-5.4 to a faster, more efficient model designed for high-volume workloads.
GPT Image 2
[image] GPT Image 2 is OpenAI's state-of-the-art image generation model for fast, high-quality image generation and editing. It supports flexible image sizes and high-fidelity image inputs.
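
The context window figures quoted for GPT-5.4 above (922K input, 128K output) can be turned into a simple pre-flight check. This is a minimal sketch, assuming "K" means 1,000 tokens and that token counts are already known to the caller; the constant and function names are hypothetical and not part of any API.

```python
# Limits as quoted in the GPT-5.4 entry, assuming K = 1,000 tokens.
INPUT_LIMIT = 922_000
OUTPUT_LIMIT = 128_000
# Combined window implied by the two figures (consistent with "1M+").
TOTAL_WINDOW = INPUT_LIMIT + OUTPUT_LIMIT


def fits_context(input_tokens: int, max_output_tokens: int) -> bool:
    """Return True if both token counts stay within the quoted limits."""
    return (
        0 <= input_tokens <= INPUT_LIMIT
        and 0 <= max_output_tokens <= OUTPUT_LIMIT
    )
```

Under these assumptions, a request with 900,000 input tokens and a 100,000-token output cap fits, while one with 950,000 input tokens does not, regardless of the output cap.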
1 model