Our AI Models

From concept art to full cinematic video, Layer brings together the industry's leading AI models in one place.

Image

FLUX.1 [dev]

Powerful, open-weight 12B image model. Excels in image quality, prompt adherence, and commercial use.

Black Forest Labs
Image

FLUX.1 Krea [dev]

Open-weight model co-developed with Krea AI. Excels in photorealism and aesthetics.

Black Forest Labs
Image

FLUX.1 SRPO [dev]

12B flow transformer fine-tuned with SRPO for exceptional photorealism and polished composition.

Black Forest Labs
Image

FLUX.1 [pro]

Flagship commercial T2I model offering superior prompt adherence, quality, and stylistic outputs.

Black Forest Labs
Image

FLUX 1.1 [pro]

Next-gen FLUX model. 6x faster with enhanced prompt adherence and top-tier quality for production.

Black Forest Labs
Image

FLUX 1.1 [pro] Ultra

Delivers ultra-high-res (up to 4MP) images with superior photorealism, detail, and speed.

Black Forest Labs
Image

FLUX.1 Kontext [dev]

Open-weight, multimodal model for context-aware image editing. Excels at iterative edits via text.

Black Forest Labs
Image

FLUX.1 Kontext [pro]

Pro-grade multimodal model for fast, iterative editing, style transfer, and consistency.

Black Forest Labs
Image

FLUX.1 Kontext [max]

Premium model for max editing performance, superior typography, and visual narrative consistency.

Black Forest Labs
Image

FLUX.1 [schnell]

Ultra-fast, open-source model. Generates high-quality images in 1-4 steps for rapid prototyping.

Black Forest Labs
Image

FLUX.2 [dev]

**FLUX.2 [dev]** is a powerful 32-billion-parameter open-weight model from [Black Forest Labs](https://blackforestlabs.ai/), representing the most advanced open-weight image generation and editing model available today. This groundbreaking checkpoint combines text-to-image synthesis and multi-image editing capabilities in a single unified model. It's designed for developers and researchers who want the freedom to run state-of-the-art image generation locally, with commercial licensing available for production use.

Black Forest Labs
Image

FLUX.2 [flex]

**FLUX.2 [flex]** is a highly controllable text-to-image model from [Black Forest Labs](https://blackforestlabs.ai/) that puts full creative control in your hands. This model excels at rendering text and fine details while allowing developers to fine-tune parameters like inference steps and guidance scale to perfectly balance quality, prompt adherence, and speed. It's ideal for creators who need precise control over the generation process and want to optimize their workflow for specific creative requirements.

Black Forest Labs
Image

FLUX.2 [pro]

**FLUX.2 [pro]** is the flagship commercial model from [Black Forest Labs](https://blackforestlabs.ai/), delivering state-of-the-art image quality that rivals the best closed-source models. This production-ready API offers no compromise between speed and quality, matching competitors in prompt adherence and visual fidelity while generating images faster and at lower cost. It's designed for professionals and production environments that demand the absolute best results without sacrificing efficiency.

Black Forest Labs
Image

FLUX.2 [dev] Edit

**FLUX.2 [dev] Edit** is the editing variant of the powerful 32B open-weight model from [Black Forest Labs](https://blackforestlabs.ai/). This advanced checkpoint enables sophisticated multi-image editing workflows, allowing creators to perform complex text-guided modifications with multiple reference images in a single unified model. It's ideal for developers and artists who want the flexibility of open-weight editing capabilities that can be run locally or integrated into custom workflows.

Black Forest Labs
Image

FLUX.2 [flex] Edit

**FLUX.2 [flex] Edit** is the editing variant of the highly controllable FLUX.2 model from [Black Forest Labs](https://blackforestlabs.ai/). This model excels at rendering text and fine details while giving developers full control over quality and speed through adjustable inference steps and guidance scale. It's perfect for creators who need precise parameter control for instruction-based image editing, allowing them to optimize the editing process for their specific requirements.

Black Forest Labs
Image

FLUX.2 [pro] Edit

**FLUX.2 [pro] Edit** is the flagship editing model from [Black Forest Labs](https://blackforestlabs.ai/), bringing state-of-the-art quality to instruction-based image editing. This production-ready API delivers no compromise between speed and quality, performing complex multi-image edits faster and at lower cost while maintaining exceptional visual fidelity and prompt adherence. It's designed for professional workflows and production environments that require the absolute best editing results with maximum efficiency.

Black Forest Labs
Image

FLUX.2 [max]

[FLUX.2 [max]](https://bfl.ai/models/flux-2-max) represents the pinnacle of AI image generation from Black Forest Labs, offering unparalleled quality for professional creative workflows. This high-performance model generates stunning 4-megapixel photorealistic images while ensuring the most precise adherence to even the most complex descriptive prompts. Its innovative grounded generation capability sets it apart by integrating real-time web context to accurately visualize current trends and global events with incredible accuracy. The model features industry-leading multi-reference support, allowing users to maintain perfect character consistency and spatial reasoning across diverse visual assets. Beyond standard generation, it provides sophisticated text rendering and lighting controls that are essential for high-end marketing, cinematic production, and product design. Discover how the [FLUX.2 model family](https://bfl.ai/models/flux-2-max) can transform your digital artistry with state-of-the-art visual intelligence and unmatched editing consistency.

Black Forest Labs
Image

FLUX.2 [max] Edit

[FLUX.2 [max]](https://bfl.ai/models/flux-2-max) represents the pinnacle of AI image generation from Black Forest Labs, offering unparalleled quality for professional creative workflows. This high-performance model generates stunning 4-megapixel photorealistic images while ensuring the most precise adherence to even the most complex descriptive prompts. Its innovative grounded generation capability sets it apart by integrating real-time web context to accurately visualize current trends and global events with incredible accuracy. The model features industry-leading multi-reference support, allowing users to maintain perfect character consistency and spatial reasoning across diverse visual assets. Beyond standard generation, it provides sophisticated text rendering and lighting controls that are essential for high-end marketing, cinematic production, and product design. Discover how the [FLUX.2 model family](https://bfl.ai/models/flux-2-max) can transform your digital artistry with state-of-the-art visual intelligence and unmatched editing consistency.

Black Forest Labs
Image

FLUX.2 [klein] 4B

**FLUX.2 [klein] 4B** is a lightweight text-to-image model from [Black Forest Labs](https://blackforestlabs.ai/), featuring enhanced realism, crisper text generation, and native editing capabilities. The 4B parameter variant offers an excellent balance of quality and speed for professional creative workflows.

Black Forest Labs
Image

FLUX.2 [klein] 9B

**FLUX.2 [klein] 9B** is the larger variant of the FLUX.2 [klein] model family from [Black Forest Labs](https://blackforestlabs.ai/). With 9 billion parameters, it delivers superior image quality and more detailed outputs for demanding professional applications.

Black Forest Labs
Image

FLUX.2 [klein] 4B Edit

**FLUX.2 [klein] 4B Edit** brings native editing capabilities to the FLUX.2 [klein] model family from [Black Forest Labs](https://blackforestlabs.ai/). Perform precise image modifications using natural language descriptions with the efficient 4B parameter model.

Black Forest Labs
Image

FLUX.2 [klein] 9B Edit

**FLUX.2 [klein] 9B Edit** is the premium editing model in the FLUX.2 [klein] family from [Black Forest Labs](https://blackforestlabs.ai/). With 9 billion parameters, it delivers the highest quality image editing results for professional production workflows.

Black Forest Labs
Image

GPT Image 1

Powerful, versatile OpenAI image model for creative and professional apps.

OpenAI
Image

GPT Image 1.5

GPT Image 1.5 is the latest image generation model from OpenAI, with better instruction following and adherence to prompts.

OpenAI
Image

Qwen-Image

Open-source T2I model. Excels at high-res images from complex text, notable for clear, stylized text.

Qwen
Image

Qwen-Image Edit

Versatile, open-source image editor. Performs modifications via text instructions.

Qwen
Image

Qwen-Image Edit 2509

Advanced image editing model. Enhanced performance for fine-grained manipulation.

Qwen
Image

Qwen-Image Layered

Split an image into layers using Qwen-Image Layered.

Qwen
Image

Qwen-Image Edit 2511

Latest Qwen image editing model. Supports prompt-guided transformations with enhanced quality.

Qwen
Image

Qwen-Image 2512

Latest Qwen text-to-image model with enhanced quality, detail, and prompt adherence.

Qwen
Image

Qwen Image 2

Qwen Image 2 standard text-to-image model with strong prompt adherence and diverse style support.

Qwen
Image

Qwen Image 2 Pro

Qwen Image 2 Pro tier with higher quality text-to-image generation and enhanced detail.

Qwen
Image

Qwen Image 2 Edit

Qwen Image 2 standard image editing with prompt-guided transformations and multi-image input.

Qwen
Image

Qwen Image 2 Pro Edit

Qwen Image 2 Pro image editing with higher quality prompt-guided transformations.

Qwen
Image

Grok Imagine Image

xAI's image generation model capable of creating high-quality images from text prompts.

xAI
Image

Grok Imagine Image Edit

xAI's image editing model for modifying existing images using text prompts.

xAI
Image

Z-Image Turbo

Ultra-fast 6B parameter image model from Tongyi-MAI, optimized for near real-time generation.

Qwen
Image

DALL·E 3

OpenAI's flagship image model. Excels at complex prompts, high detail, and text rendering.

OpenAI
Image

Stable Diffusion 3

Latest open-weight MM-DiT model. Major improvements in quality, prompt following, and text rendering.

Stability AI
Image

Imagen 3

Cutting-edge T2I model. Exceptional detail, photorealism, and accurate text rendering.

Google
Image

Imagen 3 Fast

Speed-optimized Imagen 3. Delivers high-quality images fast, ideal for real-time previews.

Google
Image

Imagen 4

Next-gen image model for professionals. Unparalleled prompt adherence and high-resolution output.

Google
Image

Imagen 4 Fast

High-velocity Imagen 4 model, optimized for speed. Essential for quick iteration and interactive creative tools.

Google
Image

Imagen 4 Ultra

Google's highest quality image generation model.

Google
Image

Gemini 2.0 Flash Preview Image Generation

Conversational image generation model from Google.

Google
Image

Gemini 2.5 Flash Image

Google's image generation and editing model capable of multimodal reasoning.

Google
Image

Gemini 2.5 Flash Image Edit

Google's image generation and editing model capable of multimodal reasoning.

Google
Image

Gemini 3 Pro Image

Google's state-of-the-art image generation and editing model.

Google
Image

Gemini 3 Pro Image Edit

Google's state-of-the-art image generation and editing model.

Google
Image

Gemini 3.1 Flash Image

Google's fast, high-quality image generation model with multimodal reasoning.

Google
Image

Gemini 3.1 Flash Image Edit

Google's fast, high-quality image editing model with multimodal reasoning.

Google
Image

Step1X Edit

Cutting-edge, open-source image editor. Performs powerful, instruction-based edits via text.

Stepfun
Image

Recraft V3

Versatile image model for graphic design. Generates legible, stylized text and scalable vector art (SVG).

Recraft
Image

Seedream 4.0

Unified architecture for image generation and editing. Allows fluid movement from concept to refinement.

ByteDance
Image

Seedream 4.5

A new-generation image creation model from ByteDance, for both generation and editing.

ByteDance
Image

Seedream 5.0 Lite

Fast, high-quality image generation from ByteDance, optimized for creative advertising.

ByteDance
Image

Seedream 5.0 Lite Edit

Intelligent image editing from ByteDance, with multi-reference support for creative advertising.

ByteDance
Image

Kling Image O3

Kling Omni 3 image model. High-quality images with text rendering capabilities up to 4K resolution.

Kling
Image

Kling Image V3

Kling V3 image model. High-quality images with negative prompts, supports up to 2K resolution.

Kling
Image

Creative

Upscale images with high fidelity or creativity.

Clarity Upscaler
Image

Upscaler

Very fast upscaling with good quality.

ESRGAN
Image

Enhance

Enhance image quality with advanced upscaling.

Topaz
Image

Generative Enhance

Generative upscaling for enhanced details.

Topaz
Image

SeedVR2 Image Upscaler

ByteDance's SeedVR2 model for high-quality image upscaling.

ByteDance
Video

Veo 2

Legacy video model generating high-definition, long-form cinematic video content.

Google
Video

Veo 3

Next-gen video model with enhanced control over narrative, tone, and shot composition. Includes audio.

Google
Video

Veo 3 Fast

Speed-optimized Veo 3 variant for rapid video creation and iteration, ideal for short-form content.

Google
Video

Veo 3.1

Flagship video update: refined control, enhanced visual fidelity, and improved subtle motion details.

Google
Video

Veo 3.1 Fast

Speed-optimized Veo 3.1 model. Delivers high-quality video rapidly for dynamic workflows.

Google
Video

Kling v1.6 Pro

Powerful video model for high-fidelity, imaginative content with complex character motion.

Kling
Video

Kling v2.0 Master

Master-grade video model. Enhanced realism and physics simulation for cinematic, high-impact clips.

Kling
Video

Kling v2.1 Pro

Refined Kling model delivering professional-grade 1080p video with improved clarity and motion.

Kling
Video

Kling v2.1 Master

Premium 1080p video model. Maximum visual fidelity, capturing intricate details and lifelike expressions.

Kling
Video

Kling v2.5 Turbo Pro

High-speed video model. Produces rapid, professional-quality 1080p video for fast-paced content.

Kling
Video

Kling v2.6 Pro

Generate videos from images with native audio generation and fluid motion.

Kling
Video

Kling O1

Generate new videos from first and last frame images.

Kling
Video

Kling O1 Reference

Generate new videos guided by prompts, images or videos.

Kling
Video

Kling O1 Edit

Edit videos guided by a prompt or images.

Kling
Video

Kling O3 Pro

Generate high-quality videos with prompts, images, or reference elements. Pro tier for premium quality.

Kling
Video

Kling O3 Pro Edit

Edit videos guided by prompts and reference elements. Pro tier.

Kling
Video

Kling O3 Standard

Generate videos with prompts, images, or reference elements. Standard tier for balanced quality and speed.

Kling
Video

Kling O3 Standard Edit

Edit videos guided by prompts and reference elements. Standard tier.

Kling
Video

Kling V3 Pro

Generate high-quality videos with advanced control. Pro tier with negative prompts.

Kling
Video

Kling V3 Standard

Generate videos with prompts and images. Standard tier with negative prompts.

Kling
Video

Seedance Lite

Versatile and efficient video model. Optimized for speed and ideal for short clips and rapid prototypes.

ByteDance
Video

Seedance Pro

High-quality video model. Tuned for maximum visual fidelity and broadcast-quality 1080p output.

ByteDance
Video

Seedance 1.5 Pro

Next-generation Seedance video model with improved quality, 1080p output, and integrated audio generation.

ByteDance
Video

Wan 2.1

State-of-the-art video model. Generates detailed, stylistically diverse clips with fluid motion.

Alibaba
Video

Wan 2.2

Advanced video model. Enhanced visual consistency and detail for high-resolution 1080p content.

Alibaba
Video

Wan 2.5

Powerful video model. Optimized for top-tier 1080p cinematic quality and consistency.

Alibaba
Video

Wan 2.6

State-of-the-art multimodal video generation model from Alibaba, with native audio support.

Alibaba
Video

Minimax Video 01

Accessible video model. Generates engaging 720p clips from text with good visual consistency.

MiniMax
Video

Minimax Video 01 Live

Specialized video model. Optimized for a dynamic, live-action feel with naturalistic camera work.

MiniMax
Video

Minimax Hailuo-02 Standard

Robust video model producing crisp 768p video. Offers a solid balance of quality and reliable performance.

MiniMax
Video

Minimax Hailuo-02 Pro

Premium video model engineered for professional-grade 1080p output, superior fidelity, and smoother motion.

MiniMax
Video

Minimax Hailuo-2.3 Standard

Latest standard video model. Improved prompt understanding and visual consistency for daily creation.

MiniMax
Video

Minimax Hailuo-2.3 Pro

Pinnacle of Hailuo T2V series. Cinematic quality with superior coherence, detail, and artistic control.

MiniMax
Video

Minimax Hailuo-2.3 Fast

High-speed T2V variant. Optimized for rapid creation, iteration, and social media content workflows.

MiniMax
Video

Hunyuan Video

High-quality open-source video model from Tencent Hunyuan.

Tencent
Video

Hunyuan Video Foley

Specialized model to automatically create and sync sound effects (foley) for video content.

Tencent
Video

Higgsfield Turbo

High-speed video model for quick, efficient content production. Specializes in dynamic video effects and styles.

Higgsfield
Video

Framepack

Highly efficient, open-source I2V model that generates video by predicting the next frame.

Layer
Video

Magi Distilled

Fast, efficient open-source I2V model. Animates still images using an autoregressive approach.

Sand AI
Video

AI Avatar

Specialized model for creating realistic, audio-driven talking avatars with accurate lip-sync and expressions.

MultiTalk
Video

OmniHuman

Advanced video model bringing a still image of a person to life using audio, producing expressive videos.

ByteDance
Video

Ray 2

Large-scale, state-of-the-art video model for stunning realistic and coherent 1080p motion.

Luma Labs
Video

Ray 2 Flash

High-speed variant of Ray 2, optimized for rapid video creation. Perfect blend of speed and quality.

Luma Labs
Video

PixVerse v5

Powerful, user-friendly video model producing high-quality, stylized 1080p video with consistent motion.

PixVerse AI
Video

PixVerse v5.5

Latest PixVerse video model with enhanced quality and audio generation, supporting up to 1080p resolution.

PixVerse AI
Video

Grok Imagine Video

xAI's video generation model capable of creating high-quality 720p video from text and images.

xAI
Video

Grok Imagine Video Edit

xAI's video editing model for modifying existing videos using text prompts.

xAI
Video

Sora 2

Next-gen video model generating long, high-fidelity 720p video with unparalleled narrative understanding.

OpenAI
Video

Sora 2 Pro

Premium video version offering higher resolution (up to 1024p) and enhanced controls for pro projects.

OpenAI
Video

LTX Video 2.0 Fast

Versatile video model integrating video and audio creation in one seamless, speed-optimized workflow.

Lightricks
Video

LTX Video 2.0 Pro

High-fidelity video model designed for professional-quality results, offering superior detail and nuanced audio.

Lightricks
Video

LTX Video 2.3

Latest LTX video model with sharper details, cleaner audio, and portrait support for professional content creation.

Lightricks
Video

LTX Video 2.3 Fast

Speed-optimized LTX video model with extended duration support up to 20 seconds and portrait mode.

Lightricks
Video

ESRGAN Video Upscaler

ESRGAN model for video upscaling, enhancing resolution and detail.

ESRGAN
Video

Topaz Video Upscaler

Professional-grade video upscaling solution utilizing Topaz AI technology for high-quality enhancement.

Topaz
Video

SeedVR2 Video Upscaler

Powerful video upscaling model from ByteDance, optimized for high-quality resolution and fidelity improvement.

ByteDance
Video

Bria Video Increase Resolution

Bria's advanced AI technology designed to increase the resolution of video content efficiently.

Bria AI
Video

Bria Video Background Removal

Remove video backgrounds for advanced video editing.

Bria AI
3D

Rodin v2

Advanced 3D generation model creating high-quality, textured T-pose avatars from a single image.

Hyper3D
3D

Tripo v2.0

3D generator model featuring quad mesh output, PBR materials, and animation-ready outputs.

Tripo3D
3D

Tripo v2.5

Incremental 3D update. Refined performance, improved mesh topology, and texture fidelity.

Tripo3D
3D

Tripo v3.0

Latest 3D model for production-quality assets with superior textures and clean geometry.

Tripo3D
3D

Tripo Turbo v1.0

Speed-optimized 3D generation model designed for rapid prototyping and fast generation times.

Tripo3D
3D

Trellis

Open-source 3D model creating high-quality objects with realistic materials and geometry from text.

Microsoft
3D

Trellis 2

Open-source, high-quality 3D model from Microsoft, leveraging a novel field-free sparse voxel structure.

Microsoft
3D

Hunyuan 3D v2

Powerful, open-source 3D model producing high-res, textured 3D objects from text or image inputs.

Tencent
3D

Hunyuan 3D v2 Mini

Lightweight, efficient 3D version optimized for less powerful hardware. Delivers good quality assets.

Tencent
3D

Hunyuan 3D 3.0

Professional-grade 3D model optimized for high-quality, detailed assets with advanced features.

Tencent
3D

Hunyuan 3D v3.1 Pro

Latest Hunyuan 3D model with enhanced quality, multi-view input, and PBR material support.

Tencent
3D

Meshy V6 Preview

Preview of Meshy V6. High-fidelity 3D asset creation focusing on PBR, quad mesh, and face rigging.

Meshy
3D

Bytedance Seed 3D

Powerful 3D model focusing on high-quality objects from a single image. Adept at geometry & texture.

ByteDance
3D

Anything World Rig

Specialized mesh rigging model that automatically prepares 3D models with skeletons for animation.

Anything World
3D

Meshy V5 Retexture

Specialized Meshy 3D tool for quickly retexturing imported meshes using text prompts.

Meshy
3D

Meshy V5 Remesh

Dedicated Meshy 3D tool for remeshing models to optimize topology and reduce polygon count.

Meshy
Audio

Amelia - English Voice

A young British English woman's voice, clear and easy to understand. Expressive and enthusiastic, it's well suited to narration, podcasts, and social media such as YouTube, TikTok, Reels, and Stories. This studio-produced audio is great for a young woman's Gen-Z voice in audiobooks, high-quality video dubbing, advertising, and reading.

Layer
Audio

David - Newsreader and Educator

A clear and crisp middle-aged professional American voice in the style of broadcast news presenters. Great for the reader app and for long-form content.

Layer
Audio

Hope - Upbeat and clear

A young, friendly, and professional English voice. She is ideal for tutorials, presentations, and any content that requires a clear, engaging, and professional tone.

Layer
Audio

John - Husky & Engaging

A slightly husky and bassy voice with a standard American accent. Modulated, controlled, and direct, it is perfect for audiobooks, captivating narrations, storytelling, and other professional voiceover work.

Layer
Audio

Matthew - Anti-Hero, Villain, Rogue, Tough Guy

A deep, commanding voice with cinematic gravitas: rich, resonant, and powerful. It carries mystery and authority, sculpted with precision, like a dark guardian emerging from the shadows, unforgettable and bold. Perfect for an evil villain or rugged anti-hero. Voiced by Matthew Schmitz, a professional audiobook narrator with a large fan base.

Layer
Audio

Oxley - Grandpa

A friendly grandpa who knows how to enthrall his audience with tall tales and fun adventures.

Layer
Audio

Rubi - Playful and Teasing

A silky, feline-like, sultry American voice that flows playfully. It drips with charm, every word laced with subtle amusement and quiet danger. Whether it’s a breathy whisper against your ear or a slow, deliberate taunt, this voice is pure enchantment with a dangerous edge. Ideal for characters such as a villain, succubus, siren, mommy, witch, malefic queen, or demoness. Can be used for any entertainment content such as video games, animation, and audiobook narration.

Layer
Audio

Sage - Wise, Deliberate, Captivating

A deep, resonant male voice with a standard American accent and a slightly husky, raspy quality that remains pleasant. It has a controlled, measured, and direct delivery, perfect for authoritative narrations, compelling audiobooks, and professional voiceover requiring gravitas and clarity.

Layer
Audio

Unreal Tournament

The 'Unreal Tonemanagement 2003' voice, originally voiced by Christian Plasa, is a retro-futuristic announcer style inspired by classic arena shooters. With its sharp, metallic authority, it’s ideal for domination calls, game intros, and immersive content that channels high-tech tournament energy.

Layer
Audio

Wyatt - Wise Rustic Cowboy

Weathered wisdom from a cowboy who's lived a hard life on the range. An older, deep American male voice with a Southern flavor. Excellent for reading stories of the Wild West or American history.

Layer
Audio

ElevenLabs Sound Effects

Sound effects model from ElevenLabs: https://elevenlabs.io/

ElevenLabs
