Models

mold supports 8 model families spanning different architectures, quality levels, and VRAM requirements.

Choosing a Model

| Need | Recommended | Why |
| --- | --- | --- |
| Fast iterations | flux2-klein:q8 | 4 steps, ungated, Apache 2.0 |
| Best quality | flux-dev:q4 | 25 steps, excellent detail |
| Low VRAM (<8 GB) | flux2-klein:q4 | 2.6 GB, 4 steps |
| Classic ecosystem | sd15:fp16 or dreamshaper-v8 | Huge model library, ControlNet |
| Fast + great | z-image-turbo:q8 | 9 steps, excellent quality |
| SDXL | sdxl-turbo:fp16 | 4 steps, 1024x1024 |

VRAM Guide

| Model | Variant | Approx. VRAM | Speed | Quality |
| --- | --- | --- | --- | --- |
| flux-schnell:q8 | Q8 | ~12 GB | Fast, 4 steps | Good |
| flux-schnell:q6 | Q6 | ~10 GB | Fast, 4 steps | Slightly below Q8 |
| flux-dev:q4 | Q4 | ~8 GB | Slow, 25 steps | Excellent |
| flux-dev:q6 | Q6 | ~10 GB | Slow, 25 steps | Best FLUX quality/size trade |
| flux-dev:bf16 | BF16 | ~24 GB | Slow, 25 steps | Best FLUX quality |
| flux2-klein:q4 | Q4 | ~4 GB | Fast, 4 steps | Good for very small GPUs |
| z-image-turbo:q8 | Q8 | ~10 GB | Fast, 9 steps | Excellent |
| sdxl-turbo:fp16 | FP16 | ~10 GB | Very fast, 4 steps | Good |
| sd15:fp16 | FP16 | ~6 GB | Medium, 25 steps | Good, broad ecosystem |
| qwen-image:q4 | Q4 | ~14 GB | Slow, 50 steps | Strong |

If a model's approximate VRAM is close to your card's limit, start with a smaller quantization or use --offload. Full BF16 FLUX can run on 24 GB cards, but offloading may kick in automatically and slow generation down.
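
The advice above can be sketched as a small shell helper that maps a VRAM budget to a tag from the table. This is only an illustration: `pick_model`, the 1 GB default headroom, and the choice of FLUX candidates are assumptions made for this example, not part of mold, and the figures are the approximate ones from the VRAM Guide.

```bash
# Hypothetical helper (not a mold command): suggest a model tag for a VRAM budget.
# Usage: pick_model VRAM_GB [HEADROOM_GB]
# The sizes are the approximate figures from the VRAM Guide table.
pick_model() {
  local vram_gb="$1"
  local headroom_gb="${2:-1}"   # assumed safety margin, in GB
  local entry tag need
  # tag=approx_GB pairs, highest quality first
  for entry in flux-dev:bf16=24 flux-dev:q6=10 flux-dev:q4=8 flux2-klein:q4=4; do
    tag="${entry%=*}"
    need="${entry#*=}"
    if [ $((need + headroom_gb)) -le "$vram_gb" ]; then
      echo "$tag"
      return 0
    fi
  done
  echo "no listed model fits in ${vram_gb} GB with headroom; try --offload" >&2
  return 1
}

pick_model 12   # prints flux-dev:q6
```

With the default headroom, a 24 GB card is steered to flux-dev:q6 rather than BF16, which mirrors the note that BF16 on 24 GB may fall back to offloading. On NVIDIA hardware, total memory in MiB can be read with `nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits` and divided by 1024 before calling the helper.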

Model Management

```bash
mold pull flux2-klein:q8     # Download a model
mold list                    # See what you have
mold info                    # Installation overview
mold info flux-dev:q4        # Model details + disk usage
mold rm dreamshaper-v8       # Remove a model
mold default flux-dev:q4     # Set default model
```

Name Resolution

Bare names auto-resolve by trying the suffixes :q8, :fp16, :bf16, and :fp8, in that order:

```bash
mold run flux2-klein "a cat"   # resolves to flux2-klein:q8
mold run sdxl-base "a cat"     # resolves to sdxl-base:fp16
```
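
The lookup order can be illustrated with a short shell function. This is a sketch of the idea only: `resolve_name` is a hypothetical name, the list of available tags is passed in by hand, and treating an already-tagged name as final is an assumption; mold's actual resolver may behave differently.

```bash
# Hypothetical sketch of bare-name resolution; mold's real logic may differ.
# Usage: resolve_name NAME AVAILABLE_TAG...
resolve_name() {
  local name="$1"
  shift
  # Assumption: a name that already carries a tag is used unchanged.
  case "$name" in
    *:*) echo "$name"; return 0 ;;
  esac
  local suffix candidate tag
  # Try each suffix in the documented order and return the first match.
  for suffix in q8 fp16 bf16 fp8; do
    candidate="$name:$suffix"
    for tag in "$@"; do
      if [ "$tag" = "$candidate" ]; then
        echo "$candidate"
        return 0
      fi
    done
  done
  echo "no variant found for '$name'" >&2
  return 1
}

resolve_name flux2-klein flux2-klein:q4 flux2-klein:q8   # prints flux2-klein:q8
resolve_name sdxl-base sdxl-base:fp16                    # prints sdxl-base:fp16
```

Because the suffix loop is the outer loop, the documented priority wins over the order in which tags are listed, so :q8 is chosen even when :q4 appears first.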

HuggingFace Auth

Some model repos require authentication:

```bash
export HF_TOKEN=hf_...
mold pull flux-dev:q4
```
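
In scripts, it can help to fail early when the token is missing instead of waiting for a download to be refused. The guard below is a minimal sketch; `require_hf_token` is a hypothetical helper written for this example, not something mold provides.

```bash
# Hypothetical pre-flight check for scripts; not a mold feature.
# Returns non-zero when HF_TOKEN is unset or empty.
require_hf_token() {
  if [ -z "${HF_TOKEN:-}" ]; then
    echo "HF_TOKEN is not set; repos that require authentication may refuse the download" >&2
    return 1
  fi
}

# Example: only attempt the pull when a token is present.
# require_hf_token && mold pull flux-dev:q4
```

The check only confirms that the variable is non-empty; it does not verify that the token is valid or that it has access to a given repo.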

All Families

| Family | Native Resolution | Architecture |
| --- | --- | --- |
| FLUX.2 | 1024x1024 | Qwen3 encoder, 4B transformer |
| FLUX.1 | 1024x1024 | Flow-matching transformer |
| SDXL | 1024x1024 | Dual-CLIP, UNet |
| SD 1.5 | 512x512 | CLIP-L, UNet |
| SD 3.5 | 1024x1024 | Triple encoder, MMDiT |
| Z-Image | 1024x1024 | Qwen3 encoder, 3D RoPE |
| Wuerstchen | 1024x1024 | 3-stage cascade, 42x compress |
| Qwen-Image | 1024x1024 | Qwen2.5-VL, flow-matching, CFG |

Each family page lists recommended dimensions for non-square aspect ratios. Using non-recommended dimensions will trigger a warning.