Skip to content

Upscaler Models

mold supports Real-ESRGAN super-resolution models for image upscaling. These models enhance image resolution by 2x or 4x using neural networks trained on image restoration tasks.

Model List

RRDBNet Architecture (High Quality)

The Residual-in-Residual Dense Block Network uses deep convolutional layers with dense connections for maximum quality.

ModelScaleParamsSize (FP16)Description
real-esrgan-x4plus:fp164x16.7M32 MBGeneral-purpose, best quality
real-esrgan-x4plus:fp324x16.7M64 MBSame model, FP32 precision
real-esrgan-x2plus:fp162x16.7M32 MB2x upscale, best quality
real-esrgan-x4plus-anime:fp164x4.5M8.5 MBAnime/illustration optimized (6 blocks)

SRVGGNetCompact Architecture (Fast)

A lightweight linear chain architecture optimized for speed. Uses significantly less compute than RRDBNet while maintaining good quality.

ModelScaleParamsSizeDescription
real-esrgan-anime-v3:fp324x0.6M2.4 MBFast anime/video upscaler

Architecture Details

RRDBNet

Input (3, H, W)
  ↓ Conv2d(3 → 64)
  ↓ [RRDB block × 23] — each: 3 × ResidualDenseBlock (5 convs with dense connections)
  ↓ Conv2d(64 → 64)
  ↓ Upsample 2x (nearest) + Conv2d
  ↓ Upsample 2x (nearest) + Conv2d   ← only for 4x models
  ↓ Conv2d(64 → 64) + LeakyReLU
  ↓ Conv2d(64 → 3)
Output (3, H×scale, W×scale)

SRVGGNetCompact

Input (3, H, W)
  ↓ Conv2d(3 → 64)
  ↓ [PReLU + Conv2d(64 → 64)] × N
  ↓ PReLU
  ↓ Conv2d(64 → 3×scale²)
  ↓ PixelShuffle(scale)
Output (3, H×scale, W×scale)

Downloading

bash
# Pull the default high-quality upscaler
mold pull real-esrgan-x4plus:fp16

# Pull the fast compact upscaler
mold pull real-esrgan-anime-v3:fp32

# List all available models including upscalers
mold list

HuggingFace Sources

All upscaler models are sourced from trusted HuggingFace repositories:

Comparison

Use CaseRecommended ModelWhy
Photo upscalingreal-esrgan-x4plus:fp16Best detail preservation
Anime/mangareal-esrgan-x4plus-anime:fp16Trained on anime data
Batch processingreal-esrgan-anime-v3:fp325-10x faster
Video framesreal-esrgan-anime-v3:fp32Smallest, fastest
Subtle enhancementreal-esrgan-x2plus:fp162x is less aggressive