NixOS

mold provides a NixOS module for declarative server and Discord bot deployment.

Flake Setup

Add mold to your flake inputs and import the module:

nix

{
  inputs.mold.url = "github:utensils/mold";

  outputs = { self, nixpkgs, mold, ... }: {
    nixosConfigurations.myhost = nixpkgs.lib.nixosSystem {
      modules = [
        mold.nixosModules.default
        ./mold.nix  # your mold config (see below)
      ];
    };
  };
}

Minimal Configuration

nix

{ inputs, system, ... }:
{
  services.mold = {
    enable = true;
    package = inputs.mold.packages.${system}.default;  # Ada / RTX 40-series
  };
}

This starts mold serve on port 7680 with sensible defaults, creates a mold system user, and manages the data directory at /var/lib/mold.

Web gallery is bundled

Since v0.8.1 the Vue 3 gallery SPA is embedded directly into the mold binary at compile time — visiting http://<host>:7680/ opens the gallery with no extra configuration. Earlier versions required staging web/dist/ into ~/.mold/web or pointing MOLD_WEB_DIR at a built SPA. That override still works for SPA hot-iteration without recompiling Rust.

Full Configuration Example

nix

{ inputs, system, config, ... }:
{
  services.mold = {
    enable = true;

    # Package — must match your GPU architecture
    package = inputs.mold.packages.${system}.default;     # Ada (RTX 4090, sm_89)
    # package = inputs.mold.packages.${system}.mold-sm120; # Blackwell (RTX 5090, sm_120)

    # Advisory hint — emits a build warning if package doesn't match
    # cudaArch = "blackwell";

    # Server
    port = 7680;
    bindAddress = "0.0.0.0";
    logLevel = "info";         # trace, debug, info, warn, error
    openFirewall = false;      # set true to allow LAN access

    # Directories
    homeDir = "/var/lib/mold";           # MOLD_HOME
    # modelsDir = "/var/lib/mold/models"; # defaults to homeDir/models

    # Models
    defaultModel = "flux2-klein:q8";

    # Multi-GPU — pin the server to specific cards (null = use all visible)
    # gpus = "0,1";
    # queueSize = 200; # max queued jobs; overflow returns HTTP 503

    # Image persistence — save copies of all server-generated images
    # outputDir = "/srv/mold/gallery";

    # CORS — restrict to specific origin (null = permissive)
    # corsOrigin = "https://mysite.example.com";

    # HuggingFace auth — for gated model repos
    # Points to a file containing the token (e.g. agenix secret)
    hfTokenFile = config.age.secrets.hf-token.path;

    # API key authentication — file with one key per line (e.g. agenix secret)
    # When set, all API requests require an X-Api-Key header
    # apiKeyFile = config.age.secrets.mold-api-key.path;

    # Rate limiting — per-IP, generation endpoints at configured rate, reads at 10x
    # rateLimit = "10/min";
    # rateLimitBurst = 20;

    # Extra environment variables
    environment = {
      MOLD_EAGER = "1";        # keep all components loaded
      MOLD_T5_VARIANT = "q4";  # use Q4 T5 encoder
      # MOLD_THUMBNAIL_WARMUP = "1"; # opt in to startup gallery thumbnail warmup
    };

    # Discord bot
    discord = {
      enable = true;
      # Must be an EnvironmentFile: MOLD_DISCORD_TOKEN=your-token-here
      tokenFile = config.age.secrets.discord-token.path;
      # moldHost = "http://localhost:7680";  # defaults to main server
      cooldownSeconds = 10;
      # allowedRoles = "artist, 1234567890";  # restrict to specific roles
      # dailyQuota = 20;                       # max generations per user per day
      logLevel = "info";
    };
  };
}

Module Options Reference

Server Options

Option	Type	Default	Description
`enable`	bool	`false`	Enable the mold server
`package`	package	—	The mold package (must set explicitly)
`cudaArch`	null/enum	`null`	`"ada"` or `"blackwell"` — advisory warning only
`port`	port	`7680`	HTTP server port
`bindAddress`	string	`"0.0.0.0"`	Address to bind
`homeDir`	string	`"/var/lib/mold"`	Base directory (MOLD_HOME)
`modelsDir`	string	`homeDir + /models`	Model storage directory
`logLevel`	enum	`"info"`	Log level (trace/debug/info/warn/error)
`corsOrigin`	null/string	`null`	CORS origin restriction (null = permissive)
`openFirewall`	bool	`false`	Open firewall port
`defaultModel`	null/string	`null`	Default model name
`gpus`	null/string	`null`	Which GPUs to use: `"0,1"` or `"all"` (null = every visible GPU)
`queueSize`	null/int	`null`	Max queued generation jobs (null = default 200)
`outputDir`	null/string	`null`	Image output directory (default: `homeDir/output`)
`hfTokenFile`	null/path	`null`	Path to file with HuggingFace token
`apiKeyFile`	null/path	`null`	Path to file with API key(s) for authentication (e.g. agenix secret)
`rateLimit`	null/string	`null`	Per-IP rate limit (e.g. `"10/min"`)
`rateLimitBurst`	null/int	`null`	Override burst allowance (defaults to 2x rate)
`logToFile`	bool	`false`	Enable file logging (in addition to journal)
`logDir`	string	`homeDir + /logs`	Directory for log files when `logToFile` is enabled
`logRetentionDays`	int	`7`	Days to retain rotated log files
`environment`	attrs	`{}`	Extra environment variables

Monitoring

Nix builds include the metrics feature. The server exposes GET /metrics in Prometheus text exposition format (HTTP request rates, generation duration, queue depth, GPU memory, uptime). The endpoint is excluded from auth and rate limiting, so Prometheus/Grafana Agent can scrape it without an API key.

Discord Bot Options

Option	Type	Default	Description
`discord.enable`	bool	`false`	Enable Discord bot service
`discord.package`	package	`config.services.mold.package`	Package for the bot
`discord.tokenFile`	path	—	File containing bot token
`discord.moldHost`	string	`"http://localhost:{port}"`	mold server URL
`discord.cooldownSeconds`	int	`10`	Per-user generation cooldown
`discord.allowedRoles`	string?	`null`	Comma-separated role names/IDs (`null` = all)
`discord.dailyQuota`	int?	`null`	Max generations per user per day (`null` = unlimited)
`discord.logLevel`	enum	`"info"`	Bot log level
`discord.environment`	attrs	`{}`	Extra environment variables for the Discord bot

What the Module Creates

System user mold:mold with home at homeDir
Directories via tmpfiles: homeDir, modelsDir, and outputDir (if set)
Systemd service mold.service — runs mold serve with:
- LD_LIBRARY_PATH=/run/opengl-driver/lib for NixOS CUDA driver access
- video and render supplementary groups for GPU access
- Hardened: NoNewPrivileges, ProtectSystem=full, ProtectHome, PrivateTmp
- HuggingFace token loaded via EnvironmentFile (never in process env)
Systemd service mold-discord.service (if discord.enable) — runs mold discord, depends on mold.service, further hardened with ProtectSystem=strict and PrivateDevices (no GPU needed)
Firewall rule if openFirewall = true

GPU Architecture

The module cannot auto-select the flake package — you must set package to match your GPU:

GPU	Package
RTX 40-series (Ada)	`inputs.mold.packages.${system}.default`
RTX 50-series (Blackwell)	`inputs.mold.packages.${system}.mold-sm120`

Set cudaArch = "blackwell" as a reminder — it emits a build warning if you forget to switch the package.

Build Variants

AdaBlackwell

bash

nix build github:utensils/mold

bash

nix build github:utensils/mold#mold-sm120

Development Shell

bash

nix develop github:utensils/mold

The devshell includes Rust toolchain, CUDA toolkit, and convenience commands:

Command	Description
`build`	Fast local `mold` build (`dev-fast`) with embedded web UI
`build-workspace`	`cargo build` (debug, all crates)
`build-release`	Shipping release build with the full feature set
`build-server`	Fast local server build with GPU + preview + expand
`serve`	Start the mold server
`generate`	Generate an image
`mold`	Run any mold CLI command
`check`	`cargo check`
`clippy`	`cargo clippy`
`fmt`	`cargo fmt`
`run-tests`	`cargo test`
`coverage`	Test coverage report
`docs-dev`	Start VitePress docs dev server
`docs-build`	Build the documentation site
`docs-fmt`	Format docs with prettier

NixOS ​

Flake Setup ​

Minimal Configuration ​

Full Configuration Example ​

Module Options Reference ​

Server Options ​

Monitoring ​

Discord Bot Options ​

What the Module Creates ​

GPU Architecture ​

Build Variants ​

Development Shell ​

NixOS

Flake Setup

Minimal Configuration

Full Configuration Example

Module Options Reference

Server Options

Monitoring

Discord Bot Options

What the Module Creates

GPU Architecture

Build Variants

Development Shell