Local AI · Image generation

Generate and edit images locally.

LumaBrowser bundles a local image-generation server built on stable-diffusion.cpp. Text-to-image and image editing run entirely on your own machine, no cloud service, no per-image fee, and nothing leaves your device. Describe a picture and get a render; open one you already have and repaint just the part you want.

Download LumaBrowser - free See the model catalog

stable-diffusion.cpp under the hood · runs on your GPU · nothing leaves your machine

This is local inference, not a hosted API. The generation server runs on your own machine. Your prompts and the images you create, including any photos you bring in to edit, never leave the device. There is no inference provider in the loop, no upload of your source images, and no per-image cost.

Two things it does

The same local server covers both ends of the workflow: making an image from nothing, and reworking one you already have.

Create

Text-to-image

Describe an image in plain language and get a render. Pick a generation model, set the size, and the local server produces the picture on your GPU, no queue, no credits, no upload.

Edit

Image editing

Open an image and change it without starting over:

Image-to-image - re-render an existing picture toward a new prompt while keeping its composition.
Inpainting - mask the region you want changed and repaint only that area, leaving the rest untouched.
Reference / identity images - supply a subject so a character or object stays consistent across edits.

A curated model catalog

Models download on demand from Hugging Face; nothing is bundled until you choose it. The catalog spans fast turbo models, high-fidelity renderers, and a dedicated editing model, each listed with a short note and an approximate download size.

Model	What it's for	Approx size
Z-Image Turbo	Fast 8-step bilingual text-to-image - a quick default for everyday renders.	~6 GB
Qwen-Image-Edit 2509	20B editing model for character / subject continuity via reference images.	~18 GB
Stable Diffusion 1.5	The classic - runs on 4 GB cards, native 512×512.	~2 GB
SDXL 1.0	Higher fidelity, native 1024×1024.	~6.5 GB
FLUX.1-schnell	Photorealistic 4-step generation.	~12 GB

An optional Anima anime model add-on is available too. And the catalog isn't a ceiling: you can bring your own single-file .safetensors or .gguf model, or import a Hugging Face repo directly. Approximate sizes are for the on-demand download; the exact figure depends on the variant you choose.

Faster renders and new styles with LoRAs

Drop a LoRA into your library and attach it to a model from a per-model “LoRAs / speed” panel. Two kinds are worth knowing: acceleration LoRAs that collapse a render to a handful of steps, and style LoRAs that reshape the look.

Speed

Acceleration LoRAs - about 8 steps

Attach a distillation LoRA (DMD2, SDXL-Lightning, or Hyper) and tick the speed preset. LumaBrowser flips the model to 8 steps, CFG 1.0, and the euler sampler, so a render that took dozens of steps lands in a fraction of the time on the same GPU.

Style

Style LoRAs - a new look on tap

Any style .safetensors in your loras/ folder can be attached to an SD or SDXL model with a per-LoRA weight. Stack a look on top of a base model without swapping the whole checkpoint.

LoRAs load with the model, so attaching one restarts that model's slot; the next render picks it up. They apply to the still-image SD/SDXL family. See the docs for the exact controls.

Two slots, both resident

LumaBrowser keeps a generation model and an edit model loaded at the same time, with separate VRAM reservations. You can create and refine in the same session without paying the reload tax between steps.

Slot 1

Generation model

Your text-to-image model stays loaded, Z-Image Turbo, SDXL, FLUX.1-schnell, or whatever you picked, ready to render the next prompt instantly.

Slot 2

Edit model

An editing model like Qwen-Image-Edit holds its own VRAM reservation in parallel, so jumping from a fresh render to an inpaint or reference-image edit doesn't unload anything.

Ask the AI for a picture

Image generation isn't a separate tool you have to switch into. From the built-in AI Chat, the agent has generate_image and edit_image tools; just ask in the conversation.

generate_image

“Make me a picture of…”

Describe what you want in chat and the agent calls the local generation model. The result comes back as an image artifact you can open full-size or download, rendered on your machine, not fetched from a service.

edit_image

“Change this part…”

Point the agent at an image and ask for an edit, image-to-image, an inpaint, or a reference-guided change. It runs the edit model and returns the new image as an artifact in the conversation.

For setting up the chat model itself, hardware-aware picks, resumable downloads, streaming chat, see Local AI.

Runs on your GPU

The image server uses the same hardware backends as the rest of LumaBrowser's local AI, so it runs across the major vendors and Apple Silicon, with VRAM-aware placement so you can pin image models to specific GPUs.

NVIDIACUDA

The fast path on NVIDIA cards; the larger models in the catalog are happiest here.

Cross-vendorVulkan

Vulkan covers AMD, Intel, and other GPUs so you're not locked to one vendor.

Apple SiliconMetal

M-series Macs run the same models locally; no external GPU required.

Bigger models want more VRAM, the editing and FLUX models in particular. VAE tiling helps Qwen-Image-Edit fit under 24 GB. VRAM-aware placement lets you pin a specific image model to a specific GPU, which matters when you keep both the generation and edit slots resident. The same hardware detection that powers Local AI applies here.

Image generation that stays on your machine

Free, local, and private, text-to-image and full image editing on your own GPU, with a curated catalog and your own models welcome. No cloud service, no per-image fee, nothing leaving your device. Pair it with Local AI and the built-in APIs, or head home to see everything LumaBrowser does.

Download LumaBrowser - free Set up Local AI