OpenSource-Hub

stable-diffusion-webui

Application

AUTOMATIC1111/stable-diffusion-webui

A browser interface for Stable Diffusion text-to-image generation.

Overview

Provides a comprehensive web-based UI for generating images using Stable Diffusion models. Supports txt2img, img2img, inpainting, upscaling, and many advanced features like prompt editing, attention control, and custom scripts.

README Preview

# Stable Diffusion web UI
A web interface for Stable Diffusion, implemented using Gradio library.

## Features
[Detailed feature showcase with images](https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features):
- Original txt2img and img2img modes
- One click install and run script (but you still must install python and git)
- Outpainting
- Inpainting
- Color Sketch
- Prompt Matrix
- Stable Diffusion Upscale
- Attention, specify parts of text that the model should pay more attention to
    - a man in a `((tuxedo))` - will pay more attention to tuxedo
    - a man in a `(tuxedo:1.21)` - alternative syntax
    - select text and press `Ctrl+Up` or `Ctrl+Down` (or `Command+Up` or `Command+Down` if you're on a MacOS) to automatically adjust attention to selected text (code contributed by anonymous user)
- Loopback, run img2img processing multiple times
- X/Y/Z plot, a way to draw a 3 dimensional plot of images with different parameters
- Textual Inversion
    - have as many embeddings as you want and use any names you like for them
    - use multiple embeddings with different numbers of vectors per token
    - works with half precision floating point numbers
    - train embeddings on 8GB (also reports of 6GB working)
- Extras tab with:
    - GFPGAN, neural network that fixes faces
    - CodeFormer, face restoration tool as an alternative to GFPGAN
    - RealESRGAN, neural network upscaler
    - ESRGAN, neural network upscaler with a lot of third party models
    - SwinIR and Swin2SR ([see here](https://github.com/AUTOMATIC1111/stable-diffusion-webui/pull/2092)), neural network upscalers
    - LDSR, Latent diffusion super resolution upscaling
- Resizing aspect ratio options
- Sampling method selection
    - Adjust sampler eta values (noise multiplier)
    - More advanced noise setting options
- Interrupt processing at any time
- 4GB video card suppor

FAQ (2)

Troubleshooting
How to fix 'RuntimeError: Couldn't install clip' when launching Stable Diffusion webui on macOS?

This error occurs when pip fails to install the openai/CLIP dependency from its URL. Workaround:
1. Clone the CLIP repo manually: git clone https://github.com/openai/CLIP.git
2. Install it using the virtual environment's pip: ./stable-diffusion-webui/venv/bin/pip install ./CLIP
3. Run ./webui.sh again; the installer will skip CLIP since it's already present.
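
The workaround can be sketched as a small shell function. This is a minimal sketch, not the project's own tooling: the names `WEBUI_DIR` and `install_clip_manually` are hypothetical, and it assumes you run it from the directory that contains `stable-diffusion-webui` and that `webui.sh` has already created the `venv` directory.

```shell
# Hypothetical helper for the manual CLIP install described above.
# WEBUI_DIR is an illustrative variable, not something the webui reads.
WEBUI_DIR="${WEBUI_DIR:-./stable-diffusion-webui}"

install_clip_manually() {
  venv_pip="$WEBUI_DIR/venv/bin/pip"

  # Assumption: the venv already exists from an earlier webui.sh run.
  if [ ! -x "$venv_pip" ]; then
    echo "pip not found at $venv_pip" >&2
    return 1
  fi

  # Step 1: clone the CLIP repository unless a CLIP directory is already here.
  if [ ! -d ./CLIP ]; then
    git clone https://github.com/openai/CLIP.git || return 1
  fi

  # Step 2: install with the virtual environment's pip, not the system pip.
  "$venv_pip" install ./CLIP || return 1

  # Step 3 is left to the user: run webui.sh again from the webui directory.
  echo "CLIP installed; run webui.sh again."
}
```

Calling `install_clip_manually` performs steps 1 and 2. Using the venv's own pip matters because the webui checks for CLIP inside that environment, so a system-wide install would likely not be seen.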

GitHub Issue #17410
Troubleshooting
How to fix HSA_STATUS_ERROR_INVALID_ISA on RX 7600 XT with PyTorch ROCm 6.0?

Downgrade to PyTorch 2.2.0 built for ROCm 5.7 and set HSA_OVERRIDE_GFX_VERSION=11.0.0. ROCm 6.0 lacks stable support for gfx1102 (RX 7600 series); stay on ROCm 5.7 until support is fixed.
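
One way to apply this before launching is sketched below. Several details are assumptions that the answer above does not state: the `torchvision==0.17.0` pin (the release paired with torch 2.2.0), the `rocm5.7` wheel index URL, and the use of the webui's `TORCH_COMMAND` variable and `--reinstall-torch` flag to trigger the downgrade. Verify them against your setup before relying on this.

```shell
# Make the ROCm runtime treat the card as gfx1100 instead of gfx1102.
export HSA_OVERRIDE_GFX_VERSION=11.0.0

# Assumed install command for PyTorch 2.2.0 built against ROCm 5.7.
# The torchvision pin and index URL are not given in the original answer.
export TORCH_COMMAND="pip install torch==2.2.0 torchvision==0.17.0 --index-url https://download.pytorch.org/whl/rocm5.7"

# Then relaunch from the webui directory so the new command is used, e.g.:
#   ./webui.sh --reinstall-torch
```

Putting the two `export` lines in `webui-user.sh` should keep the settings across launches, so the override does not have to be retyped each time.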

GitHub Issue #15434