stable-diffusion-webui
ApplicationAUTOMATIC1111/stable-diffusion-webui
A browser interface for Stable Diffusion text-to-image generation.
Overview
Provides a comprehensive web-based UI for generating images using Stable Diffusion models. Supports txt2img, img2img, inpainting, upscaling, and many advanced features like prompt editing, attention control, and custom scripts.
README Preview
# Stable Diffusion web UI\r\nA web interface for Stable Diffusion, implemented using Gradio library.\r\n\r\n\r\n\r\n## Features\r\n[Detailed feature showcase with images](https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features):\r\n- Original txt2img and img2img modes\r\n- One click install and run script (but you still must install python and git)\r\n- Outpainting\r\n- Inpainting\r\n- Color Sketch\r\n- Prompt Matrix\r\n- Stable Diffusion Upscale\r\n- Attention, specify parts of text that the model should pay more attention to\r\n - a man in a `((tuxedo))` - will pay more attention to tuxedo\r\n - a man in a `(tuxedo:1.21)` - alternative syntax\r\n - select text and press `Ctrl+Up` or `Ctrl+Down` (or `Command+Up` or `Command+Down` if you're on a MacOS) to automatically adjust attention to selected text (code contributed by anonymous user)\r\n- Loopback, run img2img processing multiple times\r\n- X/Y/Z plot, a way to draw a 3 dimensional plot of images with different parameters\r\n- Textual Inversion\r\n - have as many embeddings as you want and use any names you like for them\r\n - use multiple embeddings with different numbers of vectors per token\r\n - works with half precision floating point numbers\r\n - train embeddings on 8GB (also reports of 6GB working)\r\n- Extras tab with:\r\n - GFPGAN, neural network that fixes faces\r\n - CodeFormer, face restoration tool as an alternative to GFPGAN\r\n - RealESRGAN, neural network upscaler\r\n - ESRGAN, neural network upscaler with a lot of third party models\r\n - SwinIR and Swin2SR ([see here](https://github.com/AUTOMATIC1111/stable-diffusion-webui/pull/2092)), neural network upscalers\r\n - LDSR, Latent diffusion super resolution upscaling\r\n- Resizing aspect ratio options\r\n- Sampling method selection\r\n - Adjust sampler eta values (noise multiplier)\r\n - More advanced noise setting options\r\n- Interrupt processing at any time\r\n- 4GB video card suppor
FAQ (2)
TroubleshootingHow to fix 'RuntimeError: Couldn't install clip' when launching Stable Diffusion webui on macOS?
This error occurs due to pip failing to install the openai/CLIP dependency from the URL. Workaround: 1. Clone the CLIP repo manually: git clone https://github.com/openai/CLIP.git. 2. Install it using the virtual environment's pip: ./stable-diffusion-webui/venv/bin/pip install ./CLIP. 3. Run ./webui.sh again; the installer will skip CLIP since it's already present.
TroubleshootingHow to fix HSA_STATUS_ERROR_INVALID_ISA on RX 7600 XT with PyTorch ROCm 6.0?
Downgrade to PyTorch 2.2.0 with ROCm 5.7 and set HSA_OVERRIDE_GFX_VERSION=11.0.0. ROCm 6.0 lacks stable support for gfx1102 (RX 7600 series); use 5.7 until fixed.