Diffusion Networks & GANs
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Custom nodes pack for ComfyUI. This custom node helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.
Extended faceswap extension for StableDiffusion web-ui with multiple faceswaps, inpainting, checkpoints, ....
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
A powerful and modular stable diffusion GUI with a graph/nodes interface.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
WebUI extension for ControlNet
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The ComfyUI version of sd-webui-segment-anything.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
Convert your videos to DensePose and use them with MagicAnimate
Fast and Simple Face Swap Extension Node for ComfyUI
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Various AI scripts. Mostly Stable Diffusion stuff.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official repository of In-Context LoRA for Diffusion Transformers
Official inference repo for FLUX.1 models
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)