Documentation

About

Summary

| Project Name | Purpose | Type | Diffusers? | ComfyUI? |
| --- | --- | --- | --- | --- |
| Instant Family | Identity-picture to picture | ID Prompt | | |
| Stylus Diffusion | Selects LoRAs from a database | Meta-System | | |
| Align Your Steps | Optimum scheduler; better images and better prompt adherence | Scheduler | | |
| HiDiffusion | Faster inference, better images at 2048px and 4096px | Model-Modifier | | |
| Hyper-SD | Only 1 - 8 steps in the scheduler; uses TCD | LoRA, Scheduler | | |
| VideoGigaGAN | Upscale videos | Model | | |
| PanFusion | 360-degree panoramic image generation | Model | | |
| TCD (Trajectory Consistent Distillation) | Turbo Scheduler | Scheduler | | |
| Stable Diffusion 3 | Text-To-Image Model | Model | | |
| PhotoMaker | Uses its own model to process images; adds identity to existing models | Model, Model-Modifier | | |
| GigaGAN | Text to image | Model | | |

Instant Family

Description: Given reference face images of several people, generates images in which all of those people appear.

  • Date: May 2024
  • Authors: Chanran Kim, Jeongin Lee, Shichang Joung, Bongmo Kim, Yeul-Min Baek
  • Paper: https://arxiv.org/abs/2404.19427
  • Code: No weights or code released yet; a release may follow soon.

Story Diffusion

Description: Uses an attention-based technique (Consistent Self-Attention) to keep characters consistent across a series of generated images.


Stylus

Description: When the user enters a prompt, this meta-system selects the LoRAs from a database of available ones that best match the request and automatically mixes them into the generation.
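
The selection step can be pictured with a minimal sketch. Everything here is hypothetical: the LoRA names, their descriptions, and the `select_loras` helper are invented for illustration, and simple keyword overlap stands in for whatever retrieval method Stylus actually uses.

```python
import re

# Hypothetical LoRA catalog; names and descriptions are invented for illustration.
LORA_DB = {
    "watercolor_style": "soft watercolor painting style with pastel colors",
    "pixel_art": "retro pixel art game sprite style",
    "cyberpunk_city": "neon cyberpunk city at night with rain",
}


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def select_loras(prompt, db, top_k=2):
    """Return up to top_k (name, weight) pairs, best match first.

    A LoRA's score is the number of words its description shares with the
    prompt (an assumption made for this sketch). Weights are the scores
    normalized so that they sum to 1.
    """
    prompt_words = tokenize(prompt)
    scored = []
    for name, description in db.items():
        overlap = len(prompt_words & tokenize(description))
        if overlap > 0:
            scored.append((overlap, name))
    # Highest score first; ties are broken alphabetically by name.
    scored.sort(key=lambda item: (-item[0], item[1]))
    chosen = scored[:top_k]
    total = sum(score for score, _ in chosen)
    return [(name, score / total) for score, name in chosen]


if __name__ == "__main__":
    print(select_loras("a neon city street at night, pixel art", LORA_DB))
```

The returned weights could then serve as per-LoRA mixing strengths when the adapters are loaded into a pipeline. A prompt that matches nothing in the catalog yields an empty list, so generation would fall back to the base model.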


Align Your Steps

Description: A theoretical analysis from Nvidia that derives the optimal denoising schedule for diffusion models, resulting in better image quality and better prompt adherence.
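
For readers unfamiliar with the term, a denoising schedule is a decreasing list of noise levels, one per sampling step. The sketch below does not implement Nvidia's optimization. It only shows, with made-up noise levels and a hypothetical `resample_schedule` helper, how a fixed schedule could be stretched or shrunk to a different step count by interpolating in log-noise space.

```python
import math


def resample_schedule(sigmas, num_steps):
    """Resample a decreasing noise schedule to num_steps + 1 noise levels.

    Interpolates linearly in log(sigma) between neighbouring entries, so
    the first and last noise levels are preserved.
    """
    if len(sigmas) < 2:
        raise ValueError("need at least two noise levels")
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")
    logs = [math.log(s) for s in sigmas]
    last = len(logs) - 1
    resampled = []
    for i in range(num_steps + 1):
        pos = i / num_steps * last       # fractional index into the input
        lo = min(int(pos), last - 1)     # left neighbour, clamped at the end
        frac = pos - lo
        log_sigma = logs[lo] + frac * (logs[lo + 1] - logs[lo])
        resampled.append(math.exp(log_sigma))
    return resampled


if __name__ == "__main__":
    # Made-up noise levels, NOT the values published for Align Your Steps.
    base = [10.0, 1.0, 0.1]
    print(resample_schedule(base, 4))
```

An optimized schedule would replace the made-up `base` list with tuned values; the resampling step itself stays the same.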


HiDiffusion

Description: Modifies existing Stable Diffusion models to generate higher-resolution images (2048px or 4096px) directly, without duplication artifacts, and provides a speed improvement.


Hyper-SD

Description: A LoRA add-on for SD1 and SDXL that lets these models run in 1 to 8 steps, greatly reducing inference time.


VideoGigaGAN

Description: A video-super-resolution model from Adobe that upscales videos from 128px to 1024px.


PanFusion

Description: 360-degree panoramic image generation, trained on Matterport 3D data. Works well for generating Skyboxes.
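
Using a panorama as a skybox means looking up a pixel for each view direction. The sketch below assumes the panorama is stored in the common equirectangular layout (the description above does not state the format) and uses an assumed axis convention of +z forward, +x right, +y up. The function name is hypothetical.

```python
import math


def direction_to_equirect_pixel(x, y, z, width, height):
    """Map a 3D view direction to (column, row) in an equirectangular image.

    Assumed convention: +z is forward (image center), +x is right,
    +y is up. Returns fractional pixel coordinates.
    """
    length = math.sqrt(x * x + y * y + z * z)
    if length == 0:
        raise ValueError("direction must be non-zero")
    # Longitude in [-pi, pi]; 0 means looking straight ahead.
    lon = math.atan2(x, z)
    # Latitude in [-pi/2, pi/2]; clamp to guard against rounding error.
    lat = math.asin(max(-1.0, min(1.0, y / length)))
    u = (lon / (2 * math.pi) + 0.5) * width
    v = (0.5 - lat / math.pi) * height
    return u, v


if __name__ == "__main__":
    # Looking straight ahead lands in the center of a 2048x1024 panorama.
    print(direction_to_equirect_pixel(0.0, 0.0, 1.0, 2048, 1024))
```

A skybox renderer would evaluate this mapping per screen pixel, or once per cube-face texel when baking the panorama into a cube map.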


Trajectory Consistent Distillation (TCD)

Description: A new scheduler for turbo (few-step) diffusion models, intended as a replacement for the LCM (Latent Consistency Model) scheduler.