Diffusers documentation
Normalization layers
Get started
Tutorials
OverviewUnderstanding pipelines, models and schedulersAutoPipelineTrain a diffusion modelLoad LoRAs for inferenceAccelerate inference of text-to-image diffusion models
Using Diffusers
Loading & Hub
OverviewLoad pipelines, models, and schedulersLoad and compare different schedulersLoad community pipelines and componentsLoad safetensorsLoad different Stable Diffusion formatsLoad adaptersPush files to the Hub
Tasks
OverviewUnconditional image generationText-to-imageImage-to-imageInpaintingText or image-to-videoDepth-to-image
Techniques
Textual inversionIP-AdapterMerge LoRAsDistributed inference with multiple GPUsImprove image quality with deterministic generationControl image brightnessPrompt weightingImprove generation quality with FreeU
Specific pipeline examples
OverviewStable Diffusion XLSDXL TurboKandinskyControlNetShap-EDiffEditDistilled Stable Diffusion inferencePipeline callbacksCreate reproducible pipelinesCommunity pipelinesContribute a community pipelineLatent Consistency Model-LoRALatent Consistency ModelTrajectory Consistency Distillation-LoRAStable Video Diffusion
Training
OverviewCreate a dataset for trainingAdapt a model to a new task
Models
Unconditional image generationText-to-imageStable Diffusion XLKandinsky 2.2WuerstchenControlNetT2I-AdaptersInstructPix2Pix
Methods
Taking Diffusers Beyond Images
Optimization
Conceptual Guides
PhilosophyControlled generationHow to contribute?Diffusers' Ethical GuidelinesEvaluating Diffusion Models
API
Main Classes
Loaders
Models
OverviewUNet1DModelUNet2DModelUNet2DConditionModelUNet3DConditionModelUNetMotionModelUViT2DModelVQModelAutoencoderKLAsymmetricAutoencoderKLTiny AutoEncoderConsistencyDecoderVAETransformer2DTransformer TemporalPrior TransformerControlNet
Pipelines
OverviewaMUSEdAnimateDiffAttend-and-ExciteAudioLDMAudioLDM 2AutoPipelineBLIP-DiffusionConsistency ModelsControlNetControlNet with Stable Diffusion XLDance DiffusionDDIMDDPMDeepFloyd IFDiffEditDiTI2VGen-XLInstructPix2PixKandinsky 2.1Kandinsky 2.2Kandinsky 3Latent Consistency ModelsLatent DiffusionLEDITS++MultiDiffusionMusicLDMPaint by ExamplePersonalized Image Animator (PIA)PixArt-αSelf-Attention GuidanceSemantic GuidanceShap-EStable Cascade
Stable Diffusion
OverviewText-to-imageImage-to-imageImage-to-videoInpaintingDepth-to-imageImage variationSafe Stable DiffusionStable Diffusion 2Stable Diffusion XLSDXL TurboLatent upscalerSuper-resolutionK-DiffusionLDM3D Text-to-(RGB, Depth), Text-to-(RGB-pano, Depth-pano), LDM3D UpscalerStable Diffusion T2I-AdapterGLIGEN (Grounded Language-to-Image Generation)
Stable unCLIPText-to-videoText2Video-ZerounCLIPUniDiffuserValue-guided samplingWuerstchenSchedulers
OverviewCMStochasticIterativeSchedulerConsistencyDecoderSchedulerDDIMInverseSchedulerDDIMSchedulerDDPMSchedulerDEISMultistepSchedulerDPMSolverMultistepInverseDPMSolverMultistepSchedulerDPMSolverSDESchedulerDPMSolverSinglestepSchedulerEulerAncestralDiscreteSchedulerEulerDiscreteSchedulerEDMEulerSchedulerEDMDPMSolverMultistepSchedulerHeunDiscreteSchedulerIPNDMSchedulerKarrasVeSchedulerKDPM2AncestralDiscreteSchedulerKDPM2DiscreteSchedulerLCMSchedulerLMSDiscreteSchedulerPNDMSchedulerRePaintSchedulerScoreSdeVeSchedulerScoreSdeVpSchedulerTCDSchedulerUniPCMultistepSchedulerVQDiffusionScheduler
Internal classes
You are viewing v0.27.2 version. A newer version v0.38.0 is available.
Normalization layers
Customized normalization layers for supporting various models in 🤗 Diffusers.
AdaLayerNorm
class diffusers.models.normalization.AdaLayerNorm
< source >( embedding_dim: int num_embeddings: int )
Norm layer modified to incorporate timestep embeddings.
AdaLayerNormZero
class diffusers.models.normalization.AdaLayerNormZero
< source >( embedding_dim: int num_embeddings: int )
Norm layer adaptive layer norm zero (adaLN-Zero).
AdaLayerNormSingle
class diffusers.models.normalization.AdaLayerNormSingle
< source >( embedding_dim: int use_additional_conditions: bool = False )
Norm layer adaptive layer norm single (adaLN-single).
As proposed in PixArt-Alpha (see: https://arxiv.org/abs/2310.00426; Section 2.3).
AdaGroupNorm
class diffusers.models.normalization.AdaGroupNorm
< source >( embedding_dim: int out_dim: int num_groups: int act_fn: Optional = None eps: float = 1e-05 )
Parameters
- embedding_dim (
int) — The size of each embedding vector. - num_embeddings (
int) — The size of the embeddings dictionary. - num_groups (
int) — The number of groups to separate the channels into. - act_fn (
str, optional, defaults toNone) — The activation function to use. - eps (
float, optional, defaults to1e-5) — The epsilon value to use for numerical stability.
GroupNorm layer modified to incorporate timestep embeddings.