Saved in:
Bibliographic Details
Main Authors: Wang, Xi, Dufour, Nicolas, Andreou, Nefeli, Cani, Marie-Paule, Abrevaya, Victoria Fernandez, Picard, David, Kalogeiton, Vicky
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.13040
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909463512875008
author Wang, Xi
Dufour, Nicolas
Andreou, Nefeli
Cani, Marie-Paule
Abrevaya, Victoria Fernandez
Picard, David
Kalogeiton, Vicky
author_facet Wang, Xi
Dufour, Nicolas
Andreou, Nefeli
Cani, Marie-Paule
Abrevaya, Victoria Fernandez
Picard, David
Kalogeiton, Vicky
contents Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights throughout the diffusion process, reporting superior results but without providing any rationale or analysis. By conducting comprehensive experiments, this paper provides insights into CFG weight schedulers. Our findings suggest that simple, monotonically increasing weight schedulers consistently lead to improved performances, requiring merely a single line of code. In addition, more complex parametrized schedulers can be optimized for further improvement, but do not generalize across different models and tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2404_13040
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Analysis of Classifier-Free Guidance Weight Schedulers
Wang, Xi
Dufour, Nicolas
Andreou, Nefeli
Cani, Marie-Paule
Abrevaya, Victoria Fernandez
Picard, David
Kalogeiton, Vicky
Computer Vision and Pattern Recognition
Machine Learning
Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights throughout the diffusion process, reporting superior results but without providing any rationale or analysis. By conducting comprehensive experiments, this paper provides insights into CFG weight schedulers. Our findings suggest that simple, monotonically increasing weight schedulers consistently lead to improved performances, requiring merely a single line of code. In addition, more complex parametrized schedulers can be optimized for further improvement, but do not generalize across different models and tasks.
title Analysis of Classifier-Free Guidance Weight Schedulers
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2404.13040