Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.13040 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866909463512875008 |
|---|---|
| author | Wang, Xi Dufour, Nicolas Andreou, Nefeli Cani, Marie-Paule Abrevaya, Victoria Fernandez Picard, David Kalogeiton, Vicky |
| author_facet | Wang, Xi Dufour, Nicolas Andreou, Nefeli Cani, Marie-Paule Abrevaya, Victoria Fernandez Picard, David Kalogeiton, Vicky |
| contents | Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights throughout the diffusion process, reporting superior results but without providing any rationale or analysis. By conducting comprehensive experiments, this paper provides insights into CFG weight schedulers. Our findings suggest that simple, monotonically increasing weight schedulers consistently lead to improved performances, requiring merely a single line of code. In addition, more complex parametrized schedulers can be optimized for further improvement, but do not generalize across different models and tasks. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2404_13040 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | Analysis of Classifier-Free Guidance Weight Schedulers Wang, Xi Dufour, Nicolas Andreou, Nefeli Cani, Marie-Paule Abrevaya, Victoria Fernandez Picard, David Kalogeiton, Vicky Computer Vision and Pattern Recognition Machine Learning Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights throughout the diffusion process, reporting superior results but without providing any rationale or analysis. By conducting comprehensive experiments, this paper provides insights into CFG weight schedulers. Our findings suggest that simple, monotonically increasing weight schedulers consistently lead to improved performances, requiring merely a single line of code. In addition, more complex parametrized schedulers can be optimized for further improvement, but do not generalize across different models and tasks. |
| title | Analysis of Classifier-Free Guidance Weight Schedulers |
| topic | Computer Vision and Pattern Recognition Machine Learning |
| url | https://arxiv.org/abs/2404.13040 |