Saved in:
Bibliographic Details
Main Authors: Zeng, E. Zhixuan, Chen, Yuhao, Wong, Alexander
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.03582
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917686264463360
author Zeng, E. Zhixuan
Chen, Yuhao
Wong, Alexander
author_facet Zeng, E. Zhixuan
Chen, Yuhao
Wong, Alexander
contents Image generation techniques, particularly latent diffusion models, have exploded in popularity in recent years. Many techniques have been developed to manipulate and clarify the semantic concepts these large-scale models learn, offering crucial insights into biases and concept relationships. However, these techniques are often only validated in conventional realms of human or animal faces and artistic style transitions. The food domain offers unique challenges through complex compositions and regional biases, which can shed light on the limitations and opportunities within existing methods. Through the lens of food imagery, we analyze both qualitative and quantitative patterns within a concept traversal technique. We reveal measurable insights into the model's ability to capture and represent the nuances of culinary diversity, while also identifying areas where the model's biases and limitations emerge.
format Preprint
id arxiv_https___arxiv_org_abs_2406_03582
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Understanding the Limitations of Diffusion Concept Algebra Through Food
Zeng, E. Zhixuan
Chen, Yuhao
Wong, Alexander
Computer Vision and Pattern Recognition
Artificial Intelligence
Image generation techniques, particularly latent diffusion models, have exploded in popularity in recent years. Many techniques have been developed to manipulate and clarify the semantic concepts these large-scale models learn, offering crucial insights into biases and concept relationships. However, these techniques are often only validated in conventional realms of human or animal faces and artistic style transitions. The food domain offers unique challenges through complex compositions and regional biases, which can shed light on the limitations and opportunities within existing methods. Through the lens of food imagery, we analyze both qualitative and quantitative patterns within a concept traversal technique. We reveal measurable insights into the model's ability to capture and represent the nuances of culinary diversity, while also identifying areas where the model's biases and limitations emerge.
title Understanding the Limitations of Diffusion Concept Algebra Through Food
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2406.03582