Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Epstein, Susan, Chen, Li, Vecchiato, Alessandro, Jain, Ankit
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence I.I.2 I.2.1
Online Access:	https://arxiv.org/abs/2402.17101
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909121119256576
author	Epstein, Susan Chen, Li Vecchiato, Alessandro Jain, Ankit
author_facet	Epstein, Susan Chen, Li Vecchiato, Alessandro Jain, Ankit
contents	Generative AI image models may inadvertently generate problematic representations of people. Past research has noted that millions of users engage daily across the world with these models and that the models, including through problematic representations of people, have the potential to compound and accelerate real-world discrimination and other harms (Bianchi et al, 2023). In this paper, we focus on addressing the generation of problematic associations between demographic groups and semantic concepts that may reflect and reinforce negative narratives embedded in social data. Building on sociological literature (Blumer, 1958) and mapping representations to model behaviors, we have developed a taxonomy to study problematic associations in image generation models. We explore the effectiveness of fine tuning at the model level as a method to address these associations, identifying a potential reduction in visual quality as a limitation of traditional fine tuning. We also propose a new methodology with twice-human-in-the-loop (T-HITL) that promises improvements in both reducing problematic associations and also maintaining visual quality. We demonstrate the effectiveness of T-HITL by providing evidence of three problematic associations addressed by T-HITL at the model level. Our contributions to scholarship are two-fold. By defining problematic associations in the context of machine learning models and generative AI, we introduce a conceptual and technical taxonomy for addressing some of these associations. Finally, we provide a method, T-HITL, that addresses these associations and simultaneously maintains visual quality of image model generations. This mitigation need not be a tradeoff, but rather an enhancement.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_17101
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality Epstein, Susan Chen, Li Vecchiato, Alessandro Jain, Ankit Computer Vision and Pattern Recognition Artificial Intelligence I.I.2 I.2.1 Generative AI image models may inadvertently generate problematic representations of people. Past research has noted that millions of users engage daily across the world with these models and that the models, including through problematic representations of people, have the potential to compound and accelerate real-world discrimination and other harms (Bianchi et al, 2023). In this paper, we focus on addressing the generation of problematic associations between demographic groups and semantic concepts that may reflect and reinforce negative narratives embedded in social data. Building on sociological literature (Blumer, 1958) and mapping representations to model behaviors, we have developed a taxonomy to study problematic associations in image generation models. We explore the effectiveness of fine tuning at the model level as a method to address these associations, identifying a potential reduction in visual quality as a limitation of traditional fine tuning. We also propose a new methodology with twice-human-in-the-loop (T-HITL) that promises improvements in both reducing problematic associations and also maintaining visual quality. We demonstrate the effectiveness of T-HITL by providing evidence of three problematic associations addressed by T-HITL at the model level. Our contributions to scholarship are two-fold. By defining problematic associations in the context of machine learning models and generative AI, we introduce a conceptual and technical taxonomy for addressing some of these associations. Finally, we provide a method, T-HITL, that addresses these associations and simultaneously maintains visual quality of image model generations. This mitigation need not be a tradeoff, but rather an enhancement.
title	T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality
topic	Computer Vision and Pattern Recognition Artificial Intelligence I.I.2 I.2.1
url	https://arxiv.org/abs/2402.17101

Similar Items