Saved in:
Bibliographic Details
Main Authors: Aziz, Memoona, Rehman, Umair, Safi, Syed Ali, Abbasi, Amir Zaib
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.12762
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929481043673088
author Aziz, Memoona
Rehman, Umair
Safi, Syed Ali
Abbasi, Amir Zaib
author_facet Aziz, Memoona
Rehman, Umair
Safi, Syed Ali
Abbasi, Amir Zaib
contents The rapid advancements in AI technologies have revolutionized the production of graphical content across various sectors, including entertainment, advertising, and e-commerce. These developments have spurred the need for robust evaluation methods to assess the quality and realism of AI-generated images. To address this, we conducted three studies. First, we introduced and validated a questionnaire called Visual Verity, which measures photorealism, image quality, and text-image alignment. Second, we applied this questionnaire to assess images from AI models (DALL-E2, DALL-E3, GLIDE, Stable Diffusion) and camera-generated images, revealing that camera-generated images excelled in photorealism and text-image alignment, while AI models led in image quality. We also analyzed statistical properties, finding that camera-generated images scored lower in hue, saturation, and brightness. Third, we evaluated computational metrics' alignment with human judgments, identifying MS-SSIM and CLIP as the most consistent with human assessments. Additionally, we proposed the Neural Feature Similarity Score (NFSS) for assessing image quality. Our findings highlight the need for refining computational metrics to better capture human visual perception, thereby enhancing AI-generated content evaluation.
format Preprint
id arxiv_https___arxiv_org_abs_2408_12762
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis
Aziz, Memoona
Rehman, Umair
Safi, Syed Ali
Abbasi, Amir Zaib
Human-Computer Interaction
Artificial Intelligence
The rapid advancements in AI technologies have revolutionized the production of graphical content across various sectors, including entertainment, advertising, and e-commerce. These developments have spurred the need for robust evaluation methods to assess the quality and realism of AI-generated images. To address this, we conducted three studies. First, we introduced and validated a questionnaire called Visual Verity, which measures photorealism, image quality, and text-image alignment. Second, we applied this questionnaire to assess images from AI models (DALL-E2, DALL-E3, GLIDE, Stable Diffusion) and camera-generated images, revealing that camera-generated images excelled in photorealism and text-image alignment, while AI models led in image quality. We also analyzed statistical properties, finding that camera-generated images scored lower in hue, saturation, and brightness. Third, we evaluated computational metrics' alignment with human judgments, identifying MS-SSIM and CLIP as the most consistent with human assessments. Additionally, we proposed the Neural Feature Similarity Score (NFSS) for assessing image quality. Our findings highlight the need for refining computational metrics to better capture human visual perception, thereby enhancing AI-generated content evaluation.
title Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis
topic Human-Computer Interaction
Artificial Intelligence
url https://arxiv.org/abs/2408.12762