:: Library Catalog

Bibliographic Details
Main Authors:	Yu, Yunkai, Wang, Yingying, Zheng, Rong
Format:	Preprint
Published:	2025
Subjects:	Computers and Society; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.11713

Similar Items

The Cadaver in the Machine: The Social Practices of Measurement and Validation in Motion Capture Technology
by: Harvey, Emma, et al.
Published: (2024)

ViMU: Benchmarking Video Metaphorical Understanding
by: Li, Qi, et al.
Published: (2026)

RemoCap: Disentangled Representation Learning for Motion Capture
by: Wang, Hongsheng, et al.
Published: (2024)

BLEnD-Vis: Benchmarking Multimodal Cultural Understanding in Vision Language Models
by: Tan, Bryan Chen Zhengyu, et al.
Published: (2025)

Learning Under Low Illumination: A Dataset and Algorithm for Traffic Sign Recognition
by: Mishra, Aditya, et al.
Published: (2025)

Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition
by: Dominguez-Catena, Iris, et al.
Published: (2023)

Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images
by: Goldman, Josh, et al.
Published: (2024)

Text-to-Image Models and Their Representation of People from Different Nationalities Engaging in Activities
by: Alsudais, Abdulkareem
Published: (2025)

Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images
by: Srinivasan, Hansa, et al.
Published: (2024)

Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems
by: Jaiswal, Siddharth D, et al.
Published: (2024)

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units
by: Wang, Youjia, et al.
Published: (2024)

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
by: Wang, Jian, et al.
Published: (2025)

Investigating Disability Representations in Text-to-Image Models
by: Tian, Yang, et al.
Published: (2026)

A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models
by: Sathe, Ashutosh, et al.
Published: (2024)

ERIT Lightweight Multimodal Dataset for Elderly Emotion Recognition and Multimodal Fusion Evaluation
by: Frieske, Rita, et al.
Published: (2024)

FigSIM: A Dataset for Fine-grained Suicide Severity and Figurative Language in Suicide Memes
by: Chen, Liuliu, et al.
Published: (2026)

Sparkle: A Robust and Versatile Representation for Point Cloud based Human Motion Capture
by: Ren, Yiming, et al.
Published: (2026)

Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images
by: Fraser, Kathleen C., et al.
Published: (2024)

A Dataset and Evaluation for Complex 4D Markerless Human Motion Capture
by: Park, Yeeun, et al.
Published: (2026)

On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
by: Modi, Rajat, et al.
Published: (2024)

From Discrete to Continuous: Deep Fair Clustering With Transferable Representations
by: Zhang, Xiang
Published: (2024)

Detecting Visual Triggers in Cannabis Imagery: A CLIP-Based Multi-Labeling Framework with Local-Global Aggregation
by: Lu, Linqi, et al.
Published: (2024)

Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability
by: Caetano, Carlos, et al.
Published: (2025)

A satellite foundation model for improved wealth monitoring
by: Zheng, Zhuo, et al.
Published: (2026)

Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding
by: Elhenawy, Mohammed, et al.
Published: (2025)

Towards Understanding Unsafe Video Generation
by: Pang, Yan, et al.
Published: (2024)

URBAN-SPIN: A street-level bikeability index to inform design implementations in historical city centres
by: Ding, Haining, et al.
Published: (2026)

Ethical Challenges in Computer Vision: Ensuring Privacy and Mitigating Bias in Publicly Available Datasets
by: Tahir, Ghalib Ahmed
Published: (2024)

AI-EDI-SPACE: A Co-designed Dataset for Evaluating the Quality of Public Spaces
by: Gowaikar, Shreeyash, et al.
Published: (2024)

From Drone Imagery to Livability Mapping: AI-powered Environment Perception in Rural China
by: Deng, Weihuan, et al.
Published: (2025)

MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning
by: Zhang, Chenhao, et al.
Published: (2026)

Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
by: Hashmi, Ammarah, et al.
Published: (2024)

Beyond Performance Disparities: A Three-Level Audit of Representational Harm in CelebA
by: Park, Sieun, et al.
Published: (2026)

Can MLLMs Understand the Deep Implication Behind Chinese Images?
by: Zhang, Chenhao, et al.
Published: (2024)

ChaosBench: A Multi-Channel, Physics-Based Benchmark for Subseasonal-to-Seasonal Climate Prediction
by: Nathaniel, Juan, et al.
Published: (2024)

Exploring Disparity-Accuracy Trade-offs in Face Recognition Systems: The Role of Datasets, Architectures, and Loss Functions
by: Jaiswal, Siddharth D, et al.
Published: (2025)

Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model
by: Huang, Lin-Chun, et al.
Published: (2025)

Smiling Women Pitching Down: Auditing Representational and Presentational Gender Biases in Image Generative AI
by: Sun, Luhang, et al.
Published: (2023)

The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures
by: Reich, Christoph, et al.
Published: (2023)

Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!
by: Zou, Zihang, et al.
Published: (2026)