Saved in:
| Main Authors: | Yu, Yunkai, Wang, Yingying, Zheng, Rong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.11713 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Cadaver in the Machine: The Social Practices of Measurement and Validation in Motion Capture Technology
by: Harvey, Emma, et al.
Published: (2024)
by: Harvey, Emma, et al.
Published: (2024)
ViMU: Benchmarking Video Metaphorical Understanding
by: Li, Qi, et al.
Published: (2026)
by: Li, Qi, et al.
Published: (2026)
RemoCap: Disentangled Representation Learning for Motion Capture
by: Wang, Hongsheng, et al.
Published: (2024)
by: Wang, Hongsheng, et al.
Published: (2024)
BLEnD-Vis: Benchmarking Multimodal Cultural Understanding in Vision Language Models
by: Tan, Bryan Chen Zhengyu, et al.
Published: (2025)
by: Tan, Bryan Chen Zhengyu, et al.
Published: (2025)
Learning Under Low Illumination: A Dataset and Algorithm for Traffic Sign Recognition
by: Mishra, Aditya, et al.
Published: (2025)
by: Mishra, Aditya, et al.
Published: (2025)
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition
by: Dominguez-Catena, Iris, et al.
Published: (2023)
by: Dominguez-Catena, Iris, et al.
Published: (2023)
Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images
by: Goldman, Josh, et al.
Published: (2024)
by: Goldman, Josh, et al.
Published: (2024)
Text-to-Image Models and Their Representation of People from Different Nationalities Engaging in Activities
by: Alsudais, Abdulkareem
Published: (2025)
by: Alsudais, Abdulkareem
Published: (2025)
Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images
by: Srinivasan, Hansa, et al.
Published: (2024)
by: Srinivasan, Hansa, et al.
Published: (2024)
Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems
by: Jaiswal, Siddharth D, et al.
Published: (2024)
by: Jaiswal, Siddharth D, et al.
Published: (2024)
Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units
by: Wang, Youjia, et al.
Published: (2024)
by: Wang, Youjia, et al.
Published: (2024)
Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
by: Wang, Jian, et al.
Published: (2025)
by: Wang, Jian, et al.
Published: (2025)
Investigating Disability Representations in Text-to-Image Models
by: Tian, Yang, et al.
Published: (2026)
by: Tian, Yang, et al.
Published: (2026)
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models
by: Sathe, Ashutosh, et al.
Published: (2024)
by: Sathe, Ashutosh, et al.
Published: (2024)
ERIT Lightweight Multimodal Dataset for Elderly Emotion Recognition and Multimodal Fusion Evaluation
by: Frieske, Rita, et al.
Published: (2024)
by: Frieske, Rita, et al.
Published: (2024)
FigSIM: A Dataset for Fine-grained Suicide Severity and Figurative Language in Suicide Memes
by: Chen, Liuliu, et al.
Published: (2026)
by: Chen, Liuliu, et al.
Published: (2026)
Sparkle: A Robust and Versatile Representation for Point Cloud based Human Motion Capture
by: Ren, Yiming, et al.
Published: (2026)
by: Ren, Yiming, et al.
Published: (2026)
Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images
by: Fraser, Kathleen C., et al.
Published: (2024)
by: Fraser, Kathleen C., et al.
Published: (2024)
A Dataset and Evaluation for Complex 4D Markerless Human Motion Capture
by: Park, Yeeun, et al.
Published: (2026)
by: Park, Yeeun, et al.
Published: (2026)
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
by: Modi, Rajat, et al.
Published: (2024)
by: Modi, Rajat, et al.
Published: (2024)
From Discrete to Continuous: Deep Fair Clustering With Transferable Representations
by: Zhang, Xiang
Published: (2024)
by: Zhang, Xiang
Published: (2024)
Detecting Visual Triggers in Cannabis Imagery: A CLIP-Based Multi-Labeling Framework with Local-Global Aggregation
by: Lu, Linqi, et al.
Published: (2024)
by: Lu, Linqi, et al.
Published: (2024)
Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability
by: Caetano, Carlos, et al.
Published: (2025)
by: Caetano, Carlos, et al.
Published: (2025)
A satellite foundation model for improved wealth monitoring
by: Zheng, Zhuo, et al.
Published: (2026)
by: Zheng, Zhuo, et al.
Published: (2026)
Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding
by: Elhenawy, Mohammed, et al.
Published: (2025)
by: Elhenawy, Mohammed, et al.
Published: (2025)
Towards Understanding Unsafe Video Generation
by: Pang, Yan, et al.
Published: (2024)
by: Pang, Yan, et al.
Published: (2024)
URBAN-SPIN: A street-level bikeability index to inform design implementations in historical city centres
by: Ding, Haining, et al.
Published: (2026)
by: Ding, Haining, et al.
Published: (2026)
Ethical Challenges in Computer Vision: Ensuring Privacy and Mitigating Bias in Publicly Available Datasets
by: Tahir, Ghalib Ahmed
Published: (2024)
by: Tahir, Ghalib Ahmed
Published: (2024)
AI-EDI-SPACE: A Co-designed Dataset for Evaluating the Quality of Public Spaces
by: Gowaikar, Shreeyash, et al.
Published: (2024)
by: Gowaikar, Shreeyash, et al.
Published: (2024)
From Drone Imagery to Livability Mapping: AI-powered Environment Perception in Rural China
by: Deng, Weihuan, et al.
Published: (2025)
by: Deng, Weihuan, et al.
Published: (2025)
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning
by: Zhang, Chenhao, et al.
Published: (2026)
by: Zhang, Chenhao, et al.
Published: (2026)
Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
by: Hashmi, Ammarah, et al.
Published: (2024)
by: Hashmi, Ammarah, et al.
Published: (2024)
Beyond Performance Disparities: A Three-Level Audit of Representational Harm in CelebA
by: Park, Sieun, et al.
Published: (2026)
by: Park, Sieun, et al.
Published: (2026)
Can MLLMs Understand the Deep Implication Behind Chinese Images?
by: Zhang, Chenhao, et al.
Published: (2024)
by: Zhang, Chenhao, et al.
Published: (2024)
ChaosBench: A Multi-Channel, Physics-Based Benchmark for Subseasonal-to-Seasonal Climate Prediction
by: Nathaniel, Juan, et al.
Published: (2024)
by: Nathaniel, Juan, et al.
Published: (2024)
Exploring Disparity-Accuracy Trade-offs in Face Recognition Systems: The Role of Datasets, Architectures, and Loss Functions
by: Jaiswal, Siddharth D, et al.
Published: (2025)
by: Jaiswal, Siddharth D, et al.
Published: (2025)
Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model
by: Huang, Lin-Chun, et al.
Published: (2025)
by: Huang, Lin-Chun, et al.
Published: (2025)
Smiling Women Pitching Down: Auditing Representational and Presentational Gender Biases in Image Generative AI
by: Sun, Luhang, et al.
Published: (2023)
by: Sun, Luhang, et al.
Published: (2023)
The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures
by: Reich, Christoph, et al.
Published: (2023)
by: Reich, Christoph, et al.
Published: (2023)
Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!
by: Zou, Zihang, et al.
Published: (2026)
by: Zou, Zihang, et al.
Published: (2026)
Similar Items
-
The Cadaver in the Machine: The Social Practices of Measurement and Validation in Motion Capture Technology
by: Harvey, Emma, et al.
Published: (2024) -
ViMU: Benchmarking Video Metaphorical Understanding
by: Li, Qi, et al.
Published: (2026) -
RemoCap: Disentangled Representation Learning for Motion Capture
by: Wang, Hongsheng, et al.
Published: (2024) -
BLEnD-Vis: Benchmarking Multimodal Cultural Understanding in Vision Language Models
by: Tan, Bryan Chen Zhengyu, et al.
Published: (2025) -
Learning Under Low Illumination: A Dataset and Algorithm for Traffic Sign Recognition
by: Mishra, Aditya, et al.
Published: (2025)