:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zuo, Xiaoye, Athanasiou, Nikos, Delmas, Ginger, Huang, Yiming, Fu, Xingyu, Liu, Lingjie
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Human-Computer Interaction
Online Access:	https://arxiv.org/abs/2508.07501
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Vid2Coach: Transforming How-To Videos into Task Assistants
by: Huh, Mina, et al.
Published: (2025)

Jenga Stacking Based on 6D Pose Estimation for Architectural Form Finding Process
by: Huang, Zixun
Published: (2023)

AIris: An AI-powered Wearable Assistive Device for the Visually Impaired
by: Brilli, Dionysia Danai, et al.
Published: (2024)

Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
by: Palmero, Cristina, et al.
Published: (2023)

Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs
by: Bao, Yiming, et al.
Published: (2024)

Towards Consumer-Grade Cybersickness Prediction: Multi-Model Alignment for Real-Time Vision-Only Inference
by: Zhu, Yitong, et al.
Published: (2025)

Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions
by: Bian, Tongfei, et al.
Published: (2024)

EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision
by: Zhao, Yiming, et al.
Published: (2024)

SuDA: Support-based Domain Adaptation for Sim2Real Motion Capture with Flexible Sensors
by: Fang, Jiawei, et al.
Published: (2024)

CoEditor++: Instruction-based Visual Editing via Cognitive Reasoning
by: Ni, Minheng, et al.
Published: (2026)

MILE: A Mechanically Isomorphic Exoskeleton Data Collection System with Fingertip Visuotactile Sensing for Dexterous Manipulation
by: Du, Jinda, et al.
Published: (2025)

A Survey on Drowsiness Detection -- Modern Applications and Methods
by: Fu, Biying, et al.
Published: (2024)

Towards user-centered interactive medical image segmentation in VR with an assistive AI agent
by: Spiegler, Pascal, et al.
Published: (2025)

Low Latency Gaze Tracking via Latent Optical Sensing
by: Zheng, Yidan, et al.
Published: (2026)

Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation
by: Jia, Songming, et al.
Published: (2026)

ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents
by: Liang, Hao, et al.
Published: (2025)

Multi-Masked Querying Network for Robust Emotion Recognition from Incomplete Multi-Modal Physiological Signals
by: Xu, Geng-Xin, et al.
Published: (2025)

WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users
by: Huang, William, et al.
Published: (2024)

A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality
by: Lien, Wei-Hsiang, et al.
Published: (2024)

Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up Questions
by: Cheng, Ziming, et al.
Published: (2025)

FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs
by: Chen, Haodong, et al.
Published: (2024)

MAGE: A Multi-task Architecture for Gaze Estimation with an Efficient Calibration Module
by: Huang, Haoming, et al.
Published: (2025)

Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
by: Wu, Meiqi, et al.
Published: (2024)

AttributionScanner: A Visual Analytics System for Model Validation with Metadata-Free Slice Finding
by: Xuan, Xiwei, et al.
Published: (2024)

3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications
by: Corral-Soto, Eduardo R., et al.
Published: (2024)

Motion Sickness Modeling with Visual Vertical Estimation and Its Application to Autonomous Personal Mobility Vehicles
by: Liu, Hailong, et al.
Published: (2022)

Self-Supervised Continuous Colormap Recovery from a 2D Scalar Field Visualization without a Legend
by: Liu, Hongxu, et al.
Published: (2025)

Robustness-enhanced Myoelectric Control with GAN-based Open-set Recognition
by: Wang, Cheng, et al.
Published: (2024)

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant
by: Huang, Yifei, et al.
Published: (2025)

Extend Your Horizon: A Device-Agnostic Surgical Tool Tracking Framework with Multi-View Optimization for Augmented Reality
by: Zhang, Jiaming, et al.
Published: (2026)

VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation
by: Pan, Bo, et al.
Published: (2025)

Shu Dao: A Calligraphy Score Framework Linking Calligraphy, Music, and Performance
by: Huang, Lican
Published: (2026)

Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification
by: Liu, Pengkun, et al.
Published: (2024)

OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration
by: Duan, Junwen, et al.
Published: (2025)

Computational Scaffolding of Composition, Value, and Color for Disciplined Drawing
by: Ma, Jiaju, et al.
Published: (2025)

Bridging Text and Image for Artist Style Transfer via Contrastive Learning
by: Liu, Zhi-Song, et al.
Published: (2024)

See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI
by: Liu, Yulong, et al.
Published: (2024)

SimVecVis: A Dataset for Enhancing MLLMs in Visualization Understanding
by: Liu, Can, et al.
Published: (2025)

QueryCraft: Transformer-Guided Query Initialization for Enhanced Human-Object Interaction Detection
by: Wang, Yuxiao, et al.
Published: (2025)

EduGage: Methods and Dataset for Sensor-Based Momentary Assessment of Engagement in Self-Guided Video Learning
by: Leng, Zikang, et al.
Published: (2026)