| Main Authors: | Zuo, Xiaoye, Athanasiou, Nikos, Delmas, Ginger, Huang, Yiming, Fu, Xingyu, Liu, Lingjie |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.07501 |
Similar Items
Vid2Coach: Transforming How-To Videos into Task Assistants
by: Huh, Mina, et al.
Published: (2025)
Jenga Stacking Based on 6D Pose Estimation for Architectural Form Finding Process
by: Huang, Zixun
Published: (2023)
AIris: An AI-powered Wearable Assistive Device for the Visually Impaired
by: Brilli, Dionysia Danai, et al.
Published: (2024)
Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
by: Palmero, Cristina, et al.
Published: (2023)
Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs
by: Bao, Yiming, et al.
Published: (2024)
Towards Consumer-Grade Cybersickness Prediction: Multi-Model Alignment for Real-Time Vision-Only Inference
by: Zhu, Yitong, et al.
Published: (2025)
Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions
by: Bian, Tongfei, et al.
Published: (2024)
EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision
by: Zhao, Yiming, et al.
Published: (2024)
SuDA: Support-based Domain Adaptation for Sim2Real Motion Capture with Flexible Sensors
by: Fang, Jiawei, et al.
Published: (2024)
CoEditor++: Instruction-based Visual Editing via Cognitive Reasoning
by: Ni, Minheng, et al.
Published: (2026)
MILE: A Mechanically Isomorphic Exoskeleton Data Collection System with Fingertip Visuotactile Sensing for Dexterous Manipulation
by: Du, Jinda, et al.
Published: (2025)
A Survey on Drowsiness Detection -- Modern Applications and Methods
by: Fu, Biying, et al.
Published: (2024)
Towards user-centered interactive medical image segmentation in VR with an assistive AI agent
by: Spiegler, Pascal, et al.
Published: (2025)
Low Latency Gaze Tracking via Latent Optical Sensing
by: Zheng, Yidan, et al.
Published: (2026)
Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation
by: Jia, Songming, et al.
Published: (2026)
ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents
by: Liang, Hao, et al.
Published: (2025)
Multi-Masked Querying Network for Robust Emotion Recognition from Incomplete Multi-Modal Physiological Signals
by: Xu, Geng-Xin, et al.
Published: (2025)
WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users
by: Huang, William, et al.
Published: (2024)
A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality
by: Lien, Wei-Hsiang, et al.
Published: (2024)
Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up Questions
by: Cheng, Ziming, et al.
Published: (2025)
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs
by: Chen, Haodong, et al.
Published: (2024)
MAGE: A Multi-task Architecture for Gaze Estimation with an Efficient Calibration Module
by: Huang, Haoming, et al.
Published: (2025)
Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
by: Wu, Meiqi, et al.
Published: (2024)
AttributionScanner: A Visual Analytics System for Model Validation with Metadata-Free Slice Finding
by: Xuan, Xiwei, et al.
Published: (2024)
3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications
by: Corral-Soto, Eduardo R., et al.
Published: (2024)
Motion Sickness Modeling with Visual Vertical Estimation and Its Application to Autonomous Personal Mobility Vehicles
by: Liu, Hailong, et al.
Published: (2022)
Self-Supervised Continuous Colormap Recovery from a 2D Scalar Field Visualization without a Legend
by: Liu, Hongxu, et al.
Published: (2025)
Robustness-enhanced Myoelectric Control with GAN-based Open-set Recognition
by: Wang, Cheng, et al.
Published: (2024)
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant
by: Huang, Yifei, et al.
Published: (2025)
Extend Your Horizon: A Device-Agnostic Surgical Tool Tracking Framework with Multi-View Optimization for Augmented Reality
by: Zhang, Jiaming, et al.
Published: (2026)
VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation
by: Pan, Bo, et al.
Published: (2025)
Shu Dao: A Calligraphy Score Framework Linking Calligraphy, Music, and Performance
by: Huang, Lican
Published: (2026)
Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification
by: Liu, Pengkun, et al.
Published: (2024)
OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration
by: Duan, Junwen, et al.
Published: (2025)
Computational Scaffolding of Composition, Value, and Color for Disciplined Drawing
by: Ma, Jiaju, et al.
Published: (2025)
Bridging Text and Image for Artist Style Transfer via Contrastive Learning
by: Liu, Zhi-Song, et al.
Published: (2024)
See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI
by: Liu, Yulong, et al.
Published: (2024)
SimVecVis: A Dataset for Enhancing MLLMs in Visualization Understanding
by: Liu, Can, et al.
Published: (2025)
QueryCraft: Transformer-Guided Query Initialization for Enhanced Human-Object Interaction Detection
by: Wang, Yuxiao, et al.
Published: (2025)
EduGage: Methods and Dataset for Sensor-Based Momentary Assessment of Engagement in Self-Guided Video Learning
by: Leng, Zikang, et al.
Published: (2026)