:: Library Catalog

Bibliographic Details
Main Authors:	Qian, Chen; Li, Danyang; Yu, Xinran; Yang, Zheng; Ma, Qiang
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.12610

Similar Items

edgeVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer
by: Qian, Chen, et al.
Published: (2025)

SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass
by: Qian, Chen, et al.
Published: (2026)

FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments
by: Xue, Aoru, et al.
Published: (2024)

CapHuman: Capture Your Moments in Parallel Universes
by: Liang, Chao, et al.
Published: (2024)

MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
by: Gong, Kehong, et al.
Published: (2026)

Mesquite MoCap: Democratizing Real-Time Motion Capture with Affordable, Bodyworn IoT Sensors and WebXR SLAM
by: Vanani, Poojan, et al.
Published: (2025)

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos
by: Gong, Kehong, et al.
Published: (2025)

RemoCap: Disentangled Representation Learning for Motion Capture
by: Wang, Hongsheng, et al.
Published: (2024)

RoMo: A Robust Solver for Full-body Unlabeled Optical Motion Capture
by: Pan, Xiaoyu, et al.
Published: (2024)

OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking
by: Gao, Jianjun, et al.
Published: (2023)

Comparison of Kinematics and Kinetics Between OpenCap and a Marker-Based Motion Capture System in Cycling
by: Kakavand, Reza, et al.
Published: (2024)

Seeing through Unclear Glass: Occlusion Removal with One Shot
by: Li, Qiang, et al.
Published: (2025)

Re$^2$MoGen: Open-Vocabulary Motion Generation via LLM Reasoning and Physics-Aware Refinement
by: Zheng, Jiakun, et al.
Published: (2026)

Rethinking Genomic Modeling Through Optical Character Recognition
by: Xiang, Hongxin, et al.
Published: (2026)

MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond
by: Ren, Shenghao, et al.
Published: (2025)

CapsFusion: Rethinking Image-Text Data at Scale
by: Yu, Qiying, et al.
Published: (2023)

ELMO: Enhanced Real-time LiDAR Motion Capture through Upsampling
by: Jang, Deok-Kyeong, et al.
Published: (2024)

BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion
by: Gao, Xinyu, et al.
Published: (2026)

Observation-Aligned Mask Priors for Learning Physical Dynamics from Authentic Occlusions
by: Ma, Chiyuan, et al.
Published: (2026)

DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera
by: Pan, Shaohua, et al.
Published: (2025)

SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
by: Huang, Wenbo, et al.
Published: (2024)

Transformer-Based Framework for Motion Capture Denoising and Anomaly Detection in Medical Rehabilitation
by: Cai, Yeming, et al.
Published: (2025)

FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision
by: Wu, Zekai, et al.
Published: (2026)

OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3
by: Zhang, Xu, et al.
Published: (2026)

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models
by: Zhang, Zhikai, et al.
Published: (2024)

Masked Modeling for Human Motion Recovery Under Occlusions
by: Qian, Zhiyin, et al.
Published: (2026)

CapGeo: A Caption-Assisted Approach to Geometric Reasoning
by: Li, Yuying, et al.
Published: (2025)

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
by: Yang, Xi, et al.
Published: (2023)

CapRecover: A Cross-Modality Feature Inversion Attack Framework on Vision Language Models
by: Xiu, Kedong, et al.
Published: (2025)

Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach
by: Zong, Chen-Chen, et al.
Published: (2025)

OpenT2M: No-frill Motion Generation with Open-source, Large-scale, High-quality Data
by: Cao, Bin, et al.
Published: (2026)

Rethink Predicting the Optical Flow with the Kinetics Perspective
by: Cheng, Yuhao, et al.
Published: (2024)

GameGen-X: Interactive Open-world Game Video Generation
by: Che, Haoxuan, et al.
Published: (2024)

HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment
by: Ma, Dianbo, et al.
Published: (2024)

Behave Your Motion: Habit-preserved Cross-category Animal Motion Transfer
by: Zhang, Zhimin, et al.
Published: (2025)

Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
by: Yang, Nianzu, et al.
Published: (2025)

AnyMo: Scaling Any-Modality Conditional Motion Generation with Masked Modeling
by: Li, Yiheng, et al.
Published: (2026)

Occlusion-Aware Diffusion Model for Pedestrian Intention Prediction
by: Liu, Yu, et al.
Published: (2025)

Motion Capture from Inertial and Vision Sensors
by: Chen, Xiaodong, et al.
Published: (2024)

Occlusion-Ordered Semantic Instance Segmentation
by: Baselizadeh, Soroosh, et al.
Published: (2025)