Saved in:
| Main Authors: | Qian, Chen, Li, Danyang, Yu, Xinran, Yang, Zheng, Ma, Qiang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.12610 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
edgeVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer
by: Qian, Chen, et al.
Published: (2025)
by: Qian, Chen, et al.
Published: (2025)
SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass
by: Qian, Chen, et al.
Published: (2026)
by: Qian, Chen, et al.
Published: (2026)
FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments
by: Xue, Aoru, et al.
Published: (2024)
by: Xue, Aoru, et al.
Published: (2024)
CapHuman: Capture Your Moments in Parallel Universes
by: Liang, Chao, et al.
Published: (2024)
by: Liang, Chao, et al.
Published: (2024)
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
by: Gong, Kehong, et al.
Published: (2026)
by: Gong, Kehong, et al.
Published: (2026)
Mesquite MoCap: Democratizing Real-Time Motion Capture with Affordable, Bodyworn IoT Sensors and WebXR SLAM
by: Vanani, Poojan, et al.
Published: (2025)
by: Vanani, Poojan, et al.
Published: (2025)
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos
by: Gong, Kehong, et al.
Published: (2025)
by: Gong, Kehong, et al.
Published: (2025)
RemoCap: Disentangled Representation Learning for Motion Capture
by: Wang, Hongsheng, et al.
Published: (2024)
by: Wang, Hongsheng, et al.
Published: (2024)
RoMo: A Robust Solver for Full-body Unlabeled Optical Motion Capture
by: Pan, Xiaoyu, et al.
Published: (2024)
by: Pan, Xiaoyu, et al.
Published: (2024)
OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking
by: Gao, Jianjun, et al.
Published: (2023)
by: Gao, Jianjun, et al.
Published: (2023)
Comparison of Kinematics and Kinetics Between OpenCap and a Marker-Based Motion Capture System in Cycling
by: Kakavand, Reza, et al.
Published: (2024)
by: Kakavand, Reza, et al.
Published: (2024)
Seeing through Unclear Glass: Occlusion Removal with One Shot
by: Li, Qiang, et al.
Published: (2025)
by: Li, Qiang, et al.
Published: (2025)
Re$^2$MoGen: Open-Vocabulary Motion Generation via LLM Reasoning and Physics-Aware Refinement
by: Zheng, Jiakun, et al.
Published: (2026)
by: Zheng, Jiakun, et al.
Published: (2026)
Rethinking Genomic Modeling Through Optical Character Recognition
by: Xiang, Hongxin, et al.
Published: (2026)
by: Xiang, Hongxin, et al.
Published: (2026)
MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond
by: Ren, Shenghao, et al.
Published: (2025)
by: Ren, Shenghao, et al.
Published: (2025)
CapsFusion: Rethinking Image-Text Data at Scale
by: Yu, Qiying, et al.
Published: (2023)
by: Yu, Qiying, et al.
Published: (2023)
ELMO: Enhanced Real-time LiDAR Motion Capture through Upsampling
by: Jang, Deok-Kyeong, et al.
Published: (2024)
by: Jang, Deok-Kyeong, et al.
Published: (2024)
BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion
by: Gao, Xinyu, et al.
Published: (2026)
by: Gao, Xinyu, et al.
Published: (2026)
Observation-Aligned Mask Priors for Learning Physical Dynamics from Authentic Occlusions
by: Ma, Chiyuan, et al.
Published: (2026)
by: Ma, Chiyuan, et al.
Published: (2026)
DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera
by: Pan, Shaohua, et al.
Published: (2025)
by: Pan, Shaohua, et al.
Published: (2025)
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
by: Huang, Wenbo, et al.
Published: (2024)
by: Huang, Wenbo, et al.
Published: (2024)
Transformer-Based Framework for Motion Capture Denoising and Anomaly Detection in Medical Rehabilitation
by: Cai, Yeming, et al.
Published: (2025)
by: Cai, Yeming, et al.
Published: (2025)
FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision
by: Wu, Zekai, et al.
Published: (2026)
by: Wu, Zekai, et al.
Published: (2026)
OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3
by: Zhang, Xu, et al.
Published: (2026)
by: Zhang, Xu, et al.
Published: (2026)
FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models
by: Zhang, Zhikai, et al.
Published: (2024)
by: Zhang, Zhikai, et al.
Published: (2024)
Masked Modeling for Human Motion Recovery Under Occlusions
by: Qian, Zhiyin, et al.
Published: (2026)
by: Qian, Zhiyin, et al.
Published: (2026)
CapGeo: A Caption-Assisted Approach to Geometric Reasoning
by: Li, Yuying, et al.
Published: (2025)
by: Li, Yuying, et al.
Published: (2025)
Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
by: Yang, Xi, et al.
Published: (2023)
by: Yang, Xi, et al.
Published: (2023)
CapRecover: A Cross-Modality Feature Inversion Attack Framework on Vision Language Models
by: Xiu, Kedong, et al.
Published: (2025)
by: Xiu, Kedong, et al.
Published: (2025)
Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach
by: Zong, Chen-Chen, et al.
Published: (2025)
by: Zong, Chen-Chen, et al.
Published: (2025)
OpenT2M: No-frill Motion Generation with Open-source,Large-scale, High-quality Data
by: Cao, Bin, et al.
Published: (2026)
by: Cao, Bin, et al.
Published: (2026)
Rethink Predicting the Optical Flow with the Kinetics Perspective
by: Cheng, Yuhao, et al.
Published: (2024)
by: Cheng, Yuhao, et al.
Published: (2024)
GameGen-X: Interactive Open-world Game Video Generation
by: Che, Haoxuan, et al.
Published: (2024)
by: Che, Haoxuan, et al.
Published: (2024)
HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment
by: Ma, Dianbo, et al.
Published: (2024)
by: Ma, Dianbo, et al.
Published: (2024)
Behave Your Motion: Habit-preserved Cross-category Animal Motion Transfer
by: Zhang, Zhimin, et al.
Published: (2025)
by: Zhang, Zhimin, et al.
Published: (2025)
Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
by: Yang, Nianzu, et al.
Published: (2025)
by: Yang, Nianzu, et al.
Published: (2025)
AnyMo: Scaling Any-Modality Conditional Motion Generation with Masked Modeling
by: Li, Yiheng, et al.
Published: (2026)
by: Li, Yiheng, et al.
Published: (2026)
Occlusion-Aware Diffusion Model for Pedestrian Intention Prediction
by: Liu, Yu, et al.
Published: (2025)
by: Liu, Yu, et al.
Published: (2025)
Motion Capture from Inertial and Vision Sensors
by: Chen, Xiaodong, et al.
Published: (2024)
by: Chen, Xiaodong, et al.
Published: (2024)
Occlusion-Ordered Semantic Instance Segmentation
by: Baselizadeh, Soroosh, et al.
Published: (2025)
by: Baselizadeh, Soroosh, et al.
Published: (2025)
Similar Items
-
edgeVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer
by: Qian, Chen, et al.
Published: (2025) -
SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass
by: Qian, Chen, et al.
Published: (2026) -
FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments
by: Xue, Aoru, et al.
Published: (2024) -
CapHuman: Capture Your Moments in Parallel Universes
by: Liang, Chao, et al.
Published: (2024) -
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
by: Gong, Kehong, et al.
Published: (2026)