Saved in:
| Main Authors: | Li, Ao, Ling, Yonggen, Lin, Yiyang, Wang, Yuji, Deng, Yong, Tang, Yansong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08921 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
by: Li, Ao, et al.
Published: (2025)
by: Li, Ao, et al.
Published: (2025)
Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining
by: He, Weizhen, et al.
Published: (2025)
by: He, Weizhen, et al.
Published: (2025)
VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation
by: Lian, Ruyi, et al.
Published: (2024)
by: Lian, Ruyi, et al.
Published: (2024)
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
by: Zhu, Yixuan, et al.
Published: (2024)
by: Zhu, Yixuan, et al.
Published: (2024)
Morphology-Aware Interactive Keypoint Estimation
by: Kim, Jinhee, et al.
Published: (2022)
by: Kim, Jinhee, et al.
Published: (2022)
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
by: Wang, Yuji, et al.
Published: (2025)
by: Wang, Yuji, et al.
Published: (2025)
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
by: Wang, Yuji, et al.
Published: (2025)
by: Wang, Yuji, et al.
Published: (2025)
PA-HOI: A Physics-Aware Human and Object Interaction Dataset
by: Wang, Ruiyan, et al.
Published: (2025)
by: Wang, Ruiyan, et al.
Published: (2025)
LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model
by: Wang, Dongkai, et al.
Published: (2024)
by: Wang, Dongkai, et al.
Published: (2024)
Boosting Zero-Shot 3D Style Transfer with 2D Pre-trained Priors
by: Dong, Xin, et al.
Published: (2026)
by: Dong, Xin, et al.
Published: (2026)
Locality-Aware Zero-Shot Human-Object Interaction Detection
by: Kim, Sanghyun, et al.
Published: (2025)
by: Kim, Sanghyun, et al.
Published: (2025)
Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking
by: Meng, Kexin, et al.
Published: (2024)
by: Meng, Kexin, et al.
Published: (2024)
UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction
by: Nilavadi, Nisarga, et al.
Published: (2025)
by: Nilavadi, Nisarga, et al.
Published: (2025)
3D Human-Human Interaction Anomaly Detection
by: Maeda, Shun, et al.
Published: (2025)
by: Maeda, Shun, et al.
Published: (2025)
Disambiguating Monocular Reconstruction of 3D Clothed Human with Spatial-Temporal Transformer
by: Deng, Yong, et al.
Published: (2024)
by: Deng, Yong, et al.
Published: (2024)
Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction
by: Chen, Bolin, et al.
Published: (2025)
by: Chen, Bolin, et al.
Published: (2025)
LLM-Grounded Dynamic Task Planning with Hierarchical Temporal Logic for Human-Aware Multi-Robot Collaboration
by: Hu, Shuyuan, et al.
Published: (2026)
by: Hu, Shuyuan, et al.
Published: (2026)
Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions
by: Khirodkar, Rawal, et al.
Published: (2024)
by: Khirodkar, Rawal, et al.
Published: (2024)
Ultra-Range Gesture Recognition using a Web-Camera in Human-Robot Interaction
by: Bamani, Eran, et al.
Published: (2023)
by: Bamani, Eran, et al.
Published: (2023)
Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation
by: Ahmad, Niaz, et al.
Published: (2025)
by: Ahmad, Niaz, et al.
Published: (2025)
Reconstructing Close Human Interactions from Multiple Views
by: Shuai, Qing, et al.
Published: (2024)
by: Shuai, Qing, et al.
Published: (2024)
Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning
by: Huang, Buzhen, et al.
Published: (2025)
by: Huang, Buzhen, et al.
Published: (2025)
PhysiGen: Integrating Collision-Aware Physical Constraints for High-Fidelity Human-Human Interaction Generation
by: Lei, Nan, et al.
Published: (2026)
by: Lei, Nan, et al.
Published: (2026)
Unified Understanding of Environment, Task, and Human for Human-Robot Interaction in Real-World Environments
by: Yano, Yuga, et al.
Published: (2024)
by: Yano, Yuga, et al.
Published: (2024)
MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints
by: Uguz, Bedirhan, et al.
Published: (2024)
by: Uguz, Bedirhan, et al.
Published: (2024)
Localization-Aware Multi-Scale Representation Learning for Repetitive Action Counting
by: Wang, Sujia, et al.
Published: (2025)
by: Wang, Sujia, et al.
Published: (2025)
WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images
by: Guo, Yansong, et al.
Published: (2025)
by: Guo, Yansong, et al.
Published: (2025)
An Embeddable Implicit IUVD Representation for Part-based 3D Human Surface Reconstruction
by: Li, Baoxing, et al.
Published: (2024)
by: Li, Baoxing, et al.
Published: (2024)
Dynamic Gesture Recognition in Ultra-Range Distance for Effective Human-Robot Interaction
by: Beeri, Eran Bamani, et al.
Published: (2024)
by: Beeri, Eran Bamani, et al.
Published: (2024)
VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos
by: Mao, Aihua, et al.
Published: (2026)
by: Mao, Aihua, et al.
Published: (2026)
UNet-Based Keypoint Regression for 3D Cone Localization in Autonomous Racing
by: Baidachna, Mariia, et al.
Published: (2026)
by: Baidachna, Mariia, et al.
Published: (2026)
VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning
by: Wang, Yuji, et al.
Published: (2025)
by: Wang, Yuji, et al.
Published: (2025)
Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking
by: Zhu, Deyi, et al.
Published: (2026)
by: Zhu, Deyi, et al.
Published: (2026)
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
by: Liu, Xian, et al.
Published: (2023)
by: Liu, Xian, et al.
Published: (2023)
Social 3D Scene Graphs: Modeling Human Actions and Relations for Interactive Service Robots
by: Bartoli, Ermanno, et al.
Published: (2025)
by: Bartoli, Ermanno, et al.
Published: (2025)
Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption
by: Huang, Buzhen, et al.
Published: (2024)
by: Huang, Buzhen, et al.
Published: (2024)
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
by: Lin, Xiao, et al.
Published: (2024)
by: Lin, Xiao, et al.
Published: (2024)
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
by: Sheng, Hualian, et al.
Published: (2024)
by: Sheng, Hualian, et al.
Published: (2024)
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
by: Liu, Yun, et al.
Published: (2024)
by: Liu, Yun, et al.
Published: (2024)
FDDet: Achieving Data-Efficient Food Defect Detection Under Real-World Scenarios
by: Xu, Ruihao, et al.
Published: (2026)
by: Xu, Ruihao, et al.
Published: (2026)
Similar Items
-
ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
by: Li, Ao, et al.
Published: (2025) -
Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining
by: He, Weizhen, et al.
Published: (2025) -
VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation
by: Lian, Ruyi, et al.
Published: (2024) -
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
by: Zhu, Yixuan, et al.
Published: (2024) -
Morphology-Aware Interactive Keypoint Estimation
by: Kim, Jinhee, et al.
Published: (2022)