:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Ao, Ling, Yonggen, Lin, Yiyang, Wang, Yuji, Deng, Yong, Tang, Yansong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.08921
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
by: Li, Ao, et al.
Published: (2025)

Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining
by: He, Weizhen, et al.
Published: (2025)

VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation
by: Lian, Ruyi, et al.
Published: (2024)

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
by: Zhu, Yixuan, et al.
Published: (2024)

Morphology-Aware Interactive Keypoint Estimation
by: Kim, Jinhee, et al.
Published: (2022)

SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
by: Wang, Yuji, et al.
Published: (2025)

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
by: Wang, Yuji, et al.
Published: (2025)

PA-HOI: A Physics-Aware Human and Object Interaction Dataset
by: Wang, Ruiyan, et al.
Published: (2025)

LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model
by: Wang, Dongkai, et al.
Published: (2024)

Boosting Zero-Shot 3D Style Transfer with 2D Pre-trained Priors
by: Dong, Xin, et al.
Published: (2026)

Locality-Aware Zero-Shot Human-Object Interaction Detection
by: Kim, Sanghyun, et al.
Published: (2025)

Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking
by: Meng, Kexin, et al.
Published: (2024)

UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction
by: Nilavadi, Nisarga, et al.
Published: (2025)

3D Human-Human Interaction Anomaly Detection
by: Maeda, Shun, et al.
Published: (2025)

Disambiguating Monocular Reconstruction of 3D Clothed Human with Spatial-Temporal Transformer
by: Deng, Yong, et al.
Published: (2024)

Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction
by: Chen, Bolin, et al.
Published: (2025)

LLM-Grounded Dynamic Task Planning with Hierarchical Temporal Logic for Human-Aware Multi-Robot Collaboration
by: Hu, Shuyuan, et al.
Published: (2026)

Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions
by: Khirodkar, Rawal, et al.
Published: (2024)

Ultra-Range Gesture Recognition using a Web-Camera in Human-Robot Interaction
by: Bamani, Eran, et al.
Published: (2023)

Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation
by: Ahmad, Niaz, et al.
Published: (2025)

Reconstructing Close Human Interactions from Multiple Views
by: Shuai, Qing, et al.
Published: (2024)

Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning
by: Huang, Buzhen, et al.
Published: (2025)

PhysiGen: Integrating Collision-Aware Physical Constraints for High-Fidelity Human-Human Interaction Generation
by: Lei, Nan, et al.
Published: (2026)

Unified Understanding of Environment, Task, and Human for Human-Robot Interaction in Real-World Environments
by: Yano, Yuga, et al.
Published: (2024)

MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints
by: Uguz, Bedirhan, et al.
Published: (2024)

Localization-Aware Multi-Scale Representation Learning for Repetitive Action Counting
by: Wang, Sujia, et al.
Published: (2025)

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images
by: Guo, Yansong, et al.
Published: (2025)

An Embeddable Implicit IUVD Representation for Part-based 3D Human Surface Reconstruction
by: Li, Baoxing, et al.
Published: (2024)

Dynamic Gesture Recognition in Ultra-Range Distance for Effective Human-Robot Interaction
by: Beeri, Eran Bamani, et al.
Published: (2024)

VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos
by: Mao, Aihua, et al.
Published: (2026)

UNet-Based Keypoint Regression for 3D Cone Localization in Autonomous Racing
by: Baidachna, Mariia, et al.
Published: (2026)

VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning
by: Wang, Yuji, et al.
Published: (2025)

Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking
by: Zhu, Deyi, et al.
Published: (2026)

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
by: Liu, Xian, et al.
Published: (2023)

Social 3D Scene Graphs: Modeling Human Actions and Relations for Interactive Service Robots
by: Bartoli, Ermanno, et al.
Published: (2025)

Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption
by: Huang, Buzhen, et al.
Published: (2024)

Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
by: Lin, Xiao, et al.
Published: (2024)

CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
by: Sheng, Hualian, et al.
Published: (2024)

CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
by: Liu, Yun, et al.
Published: (2024)

FDDet: Achieving Data-Efficient Food Defect Detection Under Real-World Scenarios
by: Xu, Ruihao, et al.
Published: (2026)