:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Chengfeng, Zhai, Wei, Yang, Yuhang, Cao, Yang, Zha, Zhengjun
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2505.06575
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
by: Yang, Yuhang, et al.
Published: (2024)

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
by: Wan, Zengyu, et al.
Published: (2025)

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
by: Yang, Yuhang, et al.
Published: (2023)

Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024)

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
by: Shao, Yawen, et al.
Published: (2024)

HERO: Human Reaction Generation from Videos
by: Yu, Chengjun, et al.
Published: (2025)

End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction
by: Zhang, Haoyu, et al.
Published: (2026)

SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets
by: Yang, Yuhang, et al.
Published: (2025)

TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions
by: Han, Guangyi, et al.
Published: (2025)

Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself
by: Dai, Yuhang, et al.
Published: (2026)

Constructing a 3D Scene from a Single Image
by: Zheng, Kaizhi, et al.
Published: (2025)

Analyzing Image Beyond Visual Aspect: Image Emotion Classification via Multiple-Affective Captioning
by: Zhou, Zibo, et al.
Published: (2025)

PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments
by: Liu, Yueting, et al.
Published: (2025)

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting
by: Liao, Bohao, et al.
Published: (2024)

Event Stream Filtering via Probability Flux Estimation
by: Chen, Jinze, et al.
Published: (2025)

LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment
by: Ren, Yiming, et al.
Published: (2024)

PhySIC: Physically Plausible 3D Human-Scene Interaction and Contact from a Single Image
by: Muralidhar, Pradyumna Yalandur, et al.
Published: (2025)

Reg3D: Reconstructive Geometry Instruction Tuning for 3D Scene Understanding
by: Zheng, Hongpei, et al.
Published: (2025)

DecoDINO: 3D Human-Scene Contact Prediction with Semantic Classification
by: Bierling, Lukas, et al.
Published: (2025)

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
by: Lu, Fan, et al.
Published: (2024)

GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting
by: Shi, Mingyu, et al.
Published: (2026)

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
by: Pang, Yatian, et al.
Published: (2024)

SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
by: Chen, Wenyue, et al.
Published: (2025)

LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images
by: Yang, Ming-Jia, et al.
Published: (2025)

3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image
by: Yin, Ze-Xin, et al.
Published: (2026)

Gloria: Consistent Character Video Generation via Content Anchors
by: Yang, Yuhang, et al.
Published: (2026)

Pi-HOC: Pairwise 3D Human-Object Contact Estimation
by: Chittupalli, Sravan, et al.
Published: (2026)

Unified Human-Scene Interaction via Prompted Chain-of-Contacts
by: Xiao, Zeqi, et al.
Published: (2023)

Unbiased Gradient Estimation for Event Binning via Functional Backpropagation
by: Chen, Jinze, et al.
Published: (2026)

Visual-Geometric Collaborative Guidance for Affordance Learning
by: Luo, Hongchen, et al.
Published: (2024)

Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction
by: Yin, Xiaoting, et al.
Published: (2025)

EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning
by: Yu, Chengjun, et al.
Published: (2026)

Iterative Inference-time Scaling with Adaptive Frequency Steering for Image Super-Resolution
by: Zhang, Hexin, et al.
Published: (2025)

Geometry-as-context: Modulating Explicit 3D in Scene-consistent Video Generation to Geometry Context
by: Hu, JiaKui, et al.
Published: (2026)

Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
by: Mullen Jr, James F., et al.
Published: (2022)

InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes
by: Yang, Zesong, et al.
Published: (2025)

Kinematics-based 3D Human-Object Interaction Reconstruction from Single View
by: Chen, Yuhang, et al.
Published: (2024)

Generating Human Motion in 3D Scenes from Text Descriptions
by: Cen, Zhi, et al.
Published: (2024)

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image
by: Yao, Kaixin, et al.
Published: (2025)

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
by: Fu, Xiao, et al.
Published: (2024)