Saved in:
| Main Authors: | Wang, Chengfeng, Zhai, Wei, Yang, Yuhang, Cao, Yang, Zha, Zhengjun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.06575 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
by: Yang, Yuhang, et al.
Published: (2024)
by: Yang, Yuhang, et al.
Published: (2024)
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
by: Wan, Zengyu, et al.
Published: (2025)
by: Wan, Zengyu, et al.
Published: (2025)
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
by: Yang, Yuhang, et al.
Published: (2023)
by: Yang, Yuhang, et al.
Published: (2023)
Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024)
by: Liu, Cuiyu, et al.
Published: (2024)
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
by: Shao, Yawen, et al.
Published: (2024)
by: Shao, Yawen, et al.
Published: (2024)
HERO: Human Reaction Generation from Videos
by: Yu, Chengjun, et al.
Published: (2025)
by: Yu, Chengjun, et al.
Published: (2025)
End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction
by: Zhang, Haoyu, et al.
Published: (2026)
by: Zhang, Haoyu, et al.
Published: (2026)
SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets
by: Yang, Yuhang, et al.
Published: (2025)
by: Yang, Yuhang, et al.
Published: (2025)
TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions
by: Han, Guangyi, et al.
Published: (2025)
by: Han, Guangyi, et al.
Published: (2025)
Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself
by: Dai, Yuhang, et al.
Published: (2026)
by: Dai, Yuhang, et al.
Published: (2026)
Constructing a 3D Scene from a Single Image
by: Zheng, Kaizhi, et al.
Published: (2025)
by: Zheng, Kaizhi, et al.
Published: (2025)
Analyzing Image Beyond Visual Aspect: Image Emotion Classification via Multiple-Affective Captioning
by: Zhou, Zibo, et al.
Published: (2025)
by: Zhou, Zibo, et al.
Published: (2025)
PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments
by: Liu, Yueting, et al.
Published: (2025)
by: Liu, Yueting, et al.
Published: (2025)
EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting
by: Liao, Bohao, et al.
Published: (2024)
by: Liao, Bohao, et al.
Published: (2024)
Event Stream Filtering via Probability Flux Estimation
by: Chen, Jinze, et al.
Published: (2025)
by: Chen, Jinze, et al.
Published: (2025)
LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment
by: Ren, Yiming, et al.
Published: (2024)
by: Ren, Yiming, et al.
Published: (2024)
PhySIC: Physically Plausible 3D Human-Scene Interaction and Contact from a Single Image
by: Muralidhar, Pradyumna Yalandur, et al.
Published: (2025)
by: Muralidhar, Pradyumna Yalandur, et al.
Published: (2025)
Reg3D: Reconstructive Geometry Instruction Tuning for 3D Scene Understanding
by: Zheng, Hongpei, et al.
Published: (2025)
by: Zheng, Hongpei, et al.
Published: (2025)
DecoDINO: 3D Human-Scene Contact Prediction with Semantic Classification
by: Bierling, Lukas, et al.
Published: (2025)
by: Bierling, Lukas, et al.
Published: (2025)
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
by: Lu, Fan, et al.
Published: (2024)
by: Lu, Fan, et al.
Published: (2024)
GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting
by: Shi, Mingyu, et al.
Published: (2026)
by: Shi, Mingyu, et al.
Published: (2026)
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
by: Pang, Yatian, et al.
Published: (2024)
by: Pang, Yatian, et al.
Published: (2024)
SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
by: Chen, Wenyue, et al.
Published: (2025)
by: Chen, Wenyue, et al.
Published: (2025)
LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images
by: Yang, Ming-Jia, et al.
Published: (2025)
by: Yang, Ming-Jia, et al.
Published: (2025)
3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image
by: Yin, Ze-Xin, et al.
Published: (2026)
by: Yin, Ze-Xin, et al.
Published: (2026)
Gloria: Consistent Character Video Generation via Content Anchors
by: Yang, Yuhang, et al.
Published: (2026)
by: Yang, Yuhang, et al.
Published: (2026)
Pi-HOC: Pairwise 3D Human-Object Contact Estimation
by: Chittupalli, Sravan, et al.
Published: (2026)
by: Chittupalli, Sravan, et al.
Published: (2026)
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
by: Xiao, Zeqi, et al.
Published: (2023)
by: Xiao, Zeqi, et al.
Published: (2023)
Unbiased Gradient Estimation for Event Binning via Functional Backpropagation
by: Chen, Jinze, et al.
Published: (2026)
by: Chen, Jinze, et al.
Published: (2026)
Visual-Geometric Collaborative Guidance for Affordance Learning
by: Luo, Hongchen, et al.
Published: (2024)
by: Luo, Hongchen, et al.
Published: (2024)
Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction
by: Yin, Xiaoting, et al.
Published: (2025)
by: Yin, Xiaoting, et al.
Published: (2025)
EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning
by: Yu, Chengjun, et al.
Published: (2026)
by: Yu, Chengjun, et al.
Published: (2026)
Iterative Inference-time Scaling with Adaptive Frequency Steering for Image Super-Resolution
by: Zhang, Hexin, et al.
Published: (2025)
by: Zhang, Hexin, et al.
Published: (2025)
Geometry-as-context: Modulating Explicit 3D in Scene-consistent Video Generation to Geometry Context
by: Hu, JiaKui, et al.
Published: (2026)
by: Hu, JiaKui, et al.
Published: (2026)
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
by: Mullen Jr, James F., et al.
Published: (2022)
by: Mullen Jr, James F., et al.
Published: (2022)
InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes
by: Yang, Zesong, et al.
Published: (2025)
by: Yang, Zesong, et al.
Published: (2025)
Kinematics-based 3D Human-Object Interaction Reconstruction from Single View
by: Chen, Yuhang, et al.
Published: (2024)
by: Chen, Yuhang, et al.
Published: (2024)
Generating Human Motion in 3D Scenes from Text Descriptions
by: Cen, Zhi, et al.
Published: (2024)
by: Cen, Zhi, et al.
Published: (2024)
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image
by: Yao, Kaixin, et al.
Published: (2025)
by: Yao, Kaixin, et al.
Published: (2025)
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
by: Fu, Xiao, et al.
Published: (2024)
by: Fu, Xiao, et al.
Published: (2024)
Similar Items
-
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
by: Yang, Yuhang, et al.
Published: (2024) -
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
by: Wan, Zengyu, et al.
Published: (2025) -
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
by: Yang, Yuhang, et al.
Published: (2023) -
Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024) -
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
by: Shao, Yawen, et al.
Published: (2024)