| Main Authors: | Zou, Yude; Gong, Junji; Gao, Xing; Li, Zixuan; Chen, Tianxing; Zheng, Guanjie |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.04843 |
Similar Items
HOSIG: Full-Body Human-Object-Scene Interaction Generation with Hierarchical Scene Perception
by: Yao, Wei, et al.
Published: (2025)
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning
by: Chen, Ziwei, et al.
Published: (2025)
InfScene-SR: Arbitrary-Size Image Super-Resolution via Iterative Joint-Denoising
by: Sun, Shoukun, et al.
Published: (2026)
InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects
by: Cai, Xinhao, et al.
Published: (2025)
Dynamic Worlds, Dynamic Humans: Generating Virtual Human-Scene Interaction Motion in Dynamic Scenes
by: Wang, Yin, et al.
Published: (2026)
Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement
by: Yuan, Zheng, et al.
Published: (2024)
Asset-Driven Sematic Reconstruction of Dynamic Scene with Multi-Human-Object Interactions
by: Biswas, Sandika, et al.
Published: (2025)
Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
by: Fei, Yuanchen, et al.
Published: (2026)
DFIR-DETR: Frequency-Domain Iterative Refinement and Dynamic Feature Aggregation for Small Object Detection
by: Gao, Bo, et al.
Published: (2025)
Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion
by: Guzov, Vladimir, et al.
Published: (2022)
OnlineHOI: Towards Online Human-Object Interaction Generation and Perception
by: Ji, Yihong, et al.
Published: (2025)
InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions
by: Wen, Liangjian, et al.
Published: (2025)
UniHM: Universal Human Motion Generation with Object Interactions in Indoor Scenes
by: Geng, Zichen, et al.
Published: (2025)
TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction
by: Huang, Yiyao, et al.
Published: (2025)
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions
by: Phatak, Mrunmai Vivek, et al.
Published: (2025)
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
by: Wang, Miaowei, et al.
Published: (2025)
ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation
by: Li, Hongjie, et al.
Published: (2024)
Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement
by: Liu, Rui
Published: (2025)
Scaling Up Dynamic Human-Scene Interaction Modeling
by: Jiang, Nan, et al.
Published: (2024)
InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images
by: Li, Wuzhou, et al.
Published: (2024)
Single-View Scene Point Cloud Human Grasp Generation
by: Wang, Yan-Kang, et al.
Published: (2024)
InfVSR: Toward Consistency-Driven Streaming Generative Video Super-Resolution
by: Zhang, Ziqing, et al.
Published: (2025)
Insights from Visual Cognition: Understanding Human Action Dynamics with Overall Glance and Refined Gaze Transformer
by: Xing, Bohao, et al.
Published: (2026)
InfMAE: A Foundation Model in the Infrared Modality
by: Liu, Fangcen, et al.
Published: (2024)
Composing People Together: Iterative Pose-Image Generation for Multi-Person Interaction Scenes
by: Peng, Wenxuan, et al.
Published: (2026)
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
by: Han, Tao, et al.
Published: (2025)
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
by: He, Xu, et al.
Published: (2024)
Generating Human Interaction Motions in Scenes with Text Control
by: Yi, Hongwei, et al.
Published: (2024)
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
by: Liu, Yun, et al.
Published: (2024)
InterPhys: Physics-aware Human Motion Synthesis in a Dynamic Scene
by: Xing, Chaoyue, et al.
Published: (2026)
GRAFT: Geometric Refinement and Fitting Transformer for Human Scene Reconstruction
by: YM, Pradyumna, et al.
Published: (2026)
GenHSI: Controllable Generation of Human-Scene Interaction Videos
by: Li, Zekun, et al.
Published: (2025)
Iterative Prompt Refinement for Safer Text-to-Image Generation
by: Jeon, Jinwoo, et al.
Published: (2025)
MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation
by: Luo, Yang, et al.
Published: (2026)
DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects
by: Xie, Guanghu, et al.
Published: (2025)
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
by: Chu, Wen-Hsuan, et al.
Published: (2024)
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
by: Wen, Boran, et al.
Published: (2025)
Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering
by: Zhao, Yanpeng, et al.
Published: (2024)
Adaptive Forensic Feature Refinement via Intrinsic Importance Perception
by: Yang, Jiazhen, et al.
Published: (2026)
Decoupled Generative Modeling for Human-Object Interaction Synthesis
by: Jung, Hwanhee, et al.
Published: (2025)