:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zou, Yude, Gong, Junji, Gao, Xing, Li, Zixuan, Chen, Tianxing, Zheng, Guanjie
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.04843
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

HOSIG: Full-Body Human-Object-Scene Interaction Generation with Hierarchical Scene Perception
by: Yao, Wei, et al.
Published: (2025)

SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning
by: Chen, Ziwei, et al.
Published: (2025)

InfScene-SR: Arbitrary-Size Image Super-Resolution via Iterative Joint-Denoising
by: Sun, Shoukun, et al.
Published: (2026)

InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects
by: Cai, Xinhao, et al.
Published: (2025)

Dynamic Worlds, Dynamic Humans: Generating Virtual Human-Scene Interaction Motion in Dynamic Scenes
by: Wang, Yin, et al.
Published: (2026)

Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement
by: Yuan, Zheng, et al.
Published: (2024)

Asset-Driven Sematic Reconstruction of Dynamic Scene with Multi-Human-Object Interactions
by: Biswas, Sandika, et al.
Published: (2025)

Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
by: Fei, Yuanchen, et al.
Published: (2026)

DFIR-DETR: Frequency-Domain Iterative Refinement and Dynamic Feature Aggregation for Small Object Detection
by: Gao, Bo, et al.
Published: (2025)

Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion
by: Guzov, Vladimir, et al.
Published: (2022)

OnlineHOI: Towards Online Human-Object Interaction Generation and Perception
by: Ji, Yihong, et al.
Published: (2025)

InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions
by: Wen, Liangjian, et al.
Published: (2025)

UniHM: Universal Human Motion Generation with Object Interactions in Indoor Scenes
by: Geng, Zichen, et al.
Published: (2025)

TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction
by: Huang, Yiyao, et al.
Published: (2025)

HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions
by: Phatak, Mrunmai Vivek, et al.
Published: (2025)

DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
by: Wang, Miaowei, et al.
Published: (2025)

ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation
by: Li, Hongjie, et al.
Published: (2024)

Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement
by: Liu, Rui
Published: (2025)

Scaling Up Dynamic Human-Scene Interaction Modeling
by: Jiang, Nan, et al.
Published: (2024)

InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images
by: Li, Wuzhou, et al.
Published: (2024)

Single-View Scene Point Cloud Human Grasp Generation
by: Wang, Yan-Kang, et al.
Published: (2024)

InfVSR: Toward Consistency-Driven Streaming Generative Video Super-Resolution
by: Zhang, Ziqing, et al.
Published: (2025)

Insights from Visual Cognition: Understanding Human Action Dynamics with Overall Glance and Refined Gaze Transformer
by: Xing, Bohao, et al.
Published: (2026)

InfMAE: A Foundation Model in the Infrared Modality
by: Liu, Fangcen, et al.
Published: (2024)

Composing People Together: Iterative Pose-Image Generation for Multi-Person Interaction Scenes
by: Peng, Wenxuan, et al.
Published: (2026)

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
by: Han, Tao, et al.
Published: (2025)

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
by: He, Xu, et al.
Published: (2024)

Generating Human Interaction Motions in Scenes with Text Control
by: Yi, Hongwei, et al.
Published: (2024)

CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
by: Liu, Yun, et al.
Published: (2024)

InterPhys: Physics-aware Human Motion Synthesis in a Dynamic Scene
by: Xing, Chaoyue, et al.
Published: (2026)

GRAFT: Geometric Refinement and Fitting Transformer for Human Scene Reconstruction
by: YM, Pradyumna, et al.
Published: (2026)

GenHSI: Controllable Generation of Human-Scene Interaction Videos
by: Li, Zekun, et al.
Published: (2025)

Iterative Prompt Refinement for Safer Text-to-Image Generation
by: Jeon, Jinwoo, et al.
Published: (2025)

MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation
by: Luo, Yang, et al.
Published: (2026)

DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects
by: Xie, Guanghu, et al.
Published: (2025)

DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
by: Chu, Wen-Hsuan, et al.
Published: (2024)

Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
by: Wen, Boran, et al.
Published: (2025)

Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering
by: Zhao, Yanpeng, et al.
Published: (2024)

Adaptive Forensic Feature Refinement via Intrinsic Importance Perception
by: Yang, Jiazhen, et al.
Published: (2026)

Decoupled Generative Modeling for Human-Object Interaction Synthesis
by: Jung, Hwanhee, et al.
Published: (2025)