:: Library Catalog

Bibliographic Details
Main Authors:	Wang, Wenjia; Pan, Liang; Pi, Huaijin; Lou, Yuke; Ren, Xuqian; Wu, Yifan; Liao, Zhouyingcheng; Yang, Lei; Dabral, Rishabh; Theobalt, Christian; Komura, Taku
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.23205

Similar Items

SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation
by: Wang, Wenjia, et al.
Published: (2024)

SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens
by: Ghosh, Anindita, et al.
Published: (2026)

RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse
by: Liao, Zhouyingcheng, et al.
Published: (2024)

SENC: Handling Self-collision in Neural Cloth Simulation
by: Liao, Zhouyingcheng, et al.
Published: (2024)

CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects
by: Pi, Huaijin, et al.
Published: (2025)

Betsu-Betsu: Multi-View Separable 3D Reconstruction of Two Interacting Objects
by: Gopal, Suhas, et al.
Published: (2025)

Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors
by: Lou, Yuke, et al.
Published: (2025)

Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance
by: Aytekin, Ayce Idil, et al.
Published: (2025)

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering
by: Sun, Guoxing, et al.
Published: (2024)

It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model
by: Shi, Mingyi, et al.
Published: (2024)

VHOI: Controllable Video Generation of Human-Object Interactions from Sparse Trajectories via Motion Densification
by: Zhang, Wanyue, et al.
Published: (2025)

EMDM: Efficient Motion Diffusion Model for Fast and High-Quality Motion Generation
by: Zhou, Wenyang, et al.
Published: (2023)

MIBURI: Towards Expressive Interactive Gesture Synthesis
by: Mughal, M. Hamza, et al.
Published: (2026)

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
by: Pan, Liang, et al.
Published: (2025)

Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures
by: Sun, Guoxing, et al.
Published: (2024)

Mocap-2-to-3: Multi-view Lifting for Monocular Motion Recovery with 2D Pretraining
by: Wang, Zhumei, et al.
Published: (2025)

ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions
by: Ghosh, Anindita, et al.
Published: (2023)

Grasp in Gaussians: Fast Monocular Reconstruction of Dynamic Hand-Object Interactions
by: Aytekin, Ayce Idil, et al.
Published: (2026)

PractiLight: Practical Light Control Using Foundational Diffusion Models
by: Erel, Yotam, et al.
Published: (2025)

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
by: Wang, Jian, et al.
Published: (2025)

FunRec: Reconstructing Functional 3D Scenes from Egocentric Interaction Videos
by: Delitzas, Alexandros, et al.
Published: (2026)

CBIL: Collective Behavior Imitation Learning for Fish from Real Videos
by: Wu, Yifan, et al.
Published: (2025)

3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs
by: Abadian, Artin Saberpour, et al.
Published: (2025)

PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing
by: Seth, Siddharth, et al.
Published: (2024)

Physics-based Human Pose Estimation from a Single Moving RGB Camera
by: Aytekin, Ayce Idil, et al.
Published: (2025)

ROAM: Robust and Object-Aware Motion Generation Using Neural Pose Descriptors
by: Zhang, Wanyue, et al.
Published: (2023)

MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis
by: Ren, Xuqian, et al.
Published: (2023)

ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
by: Mughal, Muhammad Hamza, et al.
Published: (2024)

Attention (as Discrete-Time Markov) Chains
by: Erel, Yotam, et al.
Published: (2025)

CoMoGen: COntrollable MOtion Dynamics and Interactions with Mask-Guided Video GENeration
by: Meric, Adil, et al.
Published: (2026)

Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis
by: Mughal, M. Hamza, et al.
Published: (2024)

SafeEmbodAI: a Safety Framework for Mobile Robots in Embodied AI Systems
by: Zhang, Wenxiao, et al.
Published: (2024)

Mocap Anywhere: Towards Pairwise-Distance based Motion Capture in the Wild (for the Wild)
by: Abramovich, Ofir, et al.
Published: (2026)

Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
by: Singh, Kunwar Maheep, et al.
Published: (2025)

FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video
by: Camiletto, Andrea Boscolo, et al.
Published: (2025)

FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)

BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects
by: Zhang, Wanyue, et al.
Published: (2024)

DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling
by: Ghosh, Anindita, et al.
Published: (2025)

Strips as Tokens: Artist Mesh Generation with Native UV Segmentation
by: Xu, Rui, et al.
Published: (2026)

Motion-2-To-3: Leveraging 2D Motion Data for 3D Motion Generations
by: Guo, Ruoxi, et al.
Published: (2024)