Saved in:
| Main Authors: | Wang, Wenjia, Pan, Liang, Pi, Huaijin, Lou, Yuke, Ren, Xuqian, Wu, Yifan, Liao, Zhouyingcheng, Yang, Lei, Dabral, Rishabh, Theobalt, Christian, Komura, Taku |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.23205 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation
by: Wang, Wenjia, et al.
Published: (2024)
by: Wang, Wenjia, et al.
Published: (2024)
SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens
by: Ghosh, Anindita, et al.
Published: (2026)
by: Ghosh, Anindita, et al.
Published: (2026)
RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse
by: Liao, Zhouyingcheng, et al.
Published: (2024)
by: Liao, Zhouyingcheng, et al.
Published: (2024)
SENC: Handling Self-collision in Neural Cloth Simulation
by: Liao, Zhouyingcheng, et al.
Published: (2024)
by: Liao, Zhouyingcheng, et al.
Published: (2024)
CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects
by: Pi, Huaijin, et al.
Published: (2025)
by: Pi, Huaijin, et al.
Published: (2025)
Betsu-Betsu: Multi-View Separable 3D Reconstruction of Two Interacting Objects
by: Gopal, Suhas, et al.
Published: (2025)
by: Gopal, Suhas, et al.
Published: (2025)
Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors
by: Lou, Yuke, et al.
Published: (2025)
by: Lou, Yuke, et al.
Published: (2025)
Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance
by: Aytekin, Ayce Idil, et al.
Published: (2025)
by: Aytekin, Ayce Idil, et al.
Published: (2025)
MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering
by: Sun, Guoxing, et al.
Published: (2024)
by: Sun, Guoxing, et al.
Published: (2024)
It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model
by: Shi, Mingyi, et al.
Published: (2024)
by: Shi, Mingyi, et al.
Published: (2024)
VHOI: Controllable Video Generation of Human-Object Interactions from Sparse Trajectories via Motion Densification
by: Zhang, Wanyue, et al.
Published: (2025)
by: Zhang, Wanyue, et al.
Published: (2025)
EMDM: Efficient Motion Diffusion Model for Fast and High-Quality Motion Generation
by: Zhou, Wenyang, et al.
Published: (2023)
by: Zhou, Wenyang, et al.
Published: (2023)
MIBURI: Towards Expressive Interactive Gesture Synthesis
by: Mughal, M. Hamza, et al.
Published: (2026)
by: Mughal, M. Hamza, et al.
Published: (2026)
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
by: Pan, Liang, et al.
Published: (2025)
by: Pan, Liang, et al.
Published: (2025)
Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures
by: Sun, Guoxing, et al.
Published: (2024)
by: Sun, Guoxing, et al.
Published: (2024)
Mocap-2-to-3: Multi-view Lifting for Monocular Motion Recovery with 2D Pretraining
by: Wang, Zhumei, et al.
Published: (2025)
by: Wang, Zhumei, et al.
Published: (2025)
ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions
by: Ghosh, Anindita, et al.
Published: (2023)
by: Ghosh, Anindita, et al.
Published: (2023)
Grasp in Gaussians: Fast Monocular Reconstruction of Dynamic Hand-Object Interactions
by: Aytekin, Ayce Idil, et al.
Published: (2026)
by: Aytekin, Ayce Idil, et al.
Published: (2026)
PractiLight: Practical Light Control Using Foundational Diffusion Models
by: Erel, Yotam, et al.
Published: (2025)
by: Erel, Yotam, et al.
Published: (2025)
Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
by: Wang, Jian, et al.
Published: (2025)
by: Wang, Jian, et al.
Published: (2025)
FunRec: Reconstructing Functional 3D Scenes from Egocentric Interaction Videos
by: Delitzas, Alexandros, et al.
Published: (2026)
by: Delitzas, Alexandros, et al.
Published: (2026)
CBIL: Collective Behavior Imitation Learning for Fish from Real Videos
by: Wu, Yifan, et al.
Published: (2025)
by: Wu, Yifan, et al.
Published: (2025)
3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs
by: Abadian, Artin Saberpour, et al.
Published: (2025)
by: Abadian, Artin Saberpour, et al.
Published: (2025)
PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing
by: Seth, Siddharth, et al.
Published: (2024)
by: Seth, Siddharth, et al.
Published: (2024)
Physics-based Human Pose Estimation from a Single Moving RGB Camera
by: Aytekin, Ayce Idil, et al.
Published: (2025)
by: Aytekin, Ayce Idil, et al.
Published: (2025)
ROAM: Robust and Object-Aware Motion Generation Using Neural Pose Descriptors
by: Zhang, Wanyue, et al.
Published: (2023)
by: Zhang, Wanyue, et al.
Published: (2023)
MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis
by: Ren, Xuqian, et al.
Published: (2023)
by: Ren, Xuqian, et al.
Published: (2023)
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
by: Mughal, Muhammad Hamza, et al.
Published: (2024)
by: Mughal, Muhammad Hamza, et al.
Published: (2024)
Attention (as Discrete-Time Markov) Chains
by: Erel, Yotam, et al.
Published: (2025)
by: Erel, Yotam, et al.
Published: (2025)
CoMoGen: COntrollable MOtion Dynamics and Interactions with Mask-Guided Video GENeration
by: Meric, Adil, et al.
Published: (2026)
by: Meric, Adil, et al.
Published: (2026)
Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis
by: Mughal, M. Hamza, et al.
Published: (2024)
by: Mughal, M. Hamza, et al.
Published: (2024)
SafeEmbodAI: a Safety Framework for Mobile Robots in Embodied AI Systems
by: Zhang, Wenxiao, et al.
Published: (2024)
by: Zhang, Wenxiao, et al.
Published: (2024)
Mocap Anywhere: Towards Pairwise-Distance based Motion Capture in the Wild (for the Wild)
by: Abramovich, Ofir, et al.
Published: (2026)
by: Abramovich, Ofir, et al.
Published: (2026)
Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
by: Singh, Kunwar Maheep, et al.
Published: (2025)
by: Singh, Kunwar Maheep, et al.
Published: (2025)
FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video
by: Camiletto, Andrea Boscolo, et al.
Published: (2025)
by: Camiletto, Andrea Boscolo, et al.
Published: (2025)
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)
by: Gunes, Ulas, et al.
Published: (2025)
BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects
by: Zhang, Wanyue, et al.
Published: (2024)
by: Zhang, Wanyue, et al.
Published: (2024)
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling
by: Ghosh, Anindita, et al.
Published: (2025)
by: Ghosh, Anindita, et al.
Published: (2025)
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation
by: Xu, Rui, et al.
Published: (2026)
by: Xu, Rui, et al.
Published: (2026)
Motion-2-To-3: Leveraging 2D Motion Data for 3D Motion Generations
by: Guo, Ruoxi, et al.
Published: (2024)
by: Guo, Ruoxi, et al.
Published: (2024)
Similar Items
-
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation
by: Wang, Wenjia, et al.
Published: (2024) -
SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens
by: Ghosh, Anindita, et al.
Published: (2026) -
RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse
by: Liao, Zhouyingcheng, et al.
Published: (2024) -
SENC: Handling Self-collision in Neural Cloth Simulation
by: Liao, Zhouyingcheng, et al.
Published: (2024) -
CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects
by: Pi, Huaijin, et al.
Published: (2025)