:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Fan, Siyuan, Huang, Wenke, Cai, Xiantao, Du, Bo
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.13120
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TextIM: Part-aware Interactive Motion Synthesis from Text
by: Fan, Siyuan, et al.
Published: (2024)

A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations
by: Ye, Mang, et al.
Published: (2025)

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
by: Yang, Jie, et al.
Published: (2024)

Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?
by: He, Haibin, et al.
Published: (2025)

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
by: Yang, Fan, et al.
Published: (2023)

Disentangled Hierarchical VAE for 3D Human-Human Interaction Generation
by: Geng, Zichen, et al.
Published: (2026)

SegVol: Universal and Interactive Volumetric Medical Image Segmentation
by: Du, Yuxin, et al.
Published: (2023)

InterFusion: Text-Driven Generation of 3D Human-Object Interaction
by: Dai, Sisi, et al.
Published: (2024)

Motion-Adaptive Multi-Scale Temporal Modelling with Skeleton-Constrained Spatial Graphs for Efficient 3D Human Pose Estimation
by: Li, Ruochen, et al.
Published: (2026)

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
by: Bai, Fan, et al.
Published: (2024)

InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects
by: Cai, Xinhao, et al.
Published: (2025)

A Survey on Human Interaction Motion Generation
by: Sui, Kewei, et al.
Published: (2025)

The Escalator Problem: Identifying Implicit Motion Blindness in AI for Accessibility
by: Zhang, Xiantao
Published: (2025)

A Survey on 3D Human Avatar Modeling -- From Reconstruction to Generation
by: Wang, Ruihe, et al.
Published: (2024)

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
by: Rong, Xuankun, et al.
Published: (2025)

InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
by: Zhang, Jinlu, et al.
Published: (2025)

TELA: Text to Layer-wise 3D Clothed Human Generation
by: Dong, Junting, et al.
Published: (2024)

Visual Mamba: A Survey and New Outlooks
by: Xu, Rui, et al.
Published: (2024)

Recent Advances in 3D Object and Scene Generation: A Survey
by: Tang, Xiang, et al.
Published: (2025)

3D Human-Human Interaction Anomaly Detection
by: Maeda, Shun, et al.
Published: (2025)

HumanOrbit: 3D Human Reconstruction as 360° Orbit Generation
by: Suzuki, Keito, et al.
Published: (2026)

LaxMotion: Rethinking Supervision Granularity for 3D Human Motion Generation
by: Liu, Sheng, et al.
Published: (2025)

Scaling Up Dynamic Human-Scene Interaction Modeling
by: Jiang, Nan, et al.
Published: (2024)

Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
by: Wang, Zan, et al.
Published: (2024)

LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
by: Liang, Jian, et al.
Published: (2025)

ContactGen: Contact-Guided Interactive 3D Human Generation for Partners
by: Gu, Dongjun, et al.
Published: (2024)

A Survey on 3D Egocentric Human Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2024)

TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation
by: Wang, Yabiao, et al.
Published: (2024)

Uni-Inter: Unifying 3D Human Motion Synthesis Across Diverse Interaction Contexts
by: Liu, Sheng, et al.
Published: (2025)

A Survey of Interactive Generative Video
by: Yu, Jiwen, et al.
Published: (2025)

HUMOTO: A 4D Dataset of Mocap Human Object Interactions
by: Lu, Jiaxin, et al.
Published: (2025)

Interactive3D: Create What You Want by Interactive 3D Generation
by: Dong, Shaocong, et al.
Published: (2024)

HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation
by: Huang, Ziyao, et al.
Published: (2025)

3D Scene Generation: A Survey
by: Wen, Beichen, et al.
Published: (2025)

3D Shape Generation: A Survey
by: Caytuiro, Nicolas, et al.
Published: (2025)

Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models
by: Ma, Sihan, et al.
Published: (2026)

Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
by: Cai, Zhaolin, et al.
Published: (2025)

InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE
by: Wang, Lipeng, et al.
Published: (2025)

3D Generation for Embodied AI and Robotic Simulation: A Survey
by: Ye, Tianwei, et al.
Published: (2026)

VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos
by: Mao, Aihua, et al.
Published: (2026)