| Main Authors: | Fan, Siyuan, Huang, Wenke, Cai, Xiantao, Du, Bo |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2503.13120 |
Similar Items
TextIM: Part-aware Interactive Motion Synthesis from Text
by: Fan, Siyuan, et al.
Published: (2024)
A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations
by: Ye, Mang, et al.
Published: (2025)
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
by: Yang, Jie, et al.
Published: (2024)
Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?
by: He, Haibin, et al.
Published: (2025)
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
by: Yang, Fan, et al.
Published: (2023)
Disentangled Hierarchical VAE for 3D Human-Human Interaction Generation
by: Geng, Zichen, et al.
Published: (2026)
SegVol: Universal and Interactive Volumetric Medical Image Segmentation
by: Du, Yuxin, et al.
Published: (2023)
InterFusion: Text-Driven Generation of 3D Human-Object Interaction
by: Dai, Sisi, et al.
Published: (2024)
Motion-Adaptive Multi-Scale Temporal Modelling with Skeleton-Constrained Spatial Graphs for Efficient 3D Human Pose Estimation
by: Li, Ruochen, et al.
Published: (2026)
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
by: Bai, Fan, et al.
Published: (2024)
InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects
by: Cai, Xinhao, et al.
Published: (2025)
A Survey on Human Interaction Motion Generation
by: Sui, Kewei, et al.
Published: (2025)
The Escalator Problem: Identifying Implicit Motion Blindness in AI for Accessibility
by: Zhang, Xiantao
Published: (2025)
A Survey on 3D Human Avatar Modeling -- From Reconstruction to Generation
by: Wang, Ruihe, et al.
Published: (2024)
SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
by: Rong, Xuankun, et al.
Published: (2025)
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
by: Zhang, Jinlu, et al.
Published: (2025)
TELA: Text to Layer-wise 3D Clothed Human Generation
by: Dong, Junting, et al.
Published: (2024)
Visual Mamba: A Survey and New Outlooks
by: Xu, Rui, et al.
Published: (2024)
Recent Advances in 3D Object and Scene Generation: A Survey
by: Tang, Xiang, et al.
Published: (2025)
3D Human-Human Interaction Anomaly Detection
by: Maeda, Shun, et al.
Published: (2025)
HumanOrbit: 3D Human Reconstruction as 360° Orbit Generation
by: Suzuki, Keito, et al.
Published: (2026)
LaxMotion: Rethinking Supervision Granularity for 3D Human Motion Generation
by: Liu, Sheng, et al.
Published: (2025)
Scaling Up Dynamic Human-Scene Interaction Modeling
by: Jiang, Nan, et al.
Published: (2024)
Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
by: Wang, Zan, et al.
Published: (2024)
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
by: Liang, Jian, et al.
Published: (2025)
ContactGen: Contact-Guided Interactive 3D Human Generation for Partners
by: Gu, Dongjun, et al.
Published: (2024)
A Survey on 3D Egocentric Human Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2024)
TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation
by: Wang, Yabiao, et al.
Published: (2024)
Uni-Inter: Unifying 3D Human Motion Synthesis Across Diverse Interaction Contexts
by: Liu, Sheng, et al.
Published: (2025)
A Survey of Interactive Generative Video
by: Yu, Jiwen, et al.
Published: (2025)
HUMOTO: A 4D Dataset of Mocap Human Object Interactions
by: Lu, Jiaxin, et al.
Published: (2025)
Interactive3D: Create What You Want by Interactive 3D Generation
by: Dong, Shaocong, et al.
Published: (2024)
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation
by: Huang, Ziyao, et al.
Published: (2025)
3D Scene Generation: A Survey
by: Wen, Beichen, et al.
Published: (2025)
3D Shape Generation: A Survey
by: Caytuiro, Nicolas, et al.
Published: (2025)
Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models
by: Ma, Sihan, et al.
Published: (2026)
Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
by: Cai, Zhaolin, et al.
Published: (2025)
InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE
by: Wang, Lipeng, et al.
Published: (2025)
3D Generation for Embodied AI and Robotic Simulation: A Survey
by: Ye, Tianwei, et al.
Published: (2026)
VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos
by: Mao, Aihua, et al.
Published: (2026)