| Main Authors: | Mondal, Arnab Kumar, Alletto, Stefano, Tome, Denis |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.10880 |
Similar Items
Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers
by: Gandhi, Sanket, et al.
Published: (2024)
KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse Signals
by: Zhao, Shuting, et al.
Published: (2025)
GeoMotionGPT: Geometry-Aligned Motion Understanding with Large Language Models
by: Ye, Zhankai, et al.
Published: (2026)
KinMo: Kinematic-aware Human Motion Understanding and Generation
by: Zhang, Pengfei, et al.
Published: (2024)
Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery
by: Zhang, Xiang, et al.
Published: (2025)
Multimodal Generative AI with Autoregressive LLMs for Human Motion Understanding and Generation: A Way Forward
by: Islam, Muhammad, et al.
Published: (2025)
HUMOS: Human Motion Model Conditioned on Body Shape
by: Tripathi, Shashank, et al.
Published: (2024)
HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields
by: Dey, Arnab, et al.
Published: (2024)
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
by: Deng, Andong, et al.
Published: (2024)
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
by: Yang, Panqi, et al.
Published: (2025)
Human Motion Instruction Tuning
by: Li, Lei, et al.
Published: (2024)
UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
by: Wang, Ziyi, et al.
Published: (2026)
HumanCM: One Step Human Motion Prediction
by: Haojie, Liu, et al.
Published: (2025)
HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features
by: Dey, Arnab, et al.
Published: (2024)
Deformba: Vision State Space Model with Adaptive State Fusion
by: Ke, Hongyu, et al.
Published: (2026)
MambaEVT: Event Stream based Visual Object Tracking using State Space Model
by: Wang, Xiao, et al.
Published: (2024)
Geometry-Guided Camera Motion Understanding in VideoLLMs
by: Feng, Haoan, et al.
Published: (2026)
StickMotion: Generating 3D Human Motions by Drawing a Stickman
by: Wang, Tao, et al.
Published: (2025)
DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction
by: Yu, Hua, et al.
Published: (2024)
WANDR: Intention-guided Human Motion Generation
by: Diomataris, Markos, et al.
Published: (2024)
VFIMamba: Video Frame Interpolation with State Space Models
by: Zhang, Guozhen, et al.
Published: (2024)
SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)
HumANDiff: Articulated Noise Diffusion for Motion-Consistent Human Video Generation
by: Hu, Tao, et al.
Published: (2026)
MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions
by: Yan, Sheng, et al.
Published: (2024)
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
by: Li, Ruineng, et al.
Published: (2025)
PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation
by: Huang, Yidong, et al.
Published: (2026)
X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents
by: Song, Guoxian, et al.
Published: (2025)
Towards Fine-Grained Human Motion Video Captioning
by: Song, Guorui, et al.
Published: (2025)
InsectMamba: Insect Pest Classification with State Space Model
by: Wang, Qianning, et al.
Published: (2024)
Unleashing Diffusion and State Space Models for Medical Image Segmentation
by: Wu, Rong, et al.
Published: (2025)
Unified Medical Image Segmentation with State Space Modeling Snake
by: Zhang, Ruicheng, et al.
Published: (2025)
Tri-Modal Motion Retrieval by Learning a Joint Embedding Space
by: Yin, Kangning, et al.
Published: (2024)
MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models
by: Yang, Garry, et al.
Published: (2025)
GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields
by: Dey, Arnab, et al.
Published: (2024)
BadHMP: Backdoor Attack against Human Motion Prediction
by: Xu, Chaohui, et al.
Published: (2024)
LS-GAN: Human Motion Synthesis with Latent-space GANs
by: Amballa, Avinash, et al.
Published: (2024)
Contact-aware Human Motion Generation from Textual Descriptions
by: Ma, Sihan, et al.
Published: (2024)
Generative AI-Driven High-Fidelity Human Motion Simulation
by: Iyer, Hari, et al.
Published: (2025)
RAM: Recover Any 3D Human Motion in-the-Wild
by: Jia, Sen, et al.
Published: (2026)
Coordinating Multiple Conditions for Trajectory-Controlled Human Motion Generation
by: Cai, Deli, et al.
Published: (2026)