:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mondal, Arnab Kumar, Alletto, Stefano, Tome, Denis
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.10880
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers
by: Gandhi, Sanket, et al.
Published: (2024)

KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse Signals
by: Zhao, Shuting, et al.
Published: (2025)

GeoMotionGPT: Geometry-Aligned Motion Understanding with Large Language Models
by: Ye, Zhankai, et al.
Published: (2026)

KinMo: Kinematic-aware Human Motion Understanding and Generation
by: Zhang, Pengfei, et al.
Published: (2024)

Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery
by: Zhang, Xiang, et al.
Published: (2025)

Multimodal Generative AI with Autoregressive LLMs for Human Motion Understanding and Generation: A Way Forward
by: Islam, Muhammad, et al.
Published: (2025)

HUMOS: Human Motion Model Conditioned on Body Shape
by: Tripathi, Shashank, et al.
Published: (2024)

HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields
by: Dey, Arnab, et al.
Published: (2024)

Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
by: Deng, Andong, et al.
Published: (2024)

UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
by: Yang, Panqi, et al.
Published: (2025)

Human Motion Instruction Tuning
by: Li, Lei, et al.
Published: (2024)

UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
by: Wang, Ziyi, et al.
Published: (2026)

HumanCM: One Step Human Motion Prediction
by: Haojie, Liu, et al.
Published: (2025)

HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features
by: Dey, Arnab, et al.
Published: (2024)

Deformba: Vision State Space Model with Adaptive State Fusion
by: Ke, Hongyu, et al.
Published: (2026)

MambaEVT: Event Stream based Visual Object Tracking using State Space Model
by: Wang, Xiao, et al.
Published: (2024)

Geometry-Guided Camera Motion Understanding in VideoLLMs
by: Feng, Haoan, et al.
Published: (2026)

StickMotion: Generating 3D Human Motions by Drawing a Stickman
by: Wang, Tao, et al.
Published: (2025)

DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction
by: Yu, Hua, et al.
Published: (2024)

WANDR: Intention-guided Human Motion Generation
by: Diomataris, Markos, et al.
Published: (2024)

VFIMamba: Video Frame Interpolation with State Space Models
by: Zhang, Guozhen, et al.
Published: (2024)

SF-Mamba: Rethinking State Space Model for Vision
by: Yoshimura, Masakazu, et al.
Published: (2026)

HumANDiff: Articulated Noise Diffusion for Motion-Consistent Human Video Generation
by: Hu, Tao, et al.
Published: (2026)

MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions
by: Yan, Sheng, et al.
Published: (2024)

TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
by: Li, Ruineng, et al.
Published: (2025)

PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation
by: Huang, Yidong, et al.
Published: (2026)

X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents
by: Song, Guoxian, et al.
Published: (2025)

Towards Fine-Grained Human Motion Video Captioning
by: Song, Guorui, et al.
Published: (2025)

InsectMamba: Insect Pest Classification with State Space Model
by: Wang, Qianning, et al.
Published: (2024)

Unleashing Diffusion and State Space Models for Medical Image Segmentation
by: Wu, Rong, et al.
Published: (2025)

Unified Medical Image Segmentation with State Space Modeling Snake
by: Zhang, Ruicheng, et al.
Published: (2025)

Tri-Modal Motion Retrieval by Learning a Joint Embedding Space
by: Yin, Kangning, et al.
Published: (2024)

MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models
by: Yang, Garry, et al.
Published: (2025)

GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields
by: Dey, Arnab, et al.
Published: (2024)

BadHMP: Backdoor Attack against Human Motion Prediction
by: Xu, Chaohui, et al.
Published: (2024)

LS-GAN: Human Motion Synthesis with Latent-space GANs
by: Amballa, Avinash, et al.
Published: (2024)

Contact-aware Human Motion Generation from Textual Descriptions
by: Ma, Sihan, et al.
Published: (2024)

Generative AI-Driven High-Fidelity Human Motion Simulation
by: Iyer, Hari, et al.
Published: (2025)

RAM: Recover Any 3D Human Motion in-the-Wild
by: Jia, Sen, et al.
Published: (2026)

Coordinating Multiple Conditions for Trajectory-Controlled Human Motion Generation
by: Cai, Deli, et al.
Published: (2026)