Saved in:
| Main Authors: | Pinyoanuntapong, Ekkasit, Saleem, Muhammad Usama, Wang, Pu, Lee, Minwoo, Das, Srijan, Chen, Chen |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.19435 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MMM: Generative Masked Motion Model
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2023)
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2023)
GenHMR: Generative Human Mesh Recovery
by: Saleem, Muhammad Usama, et al.
Published: (2024)
by: Saleem, Muhammad Usama, et al.
Published: (2024)
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
by: Saleem, Muhammad Usama, et al.
Published: (2024)
by: Saleem, Muhammad Usama, et al.
Published: (2024)
MaskControl: Spatio-Temporal Control for Masked Motion Synthesis
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2024)
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2024)
Walk Before You Dance: High-fidelity and Editable Dance Synthesis via Generative Masked Motion Prior
by: Shah, Foram N, et al.
Published: (2025)
by: Shah, Foram N, et al.
Published: (2025)
LiveGesture Streamable Co-Speech Gesture Generation Model
by: Saleem, Muhammad Usama, et al.
Published: (2026)
by: Saleem, Muhammad Usama, et al.
Published: (2026)
KHMP: Frequency-Domain Kalman Refinement for High-Fidelity Human Motion Prediction
by: Wu, Wenhan, et al.
Published: (2026)
by: Wu, Wenhan, et al.
Published: (2026)
Monocular Models are Strong Learners for Multi-View Human Mesh Recovery
by: Xie, Haoyu, et al.
Published: (2026)
by: Xie, Haoyu, et al.
Published: (2026)
SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
by: Sinha, Arkaprava, et al.
Published: (2025)
by: Sinha, Arkaprava, et al.
Published: (2025)
UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models
by: Govind, Manish Kumar, et al.
Published: (2026)
by: Govind, Manish Kumar, et al.
Published: (2026)
BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos
by: Koleini, Farnoosh, et al.
Published: (2025)
by: Koleini, Farnoosh, et al.
Published: (2025)
A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic
by: Zaman, Muhammad Imran, et al.
Published: (2025)
by: Zaman, Muhammad Imran, et al.
Published: (2025)
Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent Coding
by: Yang, Jiewen, et al.
Published: (2024)
by: Yang, Jiewen, et al.
Published: (2024)
Fusion-SSAT: Unleashing the Potential of Self-supervised Auxiliary Task by Feature Fusion for Generalized Deepfake Detection
by: Reddy, Shukesh, et al.
Published: (2026)
by: Reddy, Shukesh, et al.
Published: (2026)
Self-supervised Auxiliary Learning for Texture and Model-based Hybrid Robust and Fair Featuring in Face Analysis
by: Reddy, Shukesh, et al.
Published: (2024)
by: Reddy, Shukesh, et al.
Published: (2024)
MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
by: Sinha, Arkaprava, et al.
Published: (2025)
by: Sinha, Arkaprava, et al.
Published: (2025)
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
by: Yin, Tianwei, et al.
Published: (2024)
by: Yin, Tianwei, et al.
Published: (2024)
A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model
by: Park, Jihun, et al.
Published: (2025)
by: Park, Jihun, et al.
Published: (2025)
Causal Motion Diffusion Models for Autoregressive Motion Generation
by: Yu, Qing, et al.
Published: (2026)
by: Yu, Qing, et al.
Published: (2026)
Bidirectional Autoregressive Diffusion Model for Dance Generation
by: Zhang, Canyu, et al.
Published: (2024)
by: Zhang, Canyu, et al.
Published: (2024)
MoSa: Motion Generation with Scalable Autoregressive Modeling
by: Liu, Mengyuan, et al.
Published: (2025)
by: Liu, Mengyuan, et al.
Published: (2025)
Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer
by: Wu, Wenhan, et al.
Published: (2024)
by: Wu, Wenhan, et al.
Published: (2024)
VisCoP: Visual Probing for Video Domain Adaptation of Vision Language Models
by: Reilly, Dominick, et al.
Published: (2025)
by: Reilly, Dominick, et al.
Published: (2025)
NURBGen: High-Fidelity Text-to-CAD Generation through LLM-Driven NURBS Modeling
by: Usama, Muhammad, et al.
Published: (2025)
by: Usama, Muhammad, et al.
Published: (2025)
Next-Scale Autoregressive Models for Text-to-Motion Generation
by: Zheng, Zhiwei, et al.
Published: (2026)
by: Zheng, Zhiwei, et al.
Published: (2026)
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living
by: Reilly, Dominick, et al.
Published: (2024)
by: Reilly, Dominick, et al.
Published: (2024)
Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier
by: Howlader, Prantik, et al.
Published: (2024)
by: Howlader, Prantik, et al.
Published: (2024)
Coordinate-Based Dual-Constrained Autoregressive Motion Generation
by: Ding, Kang, et al.
Published: (2026)
by: Ding, Kang, et al.
Published: (2026)
HINT: Hierarchical Interaction Modeling for Autoregressive Multi-Human Motion Generation
by: Liu, Mengge, et al.
Published: (2026)
by: Liu, Mengge, et al.
Published: (2026)
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model
by: Lu, Shunlin, et al.
Published: (2024)
by: Lu, Shunlin, et al.
Published: (2024)
UHD Image Deblurring via Autoregressive Flow with Ill-conditioned Constraints
by: Xin, Yucheng, et al.
Published: (2026)
by: Xin, Yucheng, et al.
Published: (2026)
Autoregressive Flow Matching for Motion Prediction
by: Xie, Johnathan, et al.
Published: (2025)
by: Xie, Johnathan, et al.
Published: (2025)
MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
by: Xiao, Lixing, et al.
Published: (2025)
by: Xiao, Lixing, et al.
Published: (2025)
OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression
by: Li, Zhe, et al.
Published: (2025)
by: Li, Zhe, et al.
Published: (2025)
Dense Policy: Bidirectional Autoregressive Learning of Actions
by: Su, Yue, et al.
Published: (2025)
by: Su, Yue, et al.
Published: (2025)
From My View to Yours: Ego-to-Exo Transfer in VLMs for Understanding Activities of Daily Living
by: Reilly, Dominick, et al.
Published: (2025)
by: Reilly, Dominick, et al.
Published: (2025)
EarthMapper: Visual Autoregressive Models for Controllable Bidirectional Satellite-Map Translation
by: Dong, Zhe, et al.
Published: (2025)
by: Dong, Zhe, et al.
Published: (2025)
D2-V2X: Depth-Driven Cooperative V2X Reasoning for Autonomous Driving
by: Richard, Kevin, et al.
Published: (2026)
by: Richard, Kevin, et al.
Published: (2026)
DiffSwap++: 3D Latent-Controlled Diffusion for Identity-Preserving Face Swapping
by: Bondurant, Weston, et al.
Published: (2025)
by: Bondurant, Weston, et al.
Published: (2025)
Bidirectional Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression
by: Deng, Xuan, et al.
Published: (2025)
by: Deng, Xuan, et al.
Published: (2025)
Similar Items
-
MMM: Generative Masked Motion Model
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2023) -
GenHMR: Generative Human Mesh Recovery
by: Saleem, Muhammad Usama, et al.
Published: (2024) -
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
by: Saleem, Muhammad Usama, et al.
Published: (2024) -
MaskControl: Spatio-Temporal Control for Masked Motion Synthesis
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2024) -
Walk Before You Dance: High-fidelity and Editable Dance Synthesis via Generative Masked Motion Prior
by: Shah, Foram N, et al.
Published: (2025)