:: Library Catalog

Bibliographic Details
Main Authors:	Pinyoanuntapong, Ekkasit; Saleem, Muhammad Usama; Wang, Pu; Lee, Minwoo; Das, Srijan; Chen, Chen
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.19435

Similar Items

MMM: Generative Masked Motion Model
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2023)

GenHMR: Generative Human Mesh Recovery
by: Saleem, Muhammad Usama, et al.
Published: (2024)

MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
by: Saleem, Muhammad Usama, et al.
Published: (2024)

MaskControl: Spatio-Temporal Control for Masked Motion Synthesis
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2024)

Walk Before You Dance: High-fidelity and Editable Dance Synthesis via Generative Masked Motion Prior
by: Shah, Foram N, et al.
Published: (2025)

LiveGesture: Streamable Co-Speech Gesture Generation Model
by: Saleem, Muhammad Usama, et al.
Published: (2026)

KHMP: Frequency-Domain Kalman Refinement for High-Fidelity Human Motion Prediction
by: Wu, Wenhan, et al.
Published: (2026)

Monocular Models are Strong Learners for Multi-View Human Mesh Recovery
by: Xie, Haoyu, et al.
Published: (2026)

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
by: Sinha, Arkaprava, et al.
Published: (2025)

UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models
by: Govind, Manish Kumar, et al.
Published: (2026)

BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos
by: Koleini, Farnoosh, et al.
Published: (2025)

A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic
by: Zaman, Muhammad Imran, et al.
Published: (2025)

Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent Coding
by: Yang, Jiewen, et al.
Published: (2024)

Fusion-SSAT: Unleashing the Potential of Self-supervised Auxiliary Task by Feature Fusion for Generalized Deepfake Detection
by: Reddy, Shukesh, et al.
Published: (2026)

Self-supervised Auxiliary Learning for Texture and Model-based Hybrid Robust and Fair Featuring in Face Analysis
by: Reddy, Shukesh, et al.
Published: (2024)

MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
by: Sinha, Arkaprava, et al.
Published: (2025)

From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
by: Yin, Tianwei, et al.
Published: (2024)

A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model
by: Park, Jihun, et al.
Published: (2025)

Causal Motion Diffusion Models for Autoregressive Motion Generation
by: Yu, Qing, et al.
Published: (2026)

Bidirectional Autoregressive Diffusion Model for Dance Generation
by: Zhang, Canyu, et al.
Published: (2024)

MoSa: Motion Generation with Scalable Autoregressive Modeling
by: Liu, Mengyuan, et al.
Published: (2025)

Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer
by: Wu, Wenhan, et al.
Published: (2024)

VisCoP: Visual Probing for Video Domain Adaptation of Vision Language Models
by: Reilly, Dominick, et al.
Published: (2025)

NURBGen: High-Fidelity Text-to-CAD Generation through LLM-Driven NURBS Modeling
by: Usama, Muhammad, et al.
Published: (2025)

Next-Scale Autoregressive Models for Text-to-Motion Generation
by: Zheng, Zhiwei, et al.
Published: (2026)

LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living
by: Reilly, Dominick, et al.
Published: (2024)

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier
by: Howlader, Prantik, et al.
Published: (2024)

Coordinate-Based Dual-Constrained Autoregressive Motion Generation
by: Ding, Kang, et al.
Published: (2026)

HINT: Hierarchical Interaction Modeling for Autoregressive Multi-Human Motion Generation
by: Liu, Mengge, et al.
Published: (2026)

ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model
by: Lu, Shunlin, et al.
Published: (2024)

UHD Image Deblurring via Autoregressive Flow with Ill-conditioned Constraints
by: Xin, Yucheng, et al.
Published: (2026)

Autoregressive Flow Matching for Motion Prediction
by: Xie, Johnathan, et al.
Published: (2025)

MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
by: Xiao, Lixing, et al.
Published: (2025)

OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression
by: Li, Zhe, et al.
Published: (2025)

Dense Policy: Bidirectional Autoregressive Learning of Actions
by: Su, Yue, et al.
Published: (2025)

From My View to Yours: Ego-to-Exo Transfer in VLMs for Understanding Activities of Daily Living
by: Reilly, Dominick, et al.
Published: (2025)

EarthMapper: Visual Autoregressive Models for Controllable Bidirectional Satellite-Map Translation
by: Dong, Zhe, et al.
Published: (2025)

D2-V2X: Depth-Driven Cooperative V2X Reasoning for Autonomous Driving
by: Richard, Kevin, et al.
Published: (2026)

DiffSwap++: 3D Latent-Controlled Diffusion for Identity-Preserving Face Swapping
by: Bondurant, Weston, et al.
Published: (2025)

Bidirectional Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression
by: Deng, Xuan, et al.
Published: (2025)