Saved in:
| Main Authors: | Ding, Kang, Wang, Hongsong, Gui, Jie, Wang, Liang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08088 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
by: Wang, Hongsong, et al.
Published: (2024)
by: Wang, Hongsong, et al.
Published: (2024)
Marrying Text-to-Motion Generation with Skeleton-Based Action Recognition
by: Kuang, Jidong, et al.
Published: (2026)
by: Kuang, Jidong, et al.
Published: (2026)
Not All Agents Matter: From Global Attention Dilution to Risk-Prioritized Game Planning
by: Ding, Kang, et al.
Published: (2026)
by: Ding, Kang, et al.
Published: (2026)
Towards Universal Skeleton-Based Action Recognition
by: Kuang, Jidong, et al.
Published: (2026)
by: Kuang, Jidong, et al.
Published: (2026)
Structure-Aware Fine-Grained Gaussian Splatting for Expressive Avatar Reconstruction
by: Su, Yuze, et al.
Published: (2026)
by: Su, Yuze, et al.
Published: (2026)
Data-Free Class-Incremental Gesture Recognition with Prototype-Guided Pseudo Feature Replay
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
Region-aware Image-based Human Action Retrieval with Transformers
by: Wang, Hongsong, et al.
Published: (2024)
by: Wang, Hongsong, et al.
Published: (2024)
Heterogeneous Skeleton-Based Action Representation Learning
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
Point-Supervised Skeleton-Based Human Action Segmentation
by: Wang, Hongsong, et al.
Published: (2026)
by: Wang, Hongsong, et al.
Published: (2026)
Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution
by: Wang, Hongsong, et al.
Published: (2026)
by: Wang, Hongsong, et al.
Published: (2026)
Multimodal Skeleton-Based Action Representation Learning via Decomposition and Composition
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment
by: Kuang, Jidong, et al.
Published: (2024)
by: Kuang, Jidong, et al.
Published: (2024)
MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation
by: Tan, Xiaofeng, et al.
Published: (2026)
by: Tan, Xiaofeng, et al.
Published: (2026)
LOTA: Bit-Planes Guided AI-Generated Image Detection
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
OZ-TAL: Online Zero-Shot Temporal Action Localization
by: Han, Chaolei, et al.
Published: (2026)
by: Han, Chaolei, et al.
Published: (2026)
Dragging with Geometry: From Pixels to Geometry-Guided Image Editing
by: Pu, Xinyu, et al.
Published: (2025)
by: Pu, Xinyu, et al.
Published: (2025)
Fast Inference of Visual Autoregressive Model with Adjacency-Adaptive Dynamical Draft Trees
by: Lei, Haodong, et al.
Published: (2025)
by: Lei, Haodong, et al.
Published: (2025)
EasyTune: Efficient Step-Aware Fine-Tuning for Diffusion-Based Motion Generation
by: Tan, Xiaofeng, et al.
Published: (2026)
by: Tan, Xiaofeng, et al.
Published: (2026)
Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning
by: Bi, Yuquan, et al.
Published: (2025)
by: Bi, Yuquan, et al.
Published: (2025)
MoSa: Motion Generation with Scalable Autoregressive Modeling
by: Liu, Mengyuan, et al.
Published: (2025)
by: Liu, Mengyuan, et al.
Published: (2025)
Temporal Consistency-Aware Text-to-Motion Generation
by: Wang, Hongsong, et al.
Published: (2026)
by: Wang, Hongsong, et al.
Published: (2026)
Controllable Dance Generation with Style-Guided Motion Diffusion
by: Wang, Hongsong, et al.
Published: (2024)
by: Wang, Hongsong, et al.
Published: (2024)
Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models
by: Han, Chaolei, et al.
Published: (2025)
by: Han, Chaolei, et al.
Published: (2025)
SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
by: Tan, Xiaofeng, et al.
Published: (2024)
by: Tan, Xiaofeng, et al.
Published: (2024)
ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment
by: Weng, Wanjiang, et al.
Published: (2025)
by: Weng, Wanjiang, et al.
Published: (2025)
Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection
by: Tan, Xiaofeng, et al.
Published: (2024)
by: Tan, Xiaofeng, et al.
Published: (2024)
ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment
by: Weng, Wanjiang, et al.
Published: (2025)
by: Weng, Wanjiang, et al.
Published: (2025)
PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
Bilingual Text-to-Motion Generation: A New Benchmark and Baselines
by: Weng, Wanjiang, et al.
Published: (2026)
by: Weng, Wanjiang, et al.
Published: (2026)
MoReact: Generating Reactive Motion from Textual Descriptions
by: Xu, Xiyan, et al.
Published: (2025)
by: Xu, Xiyan, et al.
Published: (2025)
MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
by: Xiao, Lixing, et al.
Published: (2025)
by: Xiao, Lixing, et al.
Published: (2025)
Causal Motion Diffusion Models for Autoregressive Motion Generation
by: Yu, Qing, et al.
Published: (2026)
by: Yu, Qing, et al.
Published: (2026)
OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression
by: Li, Zhe, et al.
Published: (2025)
by: Li, Zhe, et al.
Published: (2025)
Towards Variable and Coordinated Holistic Co-Speech Motion Generation
by: Liu, Yifei, et al.
Published: (2024)
by: Liu, Yifei, et al.
Published: (2024)
BAMM: Bidirectional Autoregressive Motion Model
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2024)
by: Pinyoanuntapong, Ekkasit, et al.
Published: (2024)
HINT: Hierarchical Interaction Modeling for Autoregressive Multi-Human Motion Generation
by: Liu, Mengge, et al.
Published: (2026)
by: Liu, Mengge, et al.
Published: (2026)
Foundation Model for Skeleton-Based Human Action Understanding
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision
by: Zou, Xiandong, et al.
Published: (2025)
by: Zou, Xiandong, et al.
Published: (2025)
ScaleMoGen: Autoregressive Next-Scale Prediction for Human Motion Generation
by: Hwang, Inwoo, et al.
Published: (2026)
by: Hwang, Inwoo, et al.
Published: (2026)
Next-Scale Autoregressive Models for Text-to-Motion Generation
by: Zheng, Zhiwei, et al.
Published: (2026)
by: Zheng, Zhiwei, et al.
Published: (2026)
Similar Items
-
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
by: Wang, Hongsong, et al.
Published: (2024) -
Marrying Text-to-Motion Generation with Skeleton-Based Action Recognition
by: Kuang, Jidong, et al.
Published: (2026) -
Not All Agents Matter: From Global Attention Dilution to Risk-Prioritized Game Planning
by: Ding, Kang, et al.
Published: (2026) -
Towards Universal Skeleton-Based Action Recognition
by: Kuang, Jidong, et al.
Published: (2026) -
Structure-Aware Fine-Grained Gaussian Splatting for Expressive Avatar Reconstruction
by: Su, Yuze, et al.
Published: (2026)