Saved in:
| Main Authors: | Wang, Lizhen, Xia, Zhurong, Hu, Tianshu, Wang, Pengrui, Wei, Pengfei, Zheng, Zerong, Zhou, Ming, Zhang, Yuan, Gao, Mingyuan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.10568 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
by: Luo, Yuxuan, et al.
Published: (2025)
by: Luo, Yuxuan, et al.
Published: (2025)
DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning
by: Luo, Mingshuang, et al.
Published: (2026)
by: Luo, Mingshuang, et al.
Published: (2026)
DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework
by: Zuo, Tongchun, et al.
Published: (2025)
by: Zuo, Tongchun, et al.
Published: (2025)
Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer
by: Shao, Ruizhi, et al.
Published: (2024)
by: Shao, Ruizhi, et al.
Published: (2024)
AlignHuman: Improving Motion and Fidelity via Timestep-Segment Preference Optimization for Audio-Driven Human Animation
by: Liang, Chao, et al.
Published: (2025)
by: Liang, Chao, et al.
Published: (2025)
FlowAct-R1: Towards Interactive Humanoid Video Generation
by: Wang, Lizhen, et al.
Published: (2026)
by: Wang, Lizhen, et al.
Published: (2026)
Animatable and Relightable Gaussians for High-fidelity Human Avatar Modeling
by: Li, Zhe, et al.
Published: (2023)
by: Li, Zhe, et al.
Published: (2023)
Towards Imbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation
by: Yu, Tianshu, et al.
Published: (2023)
by: Yu, Tianshu, et al.
Published: (2023)
InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions
by: Wang, Zhenzhi, et al.
Published: (2025)
by: Wang, Zhenzhi, et al.
Published: (2025)
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
by: Jiang, Jianwen, et al.
Published: (2025)
by: Jiang, Jianwen, et al.
Published: (2025)
DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
by: Guo, Xu, et al.
Published: (2026)
by: Guo, Xu, et al.
Published: (2026)
EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer
by: Yang, Yuxiao, et al.
Published: (2025)
by: Yang, Yuxiao, et al.
Published: (2025)
DreamShot: Personalized Storyboard Synthesis with Video Diffusion Prior
by: Huang, Junjia, et al.
Published: (2026)
by: Huang, Junjia, et al.
Published: (2026)
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
by: Wang, Cong, et al.
Published: (2023)
by: Wang, Cong, et al.
Published: (2023)
RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse
by: Liao, Zhouyingcheng, et al.
Published: (2024)
by: Liao, Zhouyingcheng, et al.
Published: (2024)
PP-Motion: Physical-Perceptual Fidelity Evaluation for Human Motion Generation
by: Zhao, Sihan, et al.
Published: (2025)
by: Zhao, Sihan, et al.
Published: (2025)
Video Diffusion Transformers are In-Context Learners
by: Fei, Zhengcong, et al.
Published: (2024)
by: Fei, Zhengcong, et al.
Published: (2024)
Structural Analysis of Phosphorus and Arsenic Clusters: A Comparative DFT and MP2 Study
by: Zerong Daniel Wang
Published: (2025)
by: Zerong Daniel Wang
Published: (2025)
HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
by: Gan, Qijun, et al.
Published: (2025)
by: Gan, Qijun, et al.
Published: (2025)
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
by: Huang, Junjia, et al.
Published: (2025)
by: Huang, Junjia, et al.
Published: (2025)
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement
by: Yu, Runyi, et al.
Published: (2024)
by: Yu, Runyi, et al.
Published: (2024)
MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
by: Liu, Penghui, et al.
Published: (2025)
by: Liu, Penghui, et al.
Published: (2025)
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)
by: Wei, Yujie, et al.
Published: (2024)
Demonstrating Record Fidelity for the Quantum Fourier Transform
by: Aumann, Philipp, et al.
Published: (2026)
by: Aumann, Philipp, et al.
Published: (2026)
Motion Diffusion Autoencoders: Enabling Attribute Manipulation in Human Motion Demonstrated on Karate Techniques
by: Richardson, Anthony, et al.
Published: (2025)
by: Richardson, Anthony, et al.
Published: (2025)
AI Spillover is Different: Flat and Lean Firms as Engines of AI Diffusion and Productivity Gain
by: Wang, Xiaoning, et al.
Published: (2025)
by: Wang, Xiaoning, et al.
Published: (2025)
Ingredients: Blending Custom Photos with Video Diffusion Transformers
by: Fei, Zhengcong, et al.
Published: (2025)
by: Fei, Zhengcong, et al.
Published: (2025)
DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation
by: Li, Fu, et al.
Published: (2025)
by: Li, Fu, et al.
Published: (2025)
DreamText: High Fidelity Scene Text Synthesis
by: Wang, Yibin, et al.
Published: (2024)
by: Wang, Yibin, et al.
Published: (2024)
Video Motion Transfer with Diffusion Transformers
by: Pondaven, Alexander, et al.
Published: (2024)
by: Pondaven, Alexander, et al.
Published: (2024)
An improved evolutionary structure optimization method considering stress minimization and smooth design
by: Leijia Wang, et al.
Published: (2024)
by: Leijia Wang, et al.
Published: (2024)
Semantics-Aware Human Motion Generation from Audio Instructions
by: Wang, Zi-An, et al.
Published: (2025)
by: Wang, Zi-An, et al.
Published: (2025)
Grasp as You Dream: Imitating Functional Grasping from Generated Human Demonstrations
by: Tang, Chao, et al.
Published: (2026)
by: Tang, Chao, et al.
Published: (2026)
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
by: Shi, Shuwei, et al.
Published: (2024)
by: Shi, Shuwei, et al.
Published: (2024)
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos
by: Chen, Yushuo, et al.
Published: (2024)
by: Chen, Yushuo, et al.
Published: (2024)
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)
by: Wei, Yujie, et al.
Published: (2026)
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
by: Wang, Xiang, et al.
Published: (2025)
by: Wang, Xiang, et al.
Published: (2025)
Balancing Privacy and Efficiency: Music Information Retrieval via Additive Homomorphic Encryption
by: Wang, William Zerong, et al.
Published: (2025)
by: Wang, William Zerong, et al.
Published: (2025)
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
by: Qiu, Di, et al.
Published: (2025)
by: Qiu, Di, et al.
Published: (2025)
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
by: Wang, Zhouxia, et al.
Published: (2023)
by: Wang, Zhouxia, et al.
Published: (2023)
Similar Items
-
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
by: Luo, Yuxuan, et al.
Published: (2025) -
DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning
by: Luo, Mingshuang, et al.
Published: (2026) -
DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework
by: Zuo, Tongchun, et al.
Published: (2025) -
Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer
by: Shao, Ruizhi, et al.
Published: (2024) -
AlignHuman: Improving Motion and Fidelity via Timestep-Segment Preference Optimization for Audio-Driven Human Animation
by: Liang, Chao, et al.
Published: (2025)