:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Lizhen, Xia, Zhurong, Hu, Tianshu, Wang, Pengrui, Wei, Pengfei, Zheng, Zerong, Zhou, Ming, Zhang, Yuan, Gao, Mingyuan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2506.10568
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
by: Luo, Yuxuan, et al.
Published: (2025)

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning
by: Luo, Mingshuang, et al.
Published: (2026)

DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework
by: Zuo, Tongchun, et al.
Published: (2025)

Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer
by: Shao, Ruizhi, et al.
Published: (2024)

AlignHuman: Improving Motion and Fidelity via Timestep-Segment Preference Optimization for Audio-Driven Human Animation
by: Liang, Chao, et al.
Published: (2025)

FlowAct-R1: Towards Interactive Humanoid Video Generation
by: Wang, Lizhen, et al.
Published: (2026)

Animatable and Relightable Gaussians for High-fidelity Human Avatar Modeling
by: Li, Zhe, et al.
Published: (2023)

Towards Imbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation
by: Yu, Tianshu, et al.
Published: (2023)

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions
by: Wang, Zhenzhi, et al.
Published: (2025)

OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
by: Jiang, Jianwen, et al.
Published: (2025)

DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
by: Guo, Xu, et al.
Published: (2026)

EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer
by: Yang, Yuxiao, et al.
Published: (2025)

DreamShot: Personalized Storyboard Synthesis with Video Diffusion Prior
by: Huang, Junjia, et al.
Published: (2026)

DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
by: Wang, Cong, et al.
Published: (2023)

RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse
by: Liao, Zhouyingcheng, et al.
Published: (2024)

PP-Motion: Physical-Perceptual Fidelity Evaluation for Human Motion Generation
by: Zhao, Sihan, et al.
Published: (2025)

Video Diffusion Transformers are In-Context Learners
by: Fei, Zhengcong, et al.
Published: (2024)

Structural Analysis of Phosphorus and Arsenic Clusters: A Comparative DFT and MP2 Study
by: Zerong Daniel Wang
Published: (2025)

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
by: Gan, Qijun, et al.
Published: (2025)

DreamFuse: Adaptive Image Fusion with Diffusion Transformer
by: Huang, Junjia, et al.
Published: (2025)

Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement
by: Yu, Runyi, et al.
Published: (2024)

MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
by: Liu, Penghui, et al.
Published: (2025)

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)

Demonstrating Record Fidelity for the Quantum Fourier Transform
by: Aumann, Philipp, et al.
Published: (2026)

Motion Diffusion Autoencoders: Enabling Attribute Manipulation in Human Motion Demonstrated on Karate Techniques
by: Richardson, Anthony, et al.
Published: (2025)

AI Spillover is Different: Flat and Lean Firms as Engines of AI Diffusion and Productivity Gain
by: Wang, Xiaoning, et al.
Published: (2025)

Ingredients: Blending Custom Photos with Video Diffusion Transformers
by: Fei, Zhengcong, et al.
Published: (2025)

DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation
by: Li, Fu, et al.
Published: (2025)

DreamText: High Fidelity Scene Text Synthesis
by: Wang, Yibin, et al.
Published: (2024)

Video Motion Transfer with Diffusion Transformers
by: Pondaven, Alexander, et al.
Published: (2024)

An improved evolutionary structure optimization method considering stress minimization and smooth design
by: Leijia Wang, et al.
Published: (2024)

Semantics-Aware Human Motion Generation from Audio Instructions
by: Wang, Zi-An, et al.
Published: (2025)

Grasp as You Dream: Imitating Functional Grasping from Generated Human Demonstrations
by: Tang, Chao, et al.
Published: (2026)

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
by: Shi, Shuwei, et al.
Published: (2024)

MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos
by: Chen, Yushuo, et al.
Published: (2024)

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
by: Wang, Xiang, et al.
Published: (2025)

Balancing Privacy and Efficiency: Music Information Retrieval via Additive Homomorphic Encryption
by: Wang, William Zerong, et al.
Published: (2025)

SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
by: Qiu, Di, et al.
Published: (2025)

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
by: Wang, Zhouxia, et al.
Published: (2023)