Saved in:
| Main Authors: | Chen, Jiahao, Yuan, Hangjie, Qian, Yichen, Liang, Jingyun, Xing, Jiazheng, Liu, Pengwei, Chen, Weihua, Wang, Fan, Su, Bing |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.02497 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
by: Xing, Jiazheng, et al.
Published: (2026)
by: Xing, Jiazheng, et al.
Published: (2026)
Lumos-1: On Autoregressive Video Generation with Discrete Diffusion from a Unified Model Perspective
by: Yuan, Hangjie, et al.
Published: (2025)
by: Yuan, Hangjie, et al.
Published: (2025)
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
by: Liu, Ropeway, et al.
Published: (2025)
by: Liu, Ropeway, et al.
Published: (2025)
Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
by: Xing, Jiazheng, et al.
Published: (2026)
by: Xing, Jiazheng, et al.
Published: (2026)
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
by: Liang, Jingyun, et al.
Published: (2025)
by: Liang, Jingyun, et al.
Published: (2025)
Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization
by: Liang, Jingyun, et al.
Published: (2026)
by: Liang, Jingyun, et al.
Published: (2026)
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
by: Chen, Shuhang, et al.
Published: (2025)
by: Chen, Shuhang, et al.
Published: (2025)
Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs
by: Xing, Jiazheng, et al.
Published: (2026)
by: Xing, Jiazheng, et al.
Published: (2026)
SciLT: Long-tailed Image Classification under Scientific Image Domains
by: Chen, Jiahao, et al.
Published: (2026)
by: Chen, Jiahao, et al.
Published: (2026)
MoVideo: Motion-Aware Video Generation with Diffusion Models
by: Liang, Jingyun, et al.
Published: (2023)
by: Liang, Jingyun, et al.
Published: (2023)
Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
by: Liu, Xinyu, et al.
Published: (2025)
by: Liu, Xinyu, et al.
Published: (2025)
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
by: Huang, Zhiyuan, et al.
Published: (2025)
by: Huang, Zhiyuan, et al.
Published: (2025)
SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
by: Chen, Shuhang, et al.
Published: (2025)
by: Chen, Shuhang, et al.
Published: (2025)
HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
by: Gan, Qijun, et al.
Published: (2025)
by: Gan, Qijun, et al.
Published: (2025)
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
by: Cao, Chenjie, et al.
Published: (2025)
by: Cao, Chenjie, et al.
Published: (2025)
Rethinking the Bias of Foundation Model under Long-tailed Distribution
by: Chen, Jiahao, et al.
Published: (2025)
by: Chen, Jiahao, et al.
Published: (2025)
PanFlow: Decoupled Motion Control for Panoramic Video Generation
by: Zhang, Cheng, et al.
Published: (2025)
by: Zhang, Cheng, et al.
Published: (2025)
FlexiFilm: Long Video Generation with Flexible Conditions
by: Ouyang, Yichen, et al.
Published: (2024)
by: Ouyang, Yichen, et al.
Published: (2024)
DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing
by: Cai, Lingling, et al.
Published: (2025)
by: Cai, Lingling, et al.
Published: (2025)
Motion Semantics Guided Normalizing Flow for Privacy-Preserving Video Anomaly Detection
by: Liu, Yang, et al.
Published: (2026)
by: Liu, Yang, et al.
Published: (2026)
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)
by: Wei, Yujie, et al.
Published: (2026)
Domain-adaptive and Subgroup-specific Cascaded Temperature Regression for Out-of-distribution Calibration
by: Wang, Jiexin, et al.
Published: (2024)
by: Wang, Jiexin, et al.
Published: (2024)
EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer
by: Yang, Yuxiao, et al.
Published: (2025)
by: Yang, Yuxiao, et al.
Published: (2025)
FlowMotion: Training-Free Flow Guidance for Video Motion Transfer
by: Wang, Zhen, et al.
Published: (2026)
by: Wang, Zhen, et al.
Published: (2026)
T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy
by: Duan, Fan, et al.
Published: (2024)
by: Duan, Fan, et al.
Published: (2024)
LTCA: Long-range Temporal Context Attention for Referring Video Object Segmentation
by: Yan, Cilin, et al.
Published: (2025)
by: Yan, Cilin, et al.
Published: (2025)
Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
by: Wang, Yiming, et al.
Published: (2026)
by: Wang, Yiming, et al.
Published: (2026)
AMG: Avatar Motion Guided Video Generation
by: Yang, Zhangsihao, et al.
Published: (2024)
by: Yang, Zhangsihao, et al.
Published: (2024)
Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation
by: Xu, Jiahao, et al.
Published: (2026)
by: Xu, Jiahao, et al.
Published: (2026)
MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation
by: Liu, Yanchen, et al.
Published: (2025)
by: Liu, Yanchen, et al.
Published: (2025)
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)
by: Wei, Yujie, et al.
Published: (2024)
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation
by: Gao, Jianxiong, et al.
Published: (2025)
by: Gao, Jianxiong, et al.
Published: (2025)
MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
by: Li, Xinyang, et al.
Published: (2025)
by: Li, Xinyang, et al.
Published: (2025)
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
by: Zhao, Canyu, et al.
Published: (2024)
by: Zhao, Canyu, et al.
Published: (2024)
Decentralized Gossip Mutual Learning (GML) for brain tumor segmentation on multi-parametric MRI
by: Chen, Jingyun, et al.
Published: (2024)
by: Chen, Jingyun, et al.
Published: (2024)
Flow-Guided Diffusion for Video Inpainting
by: Gu, Bohai, et al.
Published: (2023)
by: Gu, Bohai, et al.
Published: (2023)
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
by: Zhang, Xinyao, et al.
Published: (2026)
by: Zhang, Xinyao, et al.
Published: (2026)
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation
by: Jin, Peng, et al.
Published: (2024)
by: Jin, Peng, et al.
Published: (2024)
Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models
by: Liu, Huijie, et al.
Published: (2025)
by: Liu, Huijie, et al.
Published: (2025)
VideoMAR: Autoregressive Video Generatio with Continuous Tokens
by: Yu, Hu, et al.
Published: (2025)
by: Yu, Hu, et al.
Published: (2025)
Similar Items
-
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
by: Xing, Jiazheng, et al.
Published: (2026) -
Lumos-1: On Autoregressive Video Generation with Discrete Diffusion from a Unified Model Perspective
by: Yuan, Hangjie, et al.
Published: (2025) -
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
by: Liu, Ropeway, et al.
Published: (2025) -
Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
by: Xing, Jiazheng, et al.
Published: (2026) -
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
by: Liang, Jingyun, et al.
Published: (2025)