:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Jiahao, Yuan, Hangjie, Qian, Yichen, Liang, Jingyun, Xing, Jiazheng, Liu, Pengwei, Chen, Weihua, Wang, Fan, Su, Bing
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.02497
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
by: Xing, Jiazheng, et al.
Published: (2026)

Lumos-1: On Autoregressive Video Generation with Discrete Diffusion from a Unified Model Perspective
by: Yuan, Hangjie, et al.
Published: (2025)

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
by: Liu, Ropeway, et al.
Published: (2025)

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
by: Xing, Jiazheng, et al.
Published: (2026)

RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
by: Liang, Jingyun, et al.
Published: (2025)

Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization
by: Liang, Jingyun, et al.
Published: (2026)

MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
by: Chen, Shuhang, et al.
Published: (2025)

Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs
by: Xing, Jiazheng, et al.
Published: (2026)

SciLT: Long-tailed Image Classification under Scientific Image Domains
by: Chen, Jiahao, et al.
Published: (2026)

MoVideo: Motion-Aware Video Generation with Diffusion Models
by: Liang, Jingyun, et al.
Published: (2023)

Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
by: Liu, Xinyu, et al.
Published: (2025)

LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
by: Huang, Zhiyuan, et al.
Published: (2025)

SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
by: Chen, Shuhang, et al.
Published: (2025)

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
by: Gan, Qijun, et al.
Published: (2025)

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
by: Cao, Chenjie, et al.
Published: (2025)

Rethinking the Bias of Foundation Model under Long-tailed Distribution
by: Chen, Jiahao, et al.
Published: (2025)

PanFlow: Decoupled Motion Control for Panoramic Video Generation
by: Zhang, Cheng, et al.
Published: (2025)

FlexiFilm: Long Video Generation with Flexible Conditions
by: Ouyang, Yichen, et al.
Published: (2024)

DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing
by: Cai, Lingling, et al.
Published: (2025)

Motion Semantics Guided Normalizing Flow for Privacy-Preserving Video Anomaly Detection
by: Liu, Yang, et al.
Published: (2026)

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)

Domain-adaptive and Subgroup-specific Cascaded Temperature Regression for Out-of-distribution Calibration
by: Wang, Jiexin, et al.
Published: (2024)

EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer
by: Yang, Yuxiao, et al.
Published: (2025)

FlowMotion: Training-Free Flow Guidance for Video Motion Transfer
by: Wang, Zhen, et al.
Published: (2026)

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy
by: Duan, Fan, et al.
Published: (2024)

LTCA: Long-range Temporal Context Attention for Referring Video Object Segmentation
by: Yan, Cilin, et al.
Published: (2025)

Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
by: Wang, Yiming, et al.
Published: (2026)

AMG: Avatar Motion Guided Video Generation
by: Yang, Zhangsihao, et al.
Published: (2024)

Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation
by: Xu, Jiahao, et al.
Published: (2026)

MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation
by: Liu, Yanchen, et al.
Published: (2025)

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation
by: Gao, Jianxiong, et al.
Published: (2025)

MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
by: Li, Xinyang, et al.
Published: (2025)

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
by: Zhao, Canyu, et al.
Published: (2024)

Decentralized Gossip Mutual Learning (GML) for brain tumor segmentation on multi-parametric MRI
by: Chen, Jingyun, et al.
Published: (2024)

Flow-Guided Diffusion for Video Inpainting
by: Gu, Bohai, et al.
Published: (2023)

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
by: Zhang, Xinyao, et al.
Published: (2026)

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation
by: Jin, Peng, et al.
Published: (2024)

Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models
by: Liu, Huijie, et al.
Published: (2025)

VideoMAR: Autoregressive Video Generatio with Continuous Tokens
by: Yu, Hu, et al.
Published: (2025)