:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xing, Jiazheng, Du, Fei, Yuan, Hangjie, Liu, Pengwei, Xu, Hongbin, Ci, Hai, Niu, Ruigang, Chen, Weihua, Wang, Fan, Liu, Yong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.20192
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
by: Xing, Jiazheng, et al.
Published: (2026)

LumosFlow: Motion-Guided Long Video Generation
by: Chen, Jiahao, et al.
Published: (2025)

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
by: Liu, Ropeway, et al.
Published: (2025)

OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
by: Xing, Jiazheng, et al.
Published: (2025)

Lumos-1: On Autoregressive Video Generation with Discrete Diffusion from a Unified Model Perspective
by: Yuan, Hangjie, et al.
Published: (2025)

Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
by: Xu, Hongbin, et al.
Published: (2025)

Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs
by: Xing, Jiazheng, et al.
Published: (2026)

SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
by: Zhao, Chengshu, et al.
Published: (2025)

SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
by: Chen, Shuhang, et al.
Published: (2025)

PAPM: A Physics-aware Proxy Model for Process Systems
by: Liu, Pengwei, et al.
Published: (2024)

Lumos Extrema
by: Moitra, Upamanyu
Published: (2024)

X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale
by: Yang, Pei, et al.
Published: (2025)

B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding
by: Xiao, Feng, et al.
Published: (2025)

Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization
by: Liang, Jingyun, et al.
Published: (2026)

Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
by: Liu, Xinyu, et al.
Published: (2025)

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On
by: Xing, Jiazheng, et al.
Published: (2024)

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)

AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas
by: Yuan, Longhui
Published: (2026)

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
by: Liu, Lingfeng, et al.
Published: (2024)

Impossible Videos
by: Bai, Zechen, et al.
Published: (2025)

MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
by: Chen, Shuhang, et al.
Published: (2025)

Excitonic-Superconducting Coexistence and Emergent Nematic Superconductivity Driven by Spontaneous Symmetry Breaking
by: Yang, Fei, et al.
Published: (2026)

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
by: Liu, Tao, et al.
Published: (2024)

RealisID: Scale-Robust and Fine-Controllable Identity Customization via Local and Global Complementation
by: Sun, Zhaoyang, et al.
Published: (2024)

Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation
by: Chen, Yingjie, et al.
Published: (2026)

Referring to Any Person
by: Jiang, Qing, et al.
Published: (2025)

DreamRelation: Relation-Centric Video Customization
by: Wei, Yujie, et al.
Published: (2025)

H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
by: Ci, Hai, et al.
Published: (2025)

AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement
by: Zhong, Zhizhou, et al.
Published: (2025)

Enhancing the predictive models for disability in older adults with hypertension: recommendations for future research
by: Ruigang Wei
Published: (2024)

Exploring Determinants of Institutionalization Among Germany's Oldest Old
by: Ruigang Wei
Published: (2024)

Assessing the applicability of the D80 + study in different cultural contexts
by: Ruigang Wei
Published: (2024)

Lumos: Let there be Language Model System Certification
by: Chaudhary, Isha, et al.
Published: (2025)

An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding
by: Liu, Pengwei, et al.
Published: (2025)

AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
by: Wang, Jiahao, et al.
Published: (2026)

Energetic variational formulation for electrohydrodynamics of surfactant-laden droplets
by: Ji, Hangjie, et al.
Published: (2026)

AnyAct: Towards Human Reenactment of Character Motion From Video
by: Chen, Liuhan, et al.
Published: (2026)

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation
by: Wu, Shengqiong, et al.
Published: (2025)

Lumos : Empowering Multimodal LLMs with Scene Text Recognition
by: Shenoy, Ashish, et al.
Published: (2024)

Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
by: Deng, Huiqi, et al.
Published: (2025)