Saved in:
| Main Authors: | Xing, Jiazheng, Du, Fei, Yuan, Hangjie, Liu, Pengwei, Xu, Hongbin, Ci, Hai, Niu, Ruigang, Chen, Weihua, Wang, Fan, Liu, Yong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.20192 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
by: Xing, Jiazheng, et al.
Published: (2026)
by: Xing, Jiazheng, et al.
Published: (2026)
LumosFlow: Motion-Guided Long Video Generation
by: Chen, Jiahao, et al.
Published: (2025)
by: Chen, Jiahao, et al.
Published: (2025)
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
by: Liu, Ropeway, et al.
Published: (2025)
by: Liu, Ropeway, et al.
Published: (2025)
OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
by: Xing, Jiazheng, et al.
Published: (2025)
by: Xing, Jiazheng, et al.
Published: (2025)
Lumos-1: On Autoregressive Video Generation with Discrete Diffusion from a Unified Model Perspective
by: Yuan, Hangjie, et al.
Published: (2025)
by: Yuan, Hangjie, et al.
Published: (2025)
Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
by: Xu, Hongbin, et al.
Published: (2025)
by: Xu, Hongbin, et al.
Published: (2025)
Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs
by: Xing, Jiazheng, et al.
Published: (2026)
by: Xing, Jiazheng, et al.
Published: (2026)
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
by: Zhao, Chengshu, et al.
Published: (2025)
by: Zhao, Chengshu, et al.
Published: (2025)
SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
by: Chen, Shuhang, et al.
Published: (2025)
by: Chen, Shuhang, et al.
Published: (2025)
PAPM: A Physics-aware Proxy Model for Process Systems
by: Liu, Pengwei, et al.
Published: (2024)
by: Liu, Pengwei, et al.
Published: (2024)
Lumos Extrema
by: Moitra, Upamanyu
Published: (2024)
by: Moitra, Upamanyu
Published: (2024)
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale
by: Yang, Pei, et al.
Published: (2025)
by: Yang, Pei, et al.
Published: (2025)
B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding
by: Xiao, Feng, et al.
Published: (2025)
by: Xiao, Feng, et al.
Published: (2025)
Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization
by: Liang, Jingyun, et al.
Published: (2026)
by: Liang, Jingyun, et al.
Published: (2026)
Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
by: Liu, Xinyu, et al.
Published: (2025)
by: Liu, Xinyu, et al.
Published: (2025)
TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On
by: Xing, Jiazheng, et al.
Published: (2024)
by: Xing, Jiazheng, et al.
Published: (2024)
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)
by: Wei, Yujie, et al.
Published: (2026)
AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas
by: Yuan, Longhui
Published: (2026)
by: Yuan, Longhui
Published: (2026)
LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
by: Liu, Lingfeng, et al.
Published: (2024)
by: Liu, Lingfeng, et al.
Published: (2024)
Impossible Videos
by: Bai, Zechen, et al.
Published: (2025)
by: Bai, Zechen, et al.
Published: (2025)
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
by: Chen, Shuhang, et al.
Published: (2025)
by: Chen, Shuhang, et al.
Published: (2025)
Excitonic-Superconducting Coexistence and Emergent Nematic Superconductivity Driven by Spontaneous Symmetry Breaking
by: Yang, Fei, et al.
Published: (2026)
by: Yang, Fei, et al.
Published: (2026)
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
by: Liu, Tao, et al.
Published: (2024)
by: Liu, Tao, et al.
Published: (2024)
RealisID: Scale-Robust and Fine-Controllable Identity Customization via Local and Global Complementation
by: Sun, Zhaoyang, et al.
Published: (2024)
by: Sun, Zhaoyang, et al.
Published: (2024)
Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation
by: Chen, Yingjie, et al.
Published: (2026)
by: Chen, Yingjie, et al.
Published: (2026)
Referring to Any Person
by: Jiang, Qing, et al.
Published: (2025)
by: Jiang, Qing, et al.
Published: (2025)
DreamRelation: Relation-Centric Video Customization
by: Wei, Yujie, et al.
Published: (2025)
by: Wei, Yujie, et al.
Published: (2025)
H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
by: Ci, Hai, et al.
Published: (2025)
by: Ci, Hai, et al.
Published: (2025)
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement
by: Zhong, Zhizhou, et al.
Published: (2025)
by: Zhong, Zhizhou, et al.
Published: (2025)
Enhancing the predictive models for disability in older adults with hypertension: recommendations for future research
by: Ruigang Wei
Published: (2024)
by: Ruigang Wei
Published: (2024)
Exploring Determinants of Institutionalization Among Germany's Oldest Old
by: Ruigang Wei
Published: (2024)
by: Ruigang Wei
Published: (2024)
Assessing the applicability of the D80 + study in different cultural contexts
by: Ruigang Wei
Published: (2024)
by: Ruigang Wei
Published: (2024)
Lumos: Let there be Language Model System Certification
by: Chaudhary, Isha, et al.
Published: (2025)
by: Chaudhary, Isha, et al.
Published: (2025)
An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding
by: Liu, Pengwei, et al.
Published: (2025)
by: Liu, Pengwei, et al.
Published: (2025)
AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
by: Wang, Jiahao, et al.
Published: (2026)
by: Wang, Jiahao, et al.
Published: (2026)
Energetic variational formulation for electrohydrodynamics of surfactant-laden droplets
by: Ji, Hangjie, et al.
Published: (2026)
by: Ji, Hangjie, et al.
Published: (2026)
AnyAct: Towards Human Reenactment of Character Motion From Video
by: Chen, Liuhan, et al.
Published: (2026)
by: Chen, Liuhan, et al.
Published: (2026)
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation
by: Wu, Shengqiong, et al.
Published: (2025)
by: Wu, Shengqiong, et al.
Published: (2025)
Lumos : Empowering Multimodal LLMs with Scene Text Recognition
by: Shenoy, Ashish, et al.
Published: (2024)
by: Shenoy, Ashish, et al.
Published: (2024)
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
by: Deng, Huiqi, et al.
Published: (2025)
by: Deng, Huiqi, et al.
Published: (2025)
Similar Items
-
Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
by: Xing, Jiazheng, et al.
Published: (2026) -
LumosFlow: Motion-Guided Long Video Generation
by: Chen, Jiahao, et al.
Published: (2025) -
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
by: Liu, Ropeway, et al.
Published: (2025) -
OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
by: Xing, Jiazheng, et al.
Published: (2025) -
Lumos-1: On Autoregressive Video Generation with Discrete Diffusion from a Unified Model Perspective
by: Yuan, Hangjie, et al.
Published: (2025)