| Main Authors: | Liu, Zichen, Meng, Yihao, Ouyang, Hao, Yu, Yue, Zhao, Bolin, Cohen-Or, Daniel, Qu, Huamin |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.11614 |
Similar Items
CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives
by: Meng, Yihao, et al.
Published: (2026)
Bring Your Dreams to Life: Continual Text-to-Video Customization
by: Dong, Jiahua, et al.
Published: (2025)
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
by: Meng, Yihao, et al.
Published: (2025)
Animated Stickers: Bringing Stickers to Life with Video Diffusion
by: Yan, David, et al.
Published: (2024)
Kinetic Typography Diffusion Model
by: Park, Seonmi, et al.
Published: (2024)
AniDoc: Animation Creation Made Easier
by: Meng, Yihao, et al.
Published: (2024)
WordCon: Word-level Typography Control in Scene Text Rendering
by: Shi, Wenda, et al.
Published: (2025)
Generative Neural Video Compression via Video Diffusion Prior
by: Mao, Qi, et al.
Published: (2025)
Diffusion Models Need Visual Priors for Image Generation
by: Yue, Xiaoyu, et al.
Published: (2024)
FonTS: Text Rendering with Typography and Style Controls
by: Shi, Wenda, et al.
Published: (2024)
VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
by: Feng, Kailai, et al.
Published: (2024)
Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation
by: Cai, Xinhao, et al.
Published: (2026)
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
by: Liu, Zexiang, et al.
Published: (2023)
FaceShot: Bring Any Character into Life
by: Gao, Junyao, et al.
Published: (2025)
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
by: Tang, Zichen, et al.
Published: (2025)
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation
by: Bai, Yuhang, et al.
Published: (2024)
Bring the Power of Diffusion Model to Defect Detection
by: Yu, Xuyi
Published: (2024)
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
by: Qu, Yunpeng, et al.
Published: (2024)
Diffusion Priors for Dynamic View Synthesis from Monocular Videos
by: Wang, Chaoyang, et al.
Published: (2024)
Learning Temporally Consistent Video Depth from Video Diffusion Priors
by: Shao, Jiahao, et al.
Published: (2024)
Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models
by: Zhou, Heng, et al.
Published: (2026)
Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video
by: Shih, Meng-Li, et al.
Published: (2025)
The Dynamic Prior: Understanding 3D Structures for Casual Dynamic Videos
by: Wu, Zhuoyuan, et al.
Published: (2025)
Calligrapher: Freestyle Text Image Customization
by: Ma, Yue, et al.
Published: (2025)
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
by: Zeng, Weichao, et al.
Published: (2024)
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
by: Liu, Zhiheng, et al.
Published: (2024)
Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior
by: Liu, Zhanwen, et al.
Published: (2024)
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
by: He, Hao, et al.
Published: (2025)
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
by: Chen, Houyuan, et al.
Published: (2026)
Video Deblurring by Sharpness Prior Detection and Edge Information
by: Tian, Yang, et al.
Published: (2025)
Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving with Typography
by: Chung, Nhat, et al.
Published: (2024)
WordCraft: Interactive Artistic Typography with Attention Awareness and Noise Blending
by: Wang, Zhe, et al.
Published: (2025)
Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems
by: Reddy, Manognya Lokesh, et al.
Published: (2026)
Style Customization of Text-to-Vector Generation with Image Diffusion Priors
by: Zhang, Peiying, et al.
Published: (2025)
Unfolding Videos Dynamics via Taylor Expansion
by: Chen, Siyi, et al.
Published: (2024)
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text
by: Wang, Hanlin, et al.
Published: (2025)
4Dynamic: Text-to-4D Generation with Hybrid Priors
by: Yuan, Yu-Jie, et al.
Published: (2024)
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
by: Guo, Yuwei, et al.
Published: (2025)
Single-Shot HDR Recovery via a Video Diffusion Prior
by: Talegaonkar, Chinmay, et al.
Published: (2026)
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
by: Fei, Hao, et al.
Published: (2023)