:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Zichen, Meng, Yihao, Ouyang, Hao, Yu, Yue, Zhao, Bolin, Cohen-Or, Daniel, Qu, Huamin
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.11614
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives
by: Meng, Yihao, et al.
Published: (2026)

Bring Your Dreams to Life: Continual Text-to-Video Customization
by: Dong, Jiahua, et al.
Published: (2025)

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
by: Meng, Yihao, et al.
Published: (2025)

Animated Stickers: Bringing Stickers to Life with Video Diffusion
by: Yan, David, et al.
Published: (2024)

Kinetic Typography Diffusion Model
by: Park, Seonmi, et al.
Published: (2024)

AniDoc: Animation Creation Made Easier
by: Meng, Yihao, et al.
Published: (2024)

WordCon: Word-level Typography Control in Scene Text Rendering
by: Shi, Wenda, et al.
Published: (2025)

Generative Neural Video Compression via Video Diffusion Prior
by: Mao, Qi, et al.
Published: (2025)

Diffusion Models Need Visual Priors for Image Generation
by: Yue, Xiaoyu, et al.
Published: (2024)

FonTS: Text Rendering with Typography and Style Controls
by: Shi, Wenda, et al.
Published: (2024)

VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
by: Feng, Kailai, et al.
Published: (2024)

Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation
by: Cai, Xinhao, et al.
Published: (2026)

UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
by: Liu, Zexiang, et al.
Published: (2023)

FaceShot: Bring Any Character into Life
by: Gao, Junyao, et al.
Published: (2025)

GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
by: Tang, Zichen, et al.
Published: (2025)

Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation
by: Bai, Yuhang, et al.
Published: (2024)

Bring the Power of Diffusion Model to Defect Detection
by: Yu, Xuyi
Published: (2024)

XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
by: Qu, Yunpeng, et al.
Published: (2024)

Diffusion Priors for Dynamic View Synthesis from Monocular Videos
by: Wang, Chaoyang, et al.
Published: (2024)

Learning Temporally Consistent Video Depth from Video Diffusion Priors
by: Shao, Jiahao, et al.
Published: (2024)

Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models
by: Zhou, Heng, et al.
Published: (2026)

Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video
by: Shih, Meng-Li, et al.
Published: (2025)

The Dynamic Prior: Understanding 3D Structures for Casual Dynamic Videos
by: Wu, Zhuoyuan, et al.
Published: (2025)

Calligrapher: Freestyle Text Image Customization
by: Ma, Yue, et al.
Published: (2025)

TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
by: Zeng, Weichao, et al.
Published: (2024)

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
by: Liu, Zhiheng, et al.
Published: (2024)

Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior
by: Liu, Zhanwen, et al.
Published: (2024)

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
by: He, Hao, et al.
Published: (2025)

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
by: Chen, Houyuan, et al.
Published: (2026)

Video Deblurring by Sharpness Prior Detection and Edge Information
by: Tian, Yang, et al.
Published: (2025)

Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving with Typography
by: Chung, Nhat, et al.
Published: (2024)

WordCraft: Interactive Artistic Typography with Attention Awareness and Noise Blending
by: Wang, Zhe, et al.
Published: (2025)

Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems
by: Reddy, Manognya Lokesh, et al.
Published: (2026)

Style Customization of Text-to-Vector Generation with Image Diffusion Priors
by: Zhang, Peiying, et al.
Published: (2025)

Unfolding Videos Dynamics via Taylor Expansion
by: Chen, Siyi, et al.
Published: (2024)

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text
by: Wang, Hanlin, et al.
Published: (2025)

4Dynamic: Text-to-4D Generation with Hybrid Priors
by: Yuan, Yu-Jie, et al.
Published: (2024)

End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
by: Guo, Yuwei, et al.
Published: (2025)

Single-Shot HDR Recovery via a Video Diffusion Prior
by: Talegaonkar, Chinmay, et al.
Published: (2026)

Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
by: Fei, Hao, et al.
Published: (2023)