:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kwon, Patrick, Chen, Chen
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.01686
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing
by: Wang, Weitao, et al.
Published: (2025)

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
by: Wang, Zhao, et al.
Published: (2024)

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
by: He, Huiguo, et al.
Published: (2024)

DreamVAR: Taming Reinforced Visual Autoregressive Model for High-Fidelity Subject-Driven Image Generation
by: Jiang, Xin, et al.
Published: (2026)

StoryTailor:A Zero-Shot Pipeline for Action-Rich Multi-Subject Visual Narratives
by: Hu, Jinghao, et al.
Published: (2026)

DreamRelation: Relation-Centric Video Customization
by: Wei, Yujie, et al.
Published: (2025)

VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning on Language-Video Foundation Models
by: Chen, Hong, et al.
Published: (2023)

StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
by: Hu, Panwen, et al.
Published: (2024)

DreamJourney: Perpetual View Generation with Video Diffusion Models
by: Pan, Bo, et al.
Published: (2025)

DreamRelation: Bridging Customization and Relation Generation
by: Shi, Qingyu, et al.
Published: (2024)

Bring Your Dreams to Life: Continual Text-to-Video Customization
by: Dong, Jiahua, et al.
Published: (2025)

SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
by: Xu, Xuancheng, et al.
Published: (2025)

Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation
by: Ma, Ao, et al.
Published: (2025)

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
by: Ren, Yixuan, et al.
Published: (2024)

Making Your Dreams A Reality: Decoding the Dreams into a Coherent Video Story from fMRI Signals
by: Fu, Yanwei, et al.
Published: (2025)

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
by: Huang, Chi-Pin, et al.
Published: (2025)

FactorizedHMR: A Hybrid Framework for Video Human Mesh Recovery
by: Kwon, Patrick, et al.
Published: (2026)

Towards Long Video Understanding via Fine-detailed Video Story Generation
by: You, Zeng, et al.
Published: (2024)

LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction
by: Zhang, Maoquan, et al.
Published: (2025)

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
by: Zhang, Jinlu, et al.
Published: (2024)

CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
by: Chen, Nan, et al.
Published: (2024)

DreamO: A Unified Framework for Image Customization
by: Mou, Chong, et al.
Published: (2025)

OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions
by: Cai, Yuanhao, et al.
Published: (2025)

DreamWorld: Unified World Modeling in Video Generation
by: Tan, Boming, et al.
Published: (2026)

ReDiStory: Region-Disentangled Diffusion for Consistent Visual Story Generation
by: Sarkar, Ayushman, et al.
Published: (2026)

DreamRunner: Fine-Grained Compositional Story-to-Video Generation with Retrieval-Augmented Motion Adaptation
by: Wang, Zun, et al.
Published: (2024)

Zooming into Comics: Region-Aware RL Improves Fine-Grained Comic Understanding in Vision-Language Models
by: Chen, Yule, et al.
Published: (2025)

DreamFrame: Enhancing Video Understanding via Automatically Generated QA and Style-Consistent Keyframes
by: Song, Zhende, et al.
Published: (2024)

AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation
by: He, Junjie, et al.
Published: (2025)

SEED-Story: Multimodal Long Story Generation with Large Language Model
by: Yang, Shuai, et al.
Published: (2024)

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner
by: Zhou, Yufan, et al.
Published: (2024)

Manga Generation via Layout-controllable Diffusion
by: Chen, Siyu, et al.
Published: (2024)

Retrieval Augmented Comic Image Generation
by: Shui, Yunhao, et al.
Published: (2025)

DreamStyle: A Unified Framework for Video Stylization
by: Li, Mengtian, et al.
Published: (2026)

DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models
by: Xie, Zhenyu, et al.
Published: (2024)

SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation
by: Tan, Shuai, et al.
Published: (2025)

DreamDA: Generative Data Augmentation with Diffusion Models
by: Fu, Yunxiang, et al.
Published: (2024)

Still-Moving: Customized Video Generation without Customized Video Data
by: Chefer, Hila, et al.
Published: (2024)