Saved in:
| Main Authors: | Shen, Fei, Ye, Hu, Liu, Sibo, Zhang, Jun, Wang, Cong, Han, Xiao, Yang, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.02482 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
by: Shen, Fei, et al.
Published: (2023)
by: Shen, Fei, et al.
Published: (2023)
StoryGPT-V: Large Language Models as Consistent Story Visualizers
by: Shen, Xiaoqian, et al.
Published: (2023)
by: Shen, Xiaoqian, et al.
Published: (2023)
Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation
by: Mousavi, Seyed Mohammad, et al.
Published: (2025)
by: Mousavi, Seyed Mohammad, et al.
Published: (2025)
ReDiStory: Region-Disentangled Diffusion for Consistent Visual Story Generation
by: Sarkar, Ayushman, et al.
Published: (2026)
by: Sarkar, Ayushman, et al.
Published: (2026)
Object Isolated Attention for Consistent Story Visualization
by: Luo, Xiangyang, et al.
Published: (2025)
by: Luo, Xiangyang, et al.
Published: (2025)
StoryTailor:A Zero-Shot Pipeline for Action-Rich Multi-Subject Visual Narratives
by: Hu, Jinghao, et al.
Published: (2026)
by: Hu, Jinghao, et al.
Published: (2026)
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
by: He, Huiguo, et al.
Published: (2024)
by: He, Huiguo, et al.
Published: (2024)
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context
by: Zheng, Sixiao, et al.
Published: (2024)
by: Zheng, Sixiao, et al.
Published: (2024)
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models
by: Ye, Zixuan, et al.
Published: (2025)
by: Ye, Zixuan, et al.
Published: (2025)
Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models
by: Akdemir, Kiymet, et al.
Published: (2025)
by: Akdemir, Kiymet, et al.
Published: (2025)
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping
by: Gao, Junyao, et al.
Published: (2026)
by: Gao, Junyao, et al.
Published: (2026)
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
by: Dong, Sibo, et al.
Published: (2025)
by: Dong, Sibo, et al.
Published: (2025)
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
by: Wang, Cong, et al.
Published: (2024)
by: Wang, Cong, et al.
Published: (2024)
SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency
by: Song, Quanjian, et al.
Published: (2025)
by: Song, Quanjian, et al.
Published: (2025)
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
by: Ren, Weiming, et al.
Published: (2024)
by: Ren, Weiming, et al.
Published: (2024)
HDR Reconstruction Boosting with Training-Free and Exposure-Consistent Diffusion
by: Lin, Yo-Tin, et al.
Published: (2026)
by: Lin, Yo-Tin, et al.
Published: (2026)
Ensembling Diffusion Models via Adaptive Feature Aggregation
by: Wang, Cong, et al.
Published: (2024)
by: Wang, Cong, et al.
Published: (2024)
Generative Compositor for Few-Shot Visual Information Extraction
by: Yang, Zhibo, et al.
Published: (2025)
by: Yang, Zhibo, et al.
Published: (2025)
Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model
by: Shen, Fei, et al.
Published: (2025)
by: Shen, Fei, et al.
Published: (2025)
Few-shot Defect Image Generation based on Consistency Modeling
by: Shi, Qingfeng, et al.
Published: (2024)
by: Shi, Qingfeng, et al.
Published: (2024)
D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples
by: Hu, Zijing, et al.
Published: (2025)
by: Hu, Zijing, et al.
Published: (2025)
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis
by: Wang, Yuanrui, et al.
Published: (2025)
by: Wang, Yuanrui, et al.
Published: (2025)
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
by: Zhuang, Cailin, et al.
Published: (2025)
by: Zhuang, Cailin, et al.
Published: (2025)
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction
by: Tian, Sibo, et al.
Published: (2024)
by: Tian, Sibo, et al.
Published: (2024)
LTRL: Boosting Long-tail Recognition via Reflective Learning
by: Zhao, Qihao, et al.
Published: (2024)
by: Zhao, Qihao, et al.
Published: (2024)
Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning
by: Zuo, Yushen, et al.
Published: (2024)
by: Zuo, Yushen, et al.
Published: (2024)
Relation-Rich Visual Document Generator for Visual Information Extraction
by: Jiang, Zi-Han, et al.
Published: (2025)
by: Jiang, Zi-Han, et al.
Published: (2025)
Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion
by: Yang, Fan, et al.
Published: (2024)
by: Yang, Fan, et al.
Published: (2024)
Semantically Consistent Video Inpainting with Conditional Diffusion Models
by: Green, Dylan, et al.
Published: (2024)
by: Green, Dylan, et al.
Published: (2024)
ACT-Diffusion: Efficient Adversarial Consistency Training for One-step Diffusion Models
by: Kong, Fei, et al.
Published: (2023)
by: Kong, Fei, et al.
Published: (2023)
One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
by: Sun, Yujing, et al.
Published: (2025)
by: Sun, Yujing, et al.
Published: (2025)
DCI: Dual-Conditional Inversion for Boosting Diffusion-Based Image Editing
by: Li, Zixiang, et al.
Published: (2025)
by: Li, Zixiang, et al.
Published: (2025)
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models
by: Yang, Hongji, et al.
Published: (2025)
by: Yang, Hongji, et al.
Published: (2025)
Generalized Visual Relation Detection with Diffusion Models
by: Gao, Kaifeng, et al.
Published: (2025)
by: Gao, Kaifeng, et al.
Published: (2025)
Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency
by: Song, Bowen, et al.
Published: (2023)
by: Song, Bowen, et al.
Published: (2023)
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
by: Zhou, Yupeng, et al.
Published: (2024)
by: Zhou, Yupeng, et al.
Published: (2024)
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
by: Mallick, Rupayan, et al.
Published: (2025)
by: Mallick, Rupayan, et al.
Published: (2025)
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
by: Deng, Junyuan, et al.
Published: (2024)
by: Deng, Junyuan, et al.
Published: (2024)
Contextualized Visual Personalization in Vision-Language Models
by: Oh, Yeongtak, et al.
Published: (2026)
by: Oh, Yeongtak, et al.
Published: (2026)
ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search
by: Liu, Zhenjie, et al.
Published: (2025)
by: Liu, Zhenjie, et al.
Published: (2025)
Similar Items
-
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
by: Shen, Fei, et al.
Published: (2023) -
StoryGPT-V: Large Language Models as Consistent Story Visualizers
by: Shen, Xiaoqian, et al.
Published: (2023) -
Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation
by: Mousavi, Seyed Mohammad, et al.
Published: (2025) -
ReDiStory: Region-Disentangled Diffusion for Consistent Visual Story Generation
by: Sarkar, Ayushman, et al.
Published: (2026) -
Object Isolated Attention for Consistent Story Visualization
by: Luo, Xiangyang, et al.
Published: (2025)