Guardado en:
| Autores principales: | Mousavi, Seyed Mohammad, Analoui, Morteza |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2510.13787 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
por: Shen, Fei, et al.
Publicado: (2024)
por: Shen, Fei, et al.
Publicado: (2024)
ReDiStory: Region-Disentangled Diffusion for Consistent Visual Story Generation
por: Sarkar, Ayushman, et al.
Publicado: (2026)
por: Sarkar, Ayushman, et al.
Publicado: (2026)
Synthetic Data Generation for Emotional Depth Faces: Optimizing Conditional DCGANs via Genetic Algorithms in the Latent Space and Stabilizing Training with Knowledge Distillation
por: Mousavi, Seyed Muhammad Hossein, et al.
Publicado: (2025)
por: Mousavi, Seyed Muhammad Hossein, et al.
Publicado: (2025)
StoryGPT-V: Large Language Models as Consistent Story Visualizers
por: Shen, Xiaoqian, et al.
Publicado: (2023)
por: Shen, Xiaoqian, et al.
Publicado: (2023)
Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models
por: Akdemir, Kiymet, et al.
Publicado: (2025)
por: Akdemir, Kiymet, et al.
Publicado: (2025)
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
por: He, Huiguo, et al.
Publicado: (2024)
por: He, Huiguo, et al.
Publicado: (2024)
Semantically Consistent Video Inpainting with Conditional Diffusion Models
por: Green, Dylan, et al.
Publicado: (2024)
por: Green, Dylan, et al.
Publicado: (2024)
Object Isolated Attention for Consistent Story Visualization
por: Luo, Xiangyang, et al.
Publicado: (2025)
por: Luo, Xiangyang, et al.
Publicado: (2025)
Extracting Overlapping Microservices from Monolithic Code via Deep Semantic Embeddings and Graph Neural Network-Based Soft Clustering
por: Ziabakhsh, Morteza, et al.
Publicado: (2025)
por: Ziabakhsh, Morteza, et al.
Publicado: (2025)
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
por: Zhou, Yupeng, et al.
Publicado: (2024)
por: Zhou, Yupeng, et al.
Publicado: (2024)
StoryState: Agent-Based State Control for Consistent and Editable Storybooks
por: Sarkar, Ayushman, et al.
Publicado: (2026)
por: Sarkar, Ayushman, et al.
Publicado: (2026)
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis
por: Nguyen, Thang-Anh-Quan, et al.
Publicado: (2025)
por: Nguyen, Thang-Anh-Quan, et al.
Publicado: (2025)
ASemConsist: Adaptive Semantic Feature Control for Training-Free Identity-Consistent Generation
por: Kim, Shin Seong, et al.
Publicado: (2025)
por: Kim, Shin Seong, et al.
Publicado: (2025)
Adaptive Semantic Consistency for Cross-domain Few-shot Classification
por: Lu, Hengchu, et al.
Publicado: (2023)
por: Lu, Hengchu, et al.
Publicado: (2023)
AttriStory: Fine-grained Attribute Realization for Visual Storytelling with Diffusion Models
por: Sreenivas, Manogna, et al.
Publicado: (2026)
por: Sreenivas, Manogna, et al.
Publicado: (2026)
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
por: Yang, Mingkun, et al.
Publicado: (2024)
por: Yang, Mingkun, et al.
Publicado: (2024)
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
por: Zhuang, Cailin, et al.
Publicado: (2025)
por: Zhuang, Cailin, et al.
Publicado: (2025)
AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing
por: Chen, DuoSheng, et al.
Publicado: (2024)
por: Chen, DuoSheng, et al.
Publicado: (2024)
Guiding Diffusion Models with Semantically Degraded Conditions
por: Han, Shilong, et al.
Publicado: (2026)
por: Han, Shilong, et al.
Publicado: (2026)
Consistent Human Image and Video Generation with Spatially Conditioned Diffusion
por: Cao, Mingdeng, et al.
Publicado: (2024)
por: Cao, Mingdeng, et al.
Publicado: (2024)
Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
por: Li, Mingxiao, et al.
Publicado: (2025)
por: Li, Mingxiao, et al.
Publicado: (2025)
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context
por: Zheng, Sixiao, et al.
Publicado: (2024)
por: Zheng, Sixiao, et al.
Publicado: (2024)
GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution
por: D'Oronzio, Fabio, et al.
Publicado: (2026)
por: D'Oronzio, Fabio, et al.
Publicado: (2026)
Retinex-guided Histogram Transformer for Mask-free Shadow Removal
por: Dong, Wei, et al.
Publicado: (2025)
por: Dong, Wei, et al.
Publicado: (2025)
Getting to the Point: Pointing Improves LVLMs at Counting
por: Alghisi, Simone, et al.
Publicado: (2026)
por: Alghisi, Simone, et al.
Publicado: (2026)
Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
por: Wang, Lingdong, et al.
Publicado: (2025)
por: Wang, Lingdong, et al.
Publicado: (2025)
Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity
por: Liu, Jing, et al.
Publicado: (2026)
por: Liu, Jing, et al.
Publicado: (2026)
SDiT: Semantic Region-Adaptive for Diffusion Transformers
por: Lin, Bowen, et al.
Publicado: (2026)
por: Lin, Bowen, et al.
Publicado: (2026)
Semantically Consistent Discrete Diffusion for 3D Biological Graph Modeling
por: Prabhakar, Chinmay, et al.
Publicado: (2025)
por: Prabhakar, Chinmay, et al.
Publicado: (2025)
Learning Quantized Adaptive Conditions for Diffusion Models
por: Liang, Yuchen, et al.
Publicado: (2024)
por: Liang, Yuchen, et al.
Publicado: (2024)
Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays
por: Rath, Martin, et al.
Publicado: (2026)
por: Rath, Martin, et al.
Publicado: (2026)
Visual Neural Decoding via Improved Visual-EEG Semantic Consistency
por: Chen, Hongzhou, et al.
Publicado: (2024)
por: Chen, Hongzhou, et al.
Publicado: (2024)
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
por: Tao, Ming, et al.
Publicado: (2024)
por: Tao, Ming, et al.
Publicado: (2024)
Story-Iter: A Training-free Iterative Paradigm for Long Story Visualization
por: Mao, Jiawei, et al.
Publicado: (2024)
por: Mao, Jiawei, et al.
Publicado: (2024)
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation
por: Ma, Ao, et al.
Publicado: (2025)
por: Ma, Ao, et al.
Publicado: (2025)
A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization
por: Li, Shishen, et al.
Publicado: (2024)
por: Li, Shishen, et al.
Publicado: (2024)
Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis
por: Ko, Juyeon, et al.
Publicado: (2024)
por: Ko, Juyeon, et al.
Publicado: (2024)
A Hidden Semantic Bottleneck in Conditional Embeddings of Diffusion Transformers
por: Pham, Trung X., et al.
Publicado: (2026)
por: Pham, Trung X., et al.
Publicado: (2026)
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
por: Park, Jihun, et al.
Publicado: (2025)
por: Park, Jihun, et al.
Publicado: (2025)
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
por: Zhou, Zhengguang, et al.
Publicado: (2024)
por: Zhou, Zhengguang, et al.
Publicado: (2024)
Ejemplares similares
-
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
por: Shen, Fei, et al.
Publicado: (2024) -
ReDiStory: Region-Disentangled Diffusion for Consistent Visual Story Generation
por: Sarkar, Ayushman, et al.
Publicado: (2026) -
Synthetic Data Generation for Emotional Depth Faces: Optimizing Conditional DCGANs via Genetic Algorithms in the Latent Space and Stabilizing Training with Knowledge Distillation
por: Mousavi, Seyed Muhammad Hossein, et al.
Publicado: (2025) -
StoryGPT-V: Large Language Models as Consistent Story Visualizers
por: Shen, Xiaoqian, et al.
Publicado: (2023) -
Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models
por: Akdemir, Kiymet, et al.
Publicado: (2025)