| Main Authors: | Ohanyan, Marianna, Manukyan, Hayk, Wang, Zhangyang, Navasardyan, Shant, Shi, Humphrey |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.04032 |
Similar Items
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
by: Manukyan, Hayk, et al.
Published: (2023)
Dr-SAM: An End-to-End Framework for Vascular Segmentation, Diameter Estimation, and Anomaly Detection on Angiography Images
by: Zohranyan, Vazgen, et al.
Published: (2024)
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
by: Henschel, Roberto, et al.
Published: (2024)
FlowDIS: Language-Guided Dichotomous Image Segmentation with Flow Matching
by: Sargsyan, Andranik, et al.
Published: (2026)
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
by: Xu, Xingqian, et al.
Published: (2022)
Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
by: Jiang, Longtao, et al.
Published: (2025)
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
by: Wang, Jiahao, et al.
Published: (2024)
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
by: Zhao, Peiang, et al.
Published: (2023)
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation
by: Chen, Chieh-Yun, et al.
Published: (2025)
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
by: Isajanyan, Arman, et al.
Published: (2024)
Efficient Image Generation with Variadic Attention Heads
by: Walton, Steven, et al.
Published: (2022)
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
by: Gong, Biao, et al.
Published: (2023)
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
by: D'Incà, Moreno, et al.
Published: (2024)
Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References
by: Hsiao, Teng-Fang, et al.
Published: (2024)
Training-Free Layout-to-Image Generation with Marginal Attention Constraints
by: Chen, Huancheng, et al.
Published: (2024)
Layout Agnostic Scene Text Image Synthesis with Diffusion Models
by: Zhangli, Qilong, et al.
Published: (2024)
Control and Realism: Best of Both Worlds in Layout-to-Image without Training
by: Li, Bonan, et al.
Published: (2025)
ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts
by: Huang, Linhao, et al.
Published: (2025)
AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis
by: Lai, Zhangyu, et al.
Published: (2025)
CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion
by: Li, Yanyu, et al.
Published: (2025)
OmniText: A Training-Free Generalist for Controllable Text-Image Manipulation
by: Gunawan, Agus, et al.
Published: (2025)
DanceText: A Training-Free Layered Framework for Controllable Multilingual Text Transformation in Images
by: Yu, Zhenyu, et al.
Published: (2025)
ConsistCompose: Unified Multimodal Layout Control for Image Composition
by: Shi, Xuanke, et al.
Published: (2025)
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
by: Pang, Lexi, et al.
Published: (2025)
Training-free Composite Scene Generation for Layout-to-Image Synthesis
by: Liu, Jiaqi, et al.
Published: (2024)
ExpertGen: Training-Free Expert Guidance for Controllable Text-to-Face Generation
by: Shi, Liang, et al.
Published: (2025)
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
by: Hu, Taihang, et al.
Published: (2024)
Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis
by: Han, Woojung, et al.
Published: (2025)
PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control
by: Wang, Ruichen, et al.
Published: (2024)
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
by: Peruzzo, Elia, et al.
Published: (2024)
Applying Medical Imaging Tractography Techniques to Painterly Rendering of Images
by: Di Biase, Alberto
Published: (2025)
TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis
by: Rahman, Kazi Mahathir, et al.
Published: (2025)
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
by: D'Incà, Moreno, et al.
Published: (2024)
Multitwine: Multi-Object Compositing with Text and Layout Control
by: Tarrés, Gemma Canet, et al.
Published: (2025)
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
by: Wang, Xierui, et al.
Published: (2024)
Enhancing Object Coherence in Layout-to-Image Synthesis
by: Wang, Yibin, et al.
Published: (2023)
Co-generation of Layout and Shape from Text via Autoregressive 3D Diffusion
by: Tang, Zhenggang, et al.
Published: (2026)
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
by: Li, Yayuan, et al.
Published: (2024)
Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval
by: Wang, Tong, et al.
Published: (2026)
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors
by: Zhang, Yabo, et al.
Published: (2025)