:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Kim, Joong Ho, Thai, Nicholas, Dip, Souhardya Saha, Lao, Dong, Mills, Keith G.
Formato:	Preprint
Publicado:	2026
Materias:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
Acceso en línea:	https://arxiv.org/abs/2603.12506
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
por: Kim, Taewhan, et al.
Publicado: (2024)

CLUE: Controllable Latent space of Unprompted Embeddings for Diversity Management in Text-to-Image Synthesis
por: Park, Keunwoo, et al.
Publicado: (2025)

Optimizing Prompts for Text-to-Image Generation
por: Hao, Yaru, et al.
Publicado: (2022)

PromptSR: Cascade Prompting for Lightweight Image Super-Resolution
por: Liu, Wenyang, et al.
Publicado: (2025)

Token-Efficient Multimodal Reasoning via Image Prompt Packaging
por: Choi, Joong Ho, et al.
Publicado: (2026)

Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
por: Zhang, Wenchao, et al.
Publicado: (2025)

FIQ: Fundamental Question Generation with the Integration of Question Embeddings for Video Question Answering
por: Oh, Ju-Young, et al.
Publicado: (2025)

Prompt Refinement with Image Pivot for Text-to-Image Generation
por: Zhan, Jingtao, et al.
Publicado: (2024)

Mechanistic Interpretability of Diffusion Models: Circuit-Level Analysis and Causal Validation
por: Roy, Dip
Publicado: (2025)

TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
por: Kim, Ho-Joong, et al.
Publicado: (2024)

DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
por: Kim, Ho-Joong, et al.
Publicado: (2025)

PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation
por: Jing, Zonglei, et al.
Publicado: (2025)

RGB2Point: 3D Point Cloud Generation from Single RGB Images
por: Lee, Jae Joong, et al.
Publicado: (2024)

TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency
por: Wang, Juntong, et al.
Publicado: (2025)

FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers
por: Chen, Ruichen, et al.
Publicado: (2025)

Iterative Prompt Refinement for Safer Text-to-Image Generation
por: Jeon, Jinwoo, et al.
Publicado: (2025)

Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers
por: Hong, Jung-Ho, et al.
Publicado: (2025)

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
por: Naseh, Ali, et al.
Publicado: (2024)

ClipTBP: Clip-Pair based Temporal Boundary Prediction with Boundary-Aware Learning for Moment Retrieval
por: Kim, Ji-Hyeon, et al.
Publicado: (2026)

Prompt Augmentation for Self-supervised Text-guided Image Manipulation
por: Bodur, Rumeysa, et al.
Publicado: (2024)

Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
por: Park, Sangha, et al.
Publicado: (2025)

Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape
por: Chen, Ruichen, et al.
Publicado: (2025)

Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
por: Qin, Ziyuan, et al.
Publicado: (2024)

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
por: Wiles, Olivia, et al.
Publicado: (2024)

Unified Prompt Attack Against Text-to-Image Generation Models
por: Peng, Duo, et al.
Publicado: (2025)

Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
por: Yun, Taeyoung, et al.
Publicado: (2025)

Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement
por: Jeong, Suchae, et al.
Publicado: (2025)

Dynamic Prompt Optimizing for Text-to-Image Generation
por: Mo, Wenyi, et al.
Publicado: (2024)

Fast Prompt Alignment for Text-to-Image Generation
por: Mrini, Khalil, et al.
Publicado: (2024)

Naïve Exposure of Generative AI Capabilities Undermines Deepfake Detection
por: Kim, Sunpill, et al.
Publicado: (2026)

Multi-Scale Visual Prompting for Lightweight Small-Image Classification
por: Khazem, Salim
Publicado: (2025)

Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraining
por: Jang, Hyeonseo, et al.
Publicado: (2026)

Bayesian Autoencoder for Medical Anomaly Detection: Uncertainty-Aware Approach for Brain 2 MRI Analysis
por: Roy, Dip
Publicado: (2025)

TIPO: Text to Image with Text Presampling for Prompt Optimization
por: Yeh, Shih-Ying, et al.
Publicado: (2024)

Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
por: Chen, Muxi, et al.
Publicado: (2024)

TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation
por: Ozaki, Shintaro, et al.
Publicado: (2025)

ProTPS: Prototype-Guided Text Prompt Selection for Continual Learning
por: Mei, Jie, et al.
Publicado: (2026)

Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation
por: Shi, Liang, et al.
Publicado: (2024)

Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting
por: Chen, Zijie, et al.
Publicado: (2023)

NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
por: Xie, Yu, et al.
Publicado: (2025)