| Main authors: | Kim, Joong Ho, Thai, Nicholas, Dip, Souhardya Saha, Lao, Dong, Mills, Keith G. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2603.12506 |
Similar items
ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
by: Kim, Taewhan, et al.
Published: (2024)
CLUE: Controllable Latent space of Unprompted Embeddings for Diversity Management in Text-to-Image Synthesis
by: Park, Keunwoo, et al.
Published: (2025)
Optimizing Prompts for Text-to-Image Generation
by: Hao, Yaru, et al.
Published: (2022)
PromptSR: Cascade Prompting for Lightweight Image Super-Resolution
by: Liu, Wenyang, et al.
Published: (2025)
Token-Efficient Multimodal Reasoning via Image Prompt Packaging
by: Choi, Joong Ho, et al.
Published: (2026)
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
by: Zhang, Wenchao, et al.
Published: (2025)
FIQ: Fundamental Question Generation with the Integration of Question Embeddings for Video Question Answering
by: Oh, Ju-Young, et al.
Published: (2025)
Prompt Refinement with Image Pivot for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)
Mechanistic Interpretability of Diffusion Models: Circuit-Level Analysis and Causal Validation
by: Roy, Dip
Published: (2025)
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
by: Kim, Ho-Joong, et al.
Published: (2024)
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
by: Kim, Ho-Joong, et al.
Published: (2025)
PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation
by: Jing, Zonglei, et al.
Published: (2025)
RGB2Point: 3D Point Cloud Generation from Single RGB Images
by: Lee, Jae Joong, et al.
Published: (2024)
TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency
by: Wang, Juntong, et al.
Published: (2025)
FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers
by: Chen, Ruichen, et al.
Published: (2025)
Iterative Prompt Refinement for Safer Text-to-Image Generation
by: Jeon, Jinwoo, et al.
Published: (2025)
Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers
by: Hong, Jung-Ho, et al.
Published: (2025)
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
by: Naseh, Ali, et al.
Published: (2024)
ClipTBP: Clip-Pair based Temporal Boundary Prediction with Boundary-Aware Learning for Moment Retrieval
by: Kim, Ji-Hyeon, et al.
Published: (2026)
Prompt Augmentation for Self-supervised Text-guided Image Manipulation
by: Bodur, Rumeysa, et al.
Published: (2024)
Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
by: Park, Sangha, et al.
Published: (2025)
Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape
by: Chen, Ruichen, et al.
Published: (2025)
Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
by: Qin, Ziyuan, et al.
Published: (2024)
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
by: Wiles, Olivia, et al.
Published: (2024)
Unified Prompt Attack Against Text-to-Image Generation Models
by: Peng, Duo, et al.
Published: (2025)
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
by: Yun, Taeyoung, et al.
Published: (2025)
Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement
by: Jeong, Suchae, et al.
Published: (2025)
Dynamic Prompt Optimizing for Text-to-Image Generation
by: Mo, Wenyi, et al.
Published: (2024)
Fast Prompt Alignment for Text-to-Image Generation
by: Mrini, Khalil, et al.
Published: (2024)
Naïve Exposure of Generative AI Capabilities Undermines Deepfake Detection
by: Kim, Sunpill, et al.
Published: (2026)
Multi-Scale Visual Prompting for Lightweight Small-Image Classification
by: Khazem, Salim
Published: (2025)
Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraining
by: Jang, Hyeonseo, et al.
Published: (2026)
Bayesian Autoencoder for Medical Anomaly Detection: Uncertainty-Aware Approach for Brain 2 MRI Analysis
by: Roy, Dip
Published: (2025)
TIPO: Text to Image with Text Presampling for Prompt Optimization
by: Yeh, Shih-Ying, et al.
Published: (2024)
Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
by: Chen, Muxi, et al.
Published: (2024)
TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation
by: Ozaki, Shintaro, et al.
Published: (2025)
ProTPS: Prototype-Guided Text Prompt Selection for Continual Learning
by: Mei, Jie, et al.
Published: (2026)
Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation
by: Shi, Liang, et al.
Published: (2024)
Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting
by: Chen, Zijie, et al.
Published: (2023)
NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
by: Xie, Yu, et al.
Published: (2025)