Saved in:
| Main Authors: | Teotia, Revant, Ross, Candace, Ullrich, Karen, Chopra, Sumit, Romero-Soriano, Adriana, Hall, Melissa, Muckley, Matthew J. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.05108 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity
by: Hall, Melissa, et al.
Published: (2023)
by: Hall, Melissa, et al.
Published: (2023)
Towards Geographic Inclusion in the Evaluation of Text-to-Image Models
by: Hall, Melissa, et al.
Published: (2024)
by: Hall, Melissa, et al.
Published: (2024)
What makes a good metric? Evaluating automatic metrics for text-to-image consistency
by: Ross, Candace, et al.
Published: (2024)
by: Ross, Candace, et al.
Published: (2024)
Multi-Modal Language Models as Text-to-Image Model Evaluators
by: Chen, Jiahui, et al.
Published: (2025)
by: Chen, Jiahui, et al.
Published: (2025)
Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance
by: Hemmat, Reyhane Askari, et al.
Published: (2024)
by: Hemmat, Reyhane Askari, et al.
Published: (2024)
Fine-Tuning In-House Large Language Models to Infer Differential Diagnosis from Radiology Reports
by: Chen, Luoyao, et al.
Published: (2024)
by: Chen, Luoyao, et al.
Published: (2024)
Improving Text-to-Image Consistency via Automatic Prompt Optimization
by: Mañas, Oscar, et al.
Published: (2024)
by: Mañas, Oscar, et al.
Published: (2024)
EvalGIM: A Library for Evaluating Generative Image Models
by: Hall, Melissa, et al.
Published: (2024)
by: Hall, Melissa, et al.
Published: (2024)
Understanding and Mitigating Tokenization Bias in Language Models
by: Phan, Buu, et al.
Published: (2024)
by: Phan, Buu, et al.
Published: (2024)
Consistency-diversity-realism Pareto fronts of conditional image generative models
by: Astolfi, Pietro, et al.
Published: (2024)
by: Astolfi, Pietro, et al.
Published: (2024)
Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search
by: Severo, Daniel, et al.
Published: (2025)
by: Severo, Daniel, et al.
Published: (2025)
Task-Dependent Evaluation of LLM Output Homogenization: A Taxonomy-Guided Framework
by: Jain, Shomik, et al.
Published: (2025)
by: Jain, Shomik, et al.
Published: (2025)
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
by: Phan, Buu, et al.
Published: (2024)
by: Phan, Buu, et al.
Published: (2024)
An Exploration of Default Images in Text-to-Image Generation
by: Simonen, Hannu, et al.
Published: (2025)
by: Simonen, Hannu, et al.
Published: (2025)
Augmented Conditioning Is Enough For Effective Training Image Generation
by: Chen, Jiahui, et al.
Published: (2025)
by: Chen, Jiahui, et al.
Published: (2025)
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)
Temporal Generalization: A Reality Check
by: Madaan, Divyam, et al.
Published: (2025)
by: Madaan, Divyam, et al.
Published: (2025)
Multiscale Markowitz
by: Nayar, Revant, et al.
Published: (2024)
by: Nayar, Revant, et al.
Published: (2024)
Endogenous Crashes as Phase Transitions
by: Nayar, Revant, et al.
Published: (2024)
by: Nayar, Revant, et al.
Published: (2024)
Beg to Differ: Understanding Reasoning-Answer Misalignment Across Languages
by: Ovalle, Anaelia, et al.
Published: (2025)
by: Ovalle, Anaelia, et al.
Published: (2025)
Pluralistic psychotherapists' and counsellors' experiences of working with actively suicidal clients: A qualitative interpretative phenomenological analysis
by: Leo Muckley
Published: (2024)
by: Leo Muckley
Published: (2024)
Increasing the Utility of Synthetic Images through Chamfer Guidance
by: Dall'Asen, Nicola, et al.
Published: (2025)
by: Dall'Asen, Nicola, et al.
Published: (2025)
EduStory: A Unified Framework for Pedagogically-Consistent Multi-Shot STEM Instructional Video Generation
by: Wu, Xinyi, et al.
Published: (2026)
by: Wu, Xinyi, et al.
Published: (2026)
Self-Sovereign Identity and Digital Product Passports: Building Trusted Digital Ecosystems Part II: Real-World Practice, Adoption Gaps, and Improvement Pathways
by: Teotia, Purvi
Published: (2026)
by: Teotia, Purvi
Published: (2026)
The Number of Solutions to $ax+by+cz=n$ for Fibonacci and Lucas triplets
by: Teotia, Pooja
Published: (2026)
by: Teotia, Pooja
Published: (2026)
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
by: Lakhanpal, Sanyam, et al.
Published: (2024)
by: Lakhanpal, Sanyam, et al.
Published: (2024)
PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs
by: Assouel, Rim, et al.
Published: (2026)
by: Assouel, Rim, et al.
Published: (2026)
Eval Factsheets: A Structured Framework for Documenting AI Evaluations
by: Bordes, Florian, et al.
Published: (2025)
by: Bordes, Florian, et al.
Published: (2025)
Improving the Physics of Video Generation with VJEPA-2 Reward Signal
by: Yuan, Jianhao, et al.
Published: (2025)
by: Yuan, Jianhao, et al.
Published: (2025)
The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models
by: Xiaofeng, Zhang, et al.
Published: (2025)
by: Xiaofeng, Zhang, et al.
Published: (2025)
Training-free Linear Image Inverses via Flows
by: Pokle, Ashwini, et al.
Published: (2023)
by: Pokle, Ashwini, et al.
Published: (2023)
A Trust-Guided Approach to MR Image Reconstruction with Side Information
by: Atalık, Arda, et al.
Published: (2025)
by: Atalık, Arda, et al.
Published: (2025)
Inference-time Physics Alignment of Video Generative Models with Latent World Models
by: Yuan, Jianhao, et al.
Published: (2026)
by: Yuan, Jianhao, et al.
Published: (2026)
Uncovering Regional Defaults from Photorealistic Forests in Text-to-Image Generation with DALL-E 2
by: Liu, Zilong, et al.
Published: (2024)
by: Liu, Zilong, et al.
Published: (2024)
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
by: Urbanek, Jack, et al.
Published: (2023)
by: Urbanek, Jack, et al.
Published: (2023)
Topological Risk Parity
by: Nayar, Revant, et al.
Published: (2026)
by: Nayar, Revant, et al.
Published: (2026)
Hybrid Context Retrieval Augmented Generation Pipeline: LLM-Augmented Knowledge Graphs and Vector Database for Accreditation Reporting Assistance
by: Edwards, Candace
Published: (2024)
by: Edwards, Candace
Published: (2024)
Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks
by: Vallaeys, Théophane, et al.
Published: (2025)
by: Vallaeys, Théophane, et al.
Published: (2025)
Evaluating Text-to-Visual Generation with Image-to-Text Generation
by: Lin, Zhiqiu, et al.
Published: (2024)
by: Lin, Zhiqiu, et al.
Published: (2024)
Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling
by: Madaan, Divyam, et al.
Published: (2026)
by: Madaan, Divyam, et al.
Published: (2026)
Similar Items
-
DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity
by: Hall, Melissa, et al.
Published: (2023) -
Towards Geographic Inclusion in the Evaluation of Text-to-Image Models
by: Hall, Melissa, et al.
Published: (2024) -
What makes a good metric? Evaluating automatic metrics for text-to-image consistency
by: Ross, Candace, et al.
Published: (2024) -
Multi-Modal Language Models as Text-to-Image Model Evaluators
by: Chen, Jiahui, et al.
Published: (2025) -
Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance
by: Hemmat, Reyhane Askari, et al.
Published: (2024)