:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Teotia, Revant, Ross, Candace, Ullrich, Karen, Chopra, Sumit, Romero-Soriano, Adriana, Hall, Melissa, Muckley, Matthew J.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.05108
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity
by: Hall, Melissa, et al.
Published: (2023)

Towards Geographic Inclusion in the Evaluation of Text-to-Image Models
by: Hall, Melissa, et al.
Published: (2024)

What makes a good metric? Evaluating automatic metrics for text-to-image consistency
by: Ross, Candace, et al.
Published: (2024)

Multi-Modal Language Models as Text-to-Image Model Evaluators
by: Chen, Jiahui, et al.
Published: (2025)

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance
by: Hemmat, Reyhane Askari, et al.
Published: (2024)

Fine-Tuning In-House Large Language Models to Infer Differential Diagnosis from Radiology Reports
by: Chen, Luoyao, et al.
Published: (2024)

Improving Text-to-Image Consistency via Automatic Prompt Optimization
by: Mañas, Oscar, et al.
Published: (2024)

EvalGIM: A Library for Evaluating Generative Image Models
by: Hall, Melissa, et al.
Published: (2024)

Understanding and Mitigating Tokenization Bias in Language Models
by: Phan, Buu, et al.
Published: (2024)

Consistency-diversity-realism Pareto fronts of conditional image generative models
by: Astolfi, Pietro, et al.
Published: (2024)

Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search
by: Severo, Daniel, et al.
Published: (2025)

Task-Dependent Evaluation of LLM Output Homogenization: A Taxonomy-Guided Framework
by: Jain, Shomik, et al.
Published: (2025)

Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
by: Phan, Buu, et al.
Published: (2024)

An Exploration of Default Images in Text-to-Image Generation
by: Simonen, Hannu, et al.
Published: (2025)

Augmented Conditioning Is Enough For Effective Training Image Generation
by: Chen, Jiahui, et al.
Published: (2025)

On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)

Temporal Generalization: A Reality Check
by: Madaan, Divyam, et al.
Published: (2025)

Multiscale Markowitz
by: Nayar, Revant, et al.
Published: (2024)

Endogenous Crashes as Phase Transitions
by: Nayar, Revant, et al.
Published: (2024)

Beg to Differ: Understanding Reasoning-Answer Misalignment Across Languages
by: Ovalle, Anaelia, et al.
Published: (2025)

Pluralistic psychotherapists' and counsellors' experiences of working with actively suicidal clients: A qualitative interpretative phenomenological analysis
by: Leo Muckley
Published: (2024)

Increasing the Utility of Synthetic Images through Chamfer Guidance
by: Dall'Asen, Nicola, et al.
Published: (2025)

EduStory: A Unified Framework for Pedagogically-Consistent Multi-Shot STEM Instructional Video Generation
by: Wu, Xinyi, et al.
Published: (2026)

Self-Sovereign Identity and Digital Product Passports: Building Trusted Digital Ecosystems Part II: Real-World Practice, Adoption Gaps, and Improvement Pathways
by: Teotia, Purvi
Published: (2026)

The Number of Solutions to $ax+by+cz=n$ for Fibonacci and Lucas triplets
by: Teotia, Pooja
Published: (2026)

Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
by: Lakhanpal, Sanyam, et al.
Published: (2024)

PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs
by: Assouel, Rim, et al.
Published: (2026)

Eval Factsheets: A Structured Framework for Documenting AI Evaluations
by: Bordes, Florian, et al.
Published: (2025)

Improving the Physics of Video Generation with VJEPA-2 Reward Signal
by: Yuan, Jianhao, et al.
Published: (2025)

The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models
by: Xiaofeng, Zhang, et al.
Published: (2025)

Training-free Linear Image Inverses via Flows
by: Pokle, Ashwini, et al.
Published: (2023)

A Trust-Guided Approach to MR Image Reconstruction with Side Information
by: Atalık, Arda, et al.
Published: (2025)

Inference-time Physics Alignment of Video Generative Models with Latent World Models
by: Yuan, Jianhao, et al.
Published: (2026)

Uncovering Regional Defaults from Photorealistic Forests in Text-to-Image Generation with DALL-E 2
by: Liu, Zilong, et al.
Published: (2024)

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
by: Urbanek, Jack, et al.
Published: (2023)

Topological Risk Parity
by: Nayar, Revant, et al.
Published: (2026)

Hybrid Context Retrieval Augmented Generation Pipeline: LLM-Augmented Knowledge Graphs and Vector Database for Accreditation Reporting Assistance
by: Edwards, Candace
Published: (2024)

Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks
by: Vallaeys, Théophane, et al.
Published: (2025)

Evaluating Text-to-Visual Generation with Image-to-Text Generation
by: Lin, Zhiqiu, et al.
Published: (2024)

Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling
by: Madaan, Divyam, et al.
Published: (2026)