:: Library Catalog

Bibliographic Details
Main Authors:	Light, Dean; Theologitis, Michael; Ghate, Kshitish; Li, Shuyue Stella; Newman, Benjamin; Shah, Chirag; Caliskan, Aylin; Koh, Pang Wei; Suciu, Dan; Tsvetkov, Yulia
Format:	Preprint
Published:	2026
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.11388

Similar Items

Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents
by: Chopra, Harshita, et al.
Published: (2026)

Biases Propagate in Encoder-based Vision-Language Models: A Systematic Analysis From Intrinsic Measures to Zero-shot Retrieval Outcomes
by: Ghate, Kshitish, et al.
Published: (2025)

Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases
by: Theologitis, Michael, et al.
Published: (2025)

Intrinsic Bias is Predicted by Pretraining Data and Correlates with Downstream Performance in Vision-Language Encoders
by: Ghate, Kshitish, et al.
Published: (2025)

ClaimDB: A Fact Verification Benchmark over Large Structured Data
by: Theologitis, Michael, et al.
Published: (2026)

Cognitive Foundations for Reasoning and Their Manifestation in LLMs
by: Kargupta, Priyanka, et al.
Published: (2025)

MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning
by: Li, Shuyue Stella, et al.
Published: (2024)

PrefDisco: Benchmarking Proactive Personalized Reasoning
by: Li, Shuyue Stella, et al.
Published: (2025)

Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models
by: Veerendranath, Vishruth, et al.
Published: (2024)

EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preferences
by: Ghate, Kshitish, et al.
Published: (2025)

Personal Information Parroting in Language Models
by: Subramani, Nishant, et al.
Published: (2026)

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations
by: Han, Xiaochuang, et al.
Published: (2024)

A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage
by: Xin, Rui, et al.
Published: (2025)

Cold-Start Personalization via Training-Free Priors from Structured World Models
by: Bose, Avinandan, et al.
Published: (2026)

Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
by: Wu, Addison J., et al.
Published: (2026)

InfoGatherer: Principled Information Seeking via Evidence Retrieval and Strategic Questioning
by: Taranukhin, Maksym, et al.
Published: (2026)

Precise Information Control in Long-Form Text Generation
by: He, Jacqueline, et al.
Published: (2025)

EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics
by: Li, Shuyue Stella, et al.
Published: (2026)

Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval
by: Wilson, Kyra, et al.
Published: (2024)

A Taxonomy of Stereotype Content in Large Language Models
by: Nicolas, Gandalf, et al.
Published: (2024)

ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores Non-Gendered Pronouns: Findings across Bengali and Five other Low-Resource Languages
by: Ghosh, Sourojit, et al.
Published: (2023)

'Person' == Light-skinned, Western Man, and Sexualization of Women of Color: Stereotypes in Stable Diffusion
by: Ghosh, Sourojit, et al.
Published: (2023)

ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
by: Park, Chan Young, et al.
Published: (2024)

Finding Flawed Fictions: Evaluating Complex Reasoning in Language Models via Plot Hole Detection
by: Ahuja, Kabir, et al.
Published: (2025)

Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning
by: Dash, Saloni, et al.
Published: (2025)

Identifying Features Associated with Bias Against 93 Stigmatized Groups in Language Models and Guardrail Model Safety Mitigation
by: Gueorguieva, Anna-Maria, et al.
Published: (2025)

Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch
by: Kim, Hyunwoo, et al.
Published: (2026)

ALFA: Aligning LLMs to Ask Good Questions: A Case Study in Clinical Reasoning
by: Li, Shuyue Stella, et al.
Published: (2025)

Applications of Information Inequalities to Database Theory Problems
by: Suciu, Dan
Published: (2023)

Bias Amplification in Stable Diffusion's Representation of Stigma Through Skin Tones and Their Homogeneity
by: Wilson, Kyra, et al.
Published: (2025)

"I don't see myself represented here at all": User Experiences of Stable Diffusion Outputs Containing Representational Harms across Gender Identities and Nationalities
by: Ghosh, Sourojit, et al.
Published: (2024)

REALM: A Dataset of Real-World LLM Use Cases
by: Cheng, Jingwen, et al.
Published: (2025)

Generative Value Conflicts Reveal LLM Priorities
by: Liu, Andy, et al.
Published: (2025)

Teaching LLMs to Abstain across Languages via Multilingual Feedback
by: Feng, Shangbin, et al.
Published: (2024)

Trusting Your AI Agent Emotionally and Cognitively: Development and Validation of a Semantic Differential Scale for AI Trust
by: Shang, Ruoxi, et al.
Published: (2024)

Spurious Rewards: Rethinking Training Signals in RLVR
by: Shao, Rulin, et al.
Published: (2025)

Mind Over Misinformation: Investigating the Factors of Cognitive Influences in Information Acceptance
by: Dewan, Mouly, et al.
Published: (2024)

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation
by: Chen, Junhao, et al.
Published: (2025)

MentorCollab: Selective Large-to-Small Inference-Time Guidance for Efficient Reasoning
by: Wang, Haojin, et al.
Published: (2026)

Agents Are Not Enough
by: Shah, Chirag, et al.
Published: (2024)