:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Poole-Dayan, Elinor, Wu, Jiayi, Sorensen, Taylor, Pei, Jiaxin, Bakker, Michiel A.
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.01351
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
by: Poole-Dayan, Elinor, et al.
Published: (2024)

On the Relationship between Truth and Political Bias in Language Models
by: Fulay, Suyash, et al.
Published: (2024)

From Delegates to Trustees: How Optimizing for Long-Term Interests Shapes Bias and Alignment in LLM
by: Fulay, Suyash, et al.
Published: (2025)

Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
by: Choi, Minje, et al.
Published: (2023)

An AI-Powered Framework for Analyzing Collective Idea Evolution in Deliberative Assemblies
by: Poole-Dayan, Elinor, et al.
Published: (2025)

Value Profiles for Encoding Human Variation
by: Sorensen, Taylor, et al.
Published: (2025)

Opt-ICL at LeWiDi-2025: Maximizing In-Context Signal from Rater Examples via Meta-Learning
by: Sorensen, Taylor, et al.
Published: (2025)

AI Assistance Reduces Persistence and Hurts Independent Performance
by: Liu, Grace, et al.
Published: (2026)

Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift
by: Vaccaro, Michelle, et al.
Published: (2026)

When is using AI the rational choice? The importance of counterfactuals in AI deployment decisions
by: Lehner, Paul, et al.
Published: (2025)

Conformal Risk Control for Safety-Critical Wildfire Evacuation Mapping: A Comparative Study of Tabular, Spatial, and Graph-Based Models
by: Dayan, Baljinnyam
Published: (2026)

Belief Engine: Configurable and Inspectable Stance Dynamics in Multi-Agent LLM Deliberation
by: Yang, Joshua C., et al.
Published: (2026)

Tell Me Why: Incentivizing Explanations
by: Srinivasan, Siddarth, et al.
Published: (2025)

Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
by: Li, Jinyang, et al.
Published: (2024)

Overtone: Cyclic Patch Modulation for Clean, Efficient, and Flexible Physics Emulators
by: Mukhopadhyay, Payel, et al.
Published: (2025)

Measuring What Matters: The AI Pluralism Index
by: Mushkani, Rashid
Published: (2025)

The World According to LLMs: How Geographic Origin Influences LLMs' Entity Deduction Capabilities
by: Lalai, Harsh Nishant, et al.
Published: (2025)

Why Isn't Relational Learning Taking Over the World?
by: Poole, David
Published: (2025)

RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment
by: Cao, Xiaoyang, et al.
Published: (2025)

Benchmarking for Domain-Specific LLMs: A Case Study on Academia and Beyond
by: Chen, Rubing, et al.
Published: (2025)

DHP Benchmark: Are LLMs Good NLG Evaluators?
by: Wang, Yicheng, et al.
Published: (2024)

Ontology Learning with LLMs: A Benchmark Study on Axiom Identification
by: Bakker, Roos M., et al.
Published: (2025)

GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving
by: Zhang, Jiaxin, et al.
Published: (2024)

What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models
by: Chandak, Payal, et al.
Published: (2026)

AI and Collective Decisions: Strengthening Legitimacy and Losers' Consent
by: Fulay, Suyash, et al.
Published: (2026)

Plurals: A System for Guiding LLMs Via Simulated Social Ensembles
by: Ashkinaze, Joshua, et al.
Published: (2024)

Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
by: Sun, Huaman, et al.
Published: (2023)

Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection
by: Liu, Jiaxin, et al.
Published: (2025)

Stop Automating Peer Review Without Rigorous Evaluation
by: Baumann, Joachim, et al.
Published: (2026)

MULTITEXTEDIT: Benchmarking Cross-Lingual Degradation in Text-in-Image Editing
by: Cheng, Liwei, et al.
Published: (2026)

Error-related Potential Variability: Exploring the Effects on Classification and Transferability
by: Poole, Benjamin, et al.
Published: (2023)

Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control
by: Ginzburg, Elinor, et al.
Published: (2025)

Beyond Binary Moral Judgment: Modeling Ethical Pluralism in AI
by: Aijaz, Aisha, et al.
Published: (2026)

MolGround: A Benchmark for Molecular Grounding
by: Wu, Jiaxin, et al.
Published: (2025)

KNVQA: A Benchmark for evaluation knowledge-based VQA
by: Cheng, Sirui, et al.
Published: (2023)

Benchmark Health Index: A Systematic Framework for Benchmarking the Benchmarks of LLMs
by: Zhu, Longyuan, et al.
Published: (2026)

HugAgent: Benchmarking LLMs for Simulation of Individualized Human Reasoning
by: Li, Chance Jiajie, et al.
Published: (2025)

FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation
by: Wu, Zhuguanyu, et al.
Published: (2025)

Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression
by: Adams, Jadie, et al.
Published: (2025)

Towards Cross-lingual Values Judgment: A Consensus-Pluralism Perspective
by: Chen, Yukun, et al.
Published: (2026)