Saved in:
| Main Authors: | Poole-Dayan, Elinor, Wu, Jiayi, Sorensen, Taylor, Pei, Jiaxin, Bakker, Michiel A. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.01351 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
by: Poole-Dayan, Elinor, et al.
Published: (2024)
by: Poole-Dayan, Elinor, et al.
Published: (2024)
On the Relationship between Truth and Political Bias in Language Models
by: Fulay, Suyash, et al.
Published: (2024)
by: Fulay, Suyash, et al.
Published: (2024)
From Delegates to Trustees: How Optimizing for Long-Term Interests Shapes Bias and Alignment in LLM
by: Fulay, Suyash, et al.
Published: (2025)
by: Fulay, Suyash, et al.
Published: (2025)
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
by: Choi, Minje, et al.
Published: (2023)
by: Choi, Minje, et al.
Published: (2023)
An AI-Powered Framework for Analyzing Collective Idea Evolution in Deliberative Assemblies
by: Poole-Dayan, Elinor, et al.
Published: (2025)
by: Poole-Dayan, Elinor, et al.
Published: (2025)
Value Profiles for Encoding Human Variation
by: Sorensen, Taylor, et al.
Published: (2025)
by: Sorensen, Taylor, et al.
Published: (2025)
Opt-ICL at LeWiDi-2025: Maximizing In-Context Signal from Rater Examples via Meta-Learning
by: Sorensen, Taylor, et al.
Published: (2025)
by: Sorensen, Taylor, et al.
Published: (2025)
AI Assistance Reduces Persistence and Hurts Independent Performance
by: Liu, Grace, et al.
Published: (2026)
by: Liu, Grace, et al.
Published: (2026)
Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift
by: Vaccaro, Michelle, et al.
Published: (2026)
by: Vaccaro, Michelle, et al.
Published: (2026)
When is using AI the rational choice? The importance of counterfactuals in AI deployment decisions
by: Lehner, Paul, et al.
Published: (2025)
by: Lehner, Paul, et al.
Published: (2025)
Conformal Risk Control for Safety-Critical Wildfire Evacuation Mapping: A Comparative Study of Tabular, Spatial, and Graph-Based Models
by: Dayan, Baljinnyam
Published: (2026)
by: Dayan, Baljinnyam
Published: (2026)
Belief Engine: Configurable and Inspectable Stance Dynamics in Multi-Agent LLM Deliberation
by: Yang, Joshua C., et al.
Published: (2026)
by: Yang, Joshua C., et al.
Published: (2026)
Tell Me Why: Incentivizing Explanations
by: Srinivasan, Siddarth, et al.
Published: (2025)
by: Srinivasan, Siddarth, et al.
Published: (2025)
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
by: Li, Jinyang, et al.
Published: (2024)
by: Li, Jinyang, et al.
Published: (2024)
Overtone: Cyclic Patch Modulation for Clean, Efficient, and Flexible Physics Emulators
by: Mukhopadhyay, Payel, et al.
Published: (2025)
by: Mukhopadhyay, Payel, et al.
Published: (2025)
Measuring What Matters: The AI Pluralism Index
by: Mushkani, Rashid
Published: (2025)
by: Mushkani, Rashid
Published: (2025)
The World According to LLMs: How Geographic Origin Influences LLMs' Entity Deduction Capabilities
by: Lalai, Harsh Nishant, et al.
Published: (2025)
by: Lalai, Harsh Nishant, et al.
Published: (2025)
Why Isn't Relational Learning Taking Over the World?
by: Poole, David
Published: (2025)
by: Poole, David
Published: (2025)
RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment
by: Cao, Xiaoyang, et al.
Published: (2025)
by: Cao, Xiaoyang, et al.
Published: (2025)
Benchmarking for Domain-Specific LLMs: A Case Study on Academia and Beyond
by: Chen, Rubing, et al.
Published: (2025)
by: Chen, Rubing, et al.
Published: (2025)
DHP Benchmark: Are LLMs Good NLG Evaluators?
by: Wang, Yicheng, et al.
Published: (2024)
by: Wang, Yicheng, et al.
Published: (2024)
Ontology Learning with LLMs: A Benchmark Study on Axiom Identification
by: Bakker, Roos M., et al.
Published: (2025)
by: Bakker, Roos M., et al.
Published: (2025)
GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving
by: Zhang, Jiaxin, et al.
Published: (2024)
by: Zhang, Jiaxin, et al.
Published: (2024)
What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models
by: Chandak, Payal, et al.
Published: (2026)
by: Chandak, Payal, et al.
Published: (2026)
AI and Collective Decisions: Strengthening Legitimacy and Losers' Consent
by: Fulay, Suyash, et al.
Published: (2026)
by: Fulay, Suyash, et al.
Published: (2026)
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles
by: Ashkinaze, Joshua, et al.
Published: (2024)
by: Ashkinaze, Joshua, et al.
Published: (2024)
Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
by: Sun, Huaman, et al.
Published: (2023)
by: Sun, Huaman, et al.
Published: (2023)
Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection
by: Liu, Jiaxin, et al.
Published: (2025)
by: Liu, Jiaxin, et al.
Published: (2025)
Stop Automating Peer Review Without Rigorous Evaluation
by: Baumann, Joachim, et al.
Published: (2026)
by: Baumann, Joachim, et al.
Published: (2026)
MULTITEXTEDIT: Benchmarking Cross-Lingual Degradation in Text-in-Image Editing
by: Cheng, Liwei, et al.
Published: (2026)
by: Cheng, Liwei, et al.
Published: (2026)
Error-related Potential Variability: Exploring the Effects on Classification and Transferability
by: Poole, Benjamin, et al.
Published: (2023)
by: Poole, Benjamin, et al.
Published: (2023)
Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control
by: Ginzburg, Elinor, et al.
Published: (2025)
by: Ginzburg, Elinor, et al.
Published: (2025)
Beyond Binary Moral Judgment: Modeling Ethical Pluralism in AI
by: Aijaz, Aisha, et al.
Published: (2026)
by: Aijaz, Aisha, et al.
Published: (2026)
MolGround: A Benchmark for Molecular Grounding
by: Wu, Jiaxin, et al.
Published: (2025)
by: Wu, Jiaxin, et al.
Published: (2025)
KNVQA: A Benchmark for evaluation knowledge-based VQA
by: Cheng, Sirui, et al.
Published: (2023)
by: Cheng, Sirui, et al.
Published: (2023)
Benchmark Health Index: A Systematic Framework for Benchmarking the Benchmarks of LLMs
by: Zhu, Longyuan, et al.
Published: (2026)
by: Zhu, Longyuan, et al.
Published: (2026)
HugAgent: Benchmarking LLMs for Simulation of Individualized Human Reasoning
by: Li, Chance Jiajie, et al.
Published: (2025)
by: Li, Chance Jiajie, et al.
Published: (2025)
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation
by: Wu, Zhuguanyu, et al.
Published: (2025)
by: Wu, Zhuguanyu, et al.
Published: (2025)
Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression
by: Adams, Jadie, et al.
Published: (2025)
by: Adams, Jadie, et al.
Published: (2025)
Towards Cross-lingual Values Judgment: A Consensus-Pluralism Perspective
by: Chen, Yukun, et al.
Published: (2026)
by: Chen, Yukun, et al.
Published: (2026)
Similar Items
-
LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
by: Poole-Dayan, Elinor, et al.
Published: (2024) -
On the Relationship between Truth and Political Bias in Language Models
by: Fulay, Suyash, et al.
Published: (2024) -
From Delegates to Trustees: How Optimizing for Long-Term Interests Shapes Bias and Alignment in LLM
by: Fulay, Suyash, et al.
Published: (2025) -
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
by: Choi, Minje, et al.
Published: (2023) -
An AI-Powered Framework for Analyzing Collective Idea Evolution in Deliberative Assemblies
by: Poole-Dayan, Elinor, et al.
Published: (2025)