:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Garbacea, Cristina, Tan, Chenhao
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.00038
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Personalized Benchmarking: Evaluating LLMs by Individual Preferences
by: Garbacea, Cristina, et al.
Published: (2026)

BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
by: Gui, Lin, et al.
Published: (2024)

Why is constrained neural language generation particularly challenging?
by: Garbacea, Cristina, et al.
Published: (2022)

HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation
by: Li, Mingxuan, et al.
Published: (2025)

Hypothesis Generation with Large Language Models
by: Zhou, Yangqiaoyu, et al.
Published: (2024)

AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge
by: Zhou, Karen, et al.
Published: (2026)

HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation
by: Liu, Haokun, et al.
Published: (2025)

Literature Meets Data: A Synergistic Approach to Hypothesis Generation
by: Liu, Haokun, et al.
Published: (2024)

RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals
by: Reber, David, et al.
Published: (2024)

TriAlign: Towards Universal Truth Consistency in Personalized LLM Alignment
by: Nguyen, Thi-Nhung, et al.
Published: (2026)

Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation
by: Valois, Pedro H. V., et al.
Published: (2024)

Align-then-Unlearn: Embedding Alignment for LLM Unlearning
by: Spohn, Philipp, et al.
Published: (2025)

On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions
by: Nguyen, Dang, et al.
Published: (2025)

Evaluating the Goal-Directedness of Large Language Models
by: Everitt, Tom, et al.
Published: (2025)

HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Responses
by: Jiang, Xinke, et al.
Published: (2023)

Prompting as Scientific Inquiry
by: Holtzman, Ari, et al.
Published: (2025)

Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
by: Qu, Ge, et al.
Published: (2024)

PerSEval: Assessing Personalization in Text Summarizers
by: Dasgupta, Sourish, et al.
Published: (2024)

The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research
by: Bai, Xiaoyan, et al.
Published: (2026)

Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
by: Cao, Bochuan, et al.
Published: (2023)

MAPS: Motivation-Aware Personalized Search via LLM-Driven Consultation Alignment
by: Qin, Weicong, et al.
Published: (2025)

Revisiting the Superficial Alignment Hypothesis
by: Raghavendra, Mohit, et al.
Published: (2024)

TokAlign: Efficient Vocabulary Adaptation via Token Alignment
by: Li, Chong, et al.
Published: (2025)

Superficial Safety Alignment Hypothesis
by: Li, Jianwei, et al.
Published: (2024)

ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation
by: Salemi, Alireza, et al.
Published: (2025)

TokAlign++: Advancing Vocabulary Adaptation via Better Token Alignment
by: Li, Chong, et al.
Published: (2026)

PSP: An Interpretable Per-Dimension Accent Benchmark for Indic Text-to-Speech
by: Menta, Venkata Pushpak Teja
Published: (2026)

PerQ: Efficient Evaluation of Multilingual Text Personalization Quality
by: Macko, Dominik, et al.
Published: (2025)

NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus
by: Zhao, Jinming, et al.
Published: (2023)

LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs
by: Chen, Xuan, et al.
Published: (2024)

The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval
by: Tong, Zekai, et al.
Published: (2026)

Evaluating LLM Alignment on Personality Inference from Real-World Interview Data
by: Zhu, Jianfeng, et al.
Published: (2025)

SelfCodeAlign: Self-Alignment for Code Generation
by: Wei, Yuxiang, et al.
Published: (2024)

Learning Personalized Alignment for Evaluating Open-ended Text Generation
by: Wang, Danqing, et al.
Published: (2023)

Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment
by: Yeh, Samuel, et al.
Published: (2025)

Aligning the Objective of LLM-based Program Repair
by: Xu, Junjielong, et al.
Published: (2024)

Towards a Client-Centered Assessment of LLM Therapists by Client Simulation
by: Wang, Jiashuo, et al.
Published: (2024)

Aligning to What? Limits to RLHF Based Alignment
by: Barnhart, Logan, et al.
Published: (2025)

A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis
by: Imajo, Kentaro, et al.
Published: (2025)

One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment
by: Cai, Hongru, et al.
Published: (2026)