Saved in:
| Main Authors: | Garbacea, Cristina, Tan, Chenhao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.00038 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Personalized Benchmarking: Evaluating LLMs by Individual Preferences
by: Garbacea, Cristina, et al.
Published: (2026)
by: Garbacea, Cristina, et al.
Published: (2026)
BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
by: Gui, Lin, et al.
Published: (2024)
by: Gui, Lin, et al.
Published: (2024)
Why is constrained neural language generation particularly challenging?
by: Garbacea, Cristina, et al.
Published: (2022)
by: Garbacea, Cristina, et al.
Published: (2022)
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation
by: Li, Mingxuan, et al.
Published: (2025)
by: Li, Mingxuan, et al.
Published: (2025)
Hypothesis Generation with Large Language Models
by: Zhou, Yangqiaoyu, et al.
Published: (2024)
by: Zhou, Yangqiaoyu, et al.
Published: (2024)
AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge
by: Zhou, Karen, et al.
Published: (2026)
by: Zhou, Karen, et al.
Published: (2026)
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation
by: Liu, Haokun, et al.
Published: (2025)
by: Liu, Haokun, et al.
Published: (2025)
Literature Meets Data: A Synergistic Approach to Hypothesis Generation
by: Liu, Haokun, et al.
Published: (2024)
by: Liu, Haokun, et al.
Published: (2024)
RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals
by: Reber, David, et al.
Published: (2024)
by: Reber, David, et al.
Published: (2024)
TriAlign: Towards Universal Truth Consistency in Personalized LLM Alignment
by: Nguyen, Thi-Nhung, et al.
Published: (2026)
by: Nguyen, Thi-Nhung, et al.
Published: (2026)
Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation
by: Valois, Pedro H. V., et al.
Published: (2024)
by: Valois, Pedro H. V., et al.
Published: (2024)
Align-then-Unlearn: Embedding Alignment for LLM Unlearning
by: Spohn, Philipp, et al.
Published: (2025)
by: Spohn, Philipp, et al.
Published: (2025)
On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions
by: Nguyen, Dang, et al.
Published: (2025)
by: Nguyen, Dang, et al.
Published: (2025)
Evaluating the Goal-Directedness of Large Language Models
by: Everitt, Tom, et al.
Published: (2025)
by: Everitt, Tom, et al.
Published: (2025)
HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Responses
by: Jiang, Xinke, et al.
Published: (2023)
by: Jiang, Xinke, et al.
Published: (2023)
Prompting as Scientific Inquiry
by: Holtzman, Ari, et al.
Published: (2025)
by: Holtzman, Ari, et al.
Published: (2025)
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
by: Qu, Ge, et al.
Published: (2024)
by: Qu, Ge, et al.
Published: (2024)
PerSEval: Assessing Personalization in Text Summarizers
by: Dasgupta, Sourish, et al.
Published: (2024)
by: Dasgupta, Sourish, et al.
Published: (2024)
The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research
by: Bai, Xiaoyan, et al.
Published: (2026)
by: Bai, Xiaoyan, et al.
Published: (2026)
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
by: Cao, Bochuan, et al.
Published: (2023)
by: Cao, Bochuan, et al.
Published: (2023)
MAPS: Motivation-Aware Personalized Search via LLM-Driven Consultation Alignment
by: Qin, Weicong, et al.
Published: (2025)
by: Qin, Weicong, et al.
Published: (2025)
Revisiting the Superficial Alignment Hypothesis
by: Raghavendra, Mohit, et al.
Published: (2024)
by: Raghavendra, Mohit, et al.
Published: (2024)
TokAlign: Efficient Vocabulary Adaptation via Token Alignment
by: Li, Chong, et al.
Published: (2025)
by: Li, Chong, et al.
Published: (2025)
Superficial Safety Alignment Hypothesis
by: Li, Jianwei, et al.
Published: (2024)
by: Li, Jianwei, et al.
Published: (2024)
ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation
by: Salemi, Alireza, et al.
Published: (2025)
by: Salemi, Alireza, et al.
Published: (2025)
TokAlign++: Advancing Vocabulary Adaptation via Better Token Alignment
by: Li, Chong, et al.
Published: (2026)
by: Li, Chong, et al.
Published: (2026)
PSP: An Interpretable Per-Dimension Accent Benchmark for Indic Text-to-Speech
by: Menta, Venkata Pushpak Teja
Published: (2026)
by: Menta, Venkata Pushpak Teja
Published: (2026)
PerQ: Efficient Evaluation of Multilingual Text Personalization Quality
by: Macko, Dominik, et al.
Published: (2025)
by: Macko, Dominik, et al.
Published: (2025)
NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus
by: Zhao, Jinming, et al.
Published: (2023)
by: Zhao, Jinming, et al.
Published: (2023)
LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs
by: Chen, Xuan, et al.
Published: (2024)
by: Chen, Xuan, et al.
Published: (2024)
The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval
by: Tong, Zekai, et al.
Published: (2026)
by: Tong, Zekai, et al.
Published: (2026)
Evaluating LLM Alignment on Personality Inference from Real-World Interview Data
by: Zhu, Jianfeng, et al.
Published: (2025)
by: Zhu, Jianfeng, et al.
Published: (2025)
SelfCodeAlign: Self-Alignment for Code Generation
by: Wei, Yuxiang, et al.
Published: (2024)
by: Wei, Yuxiang, et al.
Published: (2024)
Learning Personalized Alignment for Evaluating Open-ended Text Generation
by: Wang, Danqing, et al.
Published: (2023)
by: Wang, Danqing, et al.
Published: (2023)
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment
by: Yeh, Samuel, et al.
Published: (2025)
by: Yeh, Samuel, et al.
Published: (2025)
Aligning the Objective of LLM-based Program Repair
by: Xu, Junjielong, et al.
Published: (2024)
by: Xu, Junjielong, et al.
Published: (2024)
Towards a Client-Centered Assessment of LLM Therapists by Client Simulation
by: Wang, Jiashuo, et al.
Published: (2024)
by: Wang, Jiashuo, et al.
Published: (2024)
Aligning to What? Limits to RLHF Based Alignment
by: Barnhart, Logan, et al.
Published: (2025)
by: Barnhart, Logan, et al.
Published: (2025)
A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis
by: Imajo, Kentaro, et al.
Published: (2025)
by: Imajo, Kentaro, et al.
Published: (2025)
One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment
by: Cai, Hongru, et al.
Published: (2026)
by: Cai, Hongru, et al.
Published: (2026)
Similar Items
-
Personalized Benchmarking: Evaluating LLMs by Individual Preferences
by: Garbacea, Cristina, et al.
Published: (2026) -
BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
by: Gui, Lin, et al.
Published: (2024) -
Why is constrained neural language generation particularly challenging?
by: Garbacea, Cristina, et al.
Published: (2022) -
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation
by: Li, Mingxuan, et al.
Published: (2025) -
Hypothesis Generation with Large Language Models
by: Zhou, Yangqiaoyu, et al.
Published: (2024)