:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Ryan, Finlayson, Matthew, Soldaini, Luca, Swayamdipta, Swabha, Jia, Robin
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2505.03052
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Logits of API-Protected LLMs Leak Proprietary Information
by: Finlayson, Matthew, et al.
Published: (2024)

Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge
by: Cui, Xinyue, et al.
Published: (2025)

Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information
by: Ethayarajh, Kawin, et al.
Published: (2021)

How Reliable is Language Model Micro-Benchmarking?
by: Yauney, Gregory, et al.
Published: (2025)

Better Language Model Inversion by Compactly Representing Next-Token Distributions
by: Nazir, Murtaza, et al.
Published: (2025)

Annotating FrameNet via Structure-Conditioned Language Generation
by: Cui, Xinyue, et al.
Published: (2024)

Every Language Model Has a Forgery-Resistant Signature
by: Finlayson, Matthew, et al.
Published: (2025)

Compare without Despair: Reliable Preference Evaluation with Generation Separability
by: Ghosh, Sayan, et al.
Published: (2024)

Why Fine-Tuning Encourages Hallucinations and How to Fix It
by: Kaplan, Guy, et al.
Published: (2026)

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
by: Weller, Orion, et al.
Published: (2024)

Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection
by: Kulkarni, Atharva, et al.
Published: (2025)

Side-by-side Comparison Amplifies Dialect Bias in Language Models
by: Kondapally, Kritee, et al.
Published: (2026)

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
by: Welleck, Sean, et al.
Published: (2024)

Disentangling Geometry, Performance, and Training in Language Models
by: Kulkarni, Atharva, et al.
Published: (2026)

Evaluation Under Imperfect Benchmarks and Ratings: A Case Study in Text Simplification
by: Liu, Joseph, et al.
Published: (2025)

Crowd-Calibrator: Can Annotator Disagreement Inform Calibration in Subjective Tasks?
by: Khurana, Urja, et al.
Published: (2024)

Self-Directed Synthetic Dialogues and Revisions Technical Report
by: Lambert, Nathan, et al.
Published: (2024)

Post-training an LLM for RAG? Train on Self-Generated Demonstrations
by: Finlayson, Matthew, et al.
Published: (2025)

Olmix: A Framework for Data Mixing Throughout LM Development
by: Chen, Mayee F., et al.
Published: (2026)

BenchBrowser: Retrieving Evidence for Evaluating Benchmark Validity
by: Diddee, Harshita, et al.
Published: (2026)

Improving Language Model Personas via Rationalization with Psychological Scaffolds
by: Joshi, Brihi, et al.
Published: (2025)

What's In My Big Data?
by: Elazar, Yanai, et al.
Published: (2023)

DataDecide: How to Predict Best Pretraining Data with Small Experiments
by: Magnusson, Ian, et al.
Published: (2025)

Generative Explanations for Program Synthesizers
by: Nazari, Amirmohammad, et al.
Published: (2024)

Proving membership in LLM pretraining data via data watermarks
by: Wei, Johnny Tian-Zheng, et al.
Published: (2024)

Believing without Seeing: Quality Scores for Contextualizing Vision-Language Model Explanations
by: He, Keyu, et al.
Published: (2025)

Sample, Align, Synthesize: Graph-Based Response Synthesis with ConGrs
by: Ghosh, Sayan, et al.
Published: (2025)

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge
by: Howard, Phillip, et al.
Published: (2023)

Verify with Caution: The Pitfalls of Relying on Imperfect Factuality Metrics
by: Godbole, Ameya, et al.
Published: (2025)

Pre-trained Large Language Models Use Fourier Features to Compute Addition
by: Zhou, Tianyi, et al.
Published: (2024)

Function Induction and Task Generalization: An Interpretability Study with Off-by-One Addition
by: Ye, Qinyuan, et al.
Published: (2025)

Teaching Your Models to Understand Code via Focal Preference Alignment
by: Wu, Jie, et al.
Published: (2025)

Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries
by: Yan, Tianyi Lorena, et al.
Published: (2025)

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
by: Sundaram, Shobhita, et al.
Published: (2026)

How Can Large Language Models Understand Spatial-Temporal Data?
by: Liu, Lei, et al.
Published: (2024)

Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach
by: Math, Hugo, et al.
Published: (2024)

Hubble: a Model Suite to Advance the Study of LLM Memorization
by: Wei, Johnny Tian-Zheng, et al.
Published: (2025)

Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
by: Gan, Woody Haosheng, et al.
Published: (2025)

Out-of-Distribution Detection through Soft Clustering with Non-Negative Kernel Regression
by: Gulati, Aryan, et al.
Published: (2024)

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
by: Shao, Rulin, et al.
Published: (2025)