:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pope, Quintin, Balaji, Ajay Hayagreeve, Thibodeau, Jacques, Fern, Xiaoli
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.05090
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Finding your MUSE: Mining Unexpected Solutions Engine
by: Sweed, Nir, et al.
Published: (2025)

Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes
by: Myers, Quintin, et al.
Published: (2025)

Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models
by: Ramakrishnan, Badrinath, et al.
Published: (2025)

Large Language Models in the Task of Automatic Validation of Text Classifier Predictions
by: Tsymbalov, Aleksandr, et al.
Published: (2025)

Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
by: Bohnet, Bernd, et al.
Published: (2024)

Side-by-side Comparison Amplifies Dialect Bias in Language Models
by: Kondapally, Kritee, et al.
Published: (2026)

Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models
by: Bianco, Pedro Dal, et al.
Published: (2026)

Automatic Instruction Evolving for Large Language Models
by: Zeng, Weihao, et al.
Published: (2024)

AutoMix: Automatically Mixing Language Models
by: Aggarwal, Pranjal, et al.
Published: (2023)

KGValidator: A Framework for Automatic Validation of Knowledge Graph Construction
by: Boylan, Jack, et al.
Published: (2024)

IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding
by: KJ, Sankalp, et al.
Published: (2025)

ARS: Automatic Routing Solver with Large Language Models
by: Li, Kai, et al.
Published: (2025)

Arabic Automatic Story Generation with Large Language Models
by: El-Shangiti, Ahmed Oumar, et al.
Published: (2024)

SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language Models
by: Bhattacharjya, Debarun, et al.
Published: (2025)

Automatic Adaptation Rule Optimization via Large Language Models
by: Ishimizu, Yusei, et al.
Published: (2024)

Automatically Planning Optimal Parallel Strategy for Large Language Models
by: Li, Zongbiao, et al.
Published: (2024)

Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
by: Murthy, Rithesh, et al.
Published: (2025)

Information Flow Routes: Automatically Interpreting Language Models at Scale
by: Ferrando, Javier, et al.
Published: (2024)

Applying Large Language Models and Chain-of-Thought for Automatic Scoring
by: Lee, Gyeong-Geon, et al.
Published: (2023)

Automatic Input Rewriting Improves Translation with Large Language Models
by: Ki, Dayeon, et al.
Published: (2025)

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
by: Kahng, Minsuk, et al.
Published: (2024)

Steering Language Models Before They Speak: Logit-Level Interventions
by: An, Hyeseon, et al.
Published: (2026)

Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models
by: Suau, Xavier, et al.
Published: (2024)

Evaluating Large Language Models on Rare Disease Diagnosis: A Case Study using House M.D
by: Gupta, Arsh, et al.
Published: (2025)

The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It
by: Bertolazzi, Leonardo, et al.
Published: (2025)

Can Large Language Models Automatically Score Proficiency of Written Essays?
by: Mansour, Watheq, et al.
Published: (2024)

CausalDetox: Causal Head Selection and Intervention for Language Model Detoxification
by: Wang, Yian, et al.
Published: (2026)

Consistency-Aware Editing for Entity-level Unlearning in Language Models
by: Han, Xiaoqi, et al.
Published: (2025)

Finding Answers in Thought Matters: Revisiting Evaluation on Large Language Models with Reasoning
by: Jo, Hwiyeol, et al.
Published: (2025)

Human-Calibrated Automated Testing and Validation of Generative Language Models
by: Sudjianto, Agus, et al.
Published: (2024)

Automatic Question & Answer Generation Using Generative Large Language Model (LLM)
by: Ehsan, Md. Alvee, et al.
Published: (2025)

Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models
by: Na, Injae, et al.
Published: (2025)

Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models
by: Sun, Zhouhao, et al.
Published: (2025)

Non-linear Interventions on Large Language Models
by: Kim, Sangwoo
Published: (2026)

Knowledge AI: Fine-tuning NLP Models for Facilitating Scientific Knowledge Extraction and Understanding
by: Muralidharan, Balaji, et al.
Published: (2024)

Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
by: Bhambri, Siddhant, et al.
Published: (2025)

Measuring what Matters: Construct Validity in Large Language Model Benchmarks
by: Bean, Andrew M., et al.
Published: (2025)

Finding Culture-Sensitive Neurons in Vision-Language Models
by: Zhao, Xiutian, et al.
Published: (2025)

Using Large Language Models to Create AI Personas for Replication, Generalization and Prediction of Media Effects: An Empirical Test of 133 Published Experimental Research Findings
by: Yeykelis, Leo, et al.
Published: (2024)

Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models
by: Lyu, Mengxian, et al.
Published: (2026)