Saved in:
| Main Authors: | Pope, Quintin, Balaji, Ajay Hayagreeve, Thibodeau, Jacques, Fern, Xiaoli |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.05090 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Finding your MUSE: Mining Unexpected Solutions Engine
by: Sweed, Nir, et al.
Published: (2025)
by: Sweed, Nir, et al.
Published: (2025)
Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes
by: Myers, Quintin, et al.
Published: (2025)
by: Myers, Quintin, et al.
Published: (2025)
Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
Large Language Models in the Task of Automatic Validation of Text Classifier Predictions
by: Tsymbalov, Aleksandr, et al.
Published: (2025)
by: Tsymbalov, Aleksandr, et al.
Published: (2025)
Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
by: Bohnet, Bernd, et al.
Published: (2024)
by: Bohnet, Bernd, et al.
Published: (2024)
Side-by-side Comparison Amplifies Dialect Bias in Language Models
by: Kondapally, Kritee, et al.
Published: (2026)
by: Kondapally, Kritee, et al.
Published: (2026)
Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models
by: Bianco, Pedro Dal, et al.
Published: (2026)
by: Bianco, Pedro Dal, et al.
Published: (2026)
Automatic Instruction Evolving for Large Language Models
by: Zeng, Weihao, et al.
Published: (2024)
by: Zeng, Weihao, et al.
Published: (2024)
AutoMix: Automatically Mixing Language Models
by: Aggarwal, Pranjal, et al.
Published: (2023)
by: Aggarwal, Pranjal, et al.
Published: (2023)
KGValidator: A Framework for Automatic Validation of Knowledge Graph Construction
by: Boylan, Jack, et al.
Published: (2024)
by: Boylan, Jack, et al.
Published: (2024)
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding
by: KJ, Sankalp, et al.
Published: (2025)
by: KJ, Sankalp, et al.
Published: (2025)
ARS: Automatic Routing Solver with Large Language Models
by: Li, Kai, et al.
Published: (2025)
by: Li, Kai, et al.
Published: (2025)
Arabic Automatic Story Generation with Large Language Models
by: El-Shangiti, Ahmed Oumar, et al.
Published: (2024)
by: El-Shangiti, Ahmed Oumar, et al.
Published: (2024)
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language Models
by: Bhattacharjya, Debarun, et al.
Published: (2025)
by: Bhattacharjya, Debarun, et al.
Published: (2025)
Automatic Adaptation Rule Optimization via Large Language Models
by: Ishimizu, Yusei, et al.
Published: (2024)
by: Ishimizu, Yusei, et al.
Published: (2024)
Automatically Planning Optimal Parallel Strategy for Large Language Models
by: Li, Zongbiao, et al.
Published: (2024)
by: Li, Zongbiao, et al.
Published: (2024)
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
by: Murthy, Rithesh, et al.
Published: (2025)
by: Murthy, Rithesh, et al.
Published: (2025)
Information Flow Routes: Automatically Interpreting Language Models at Scale
by: Ferrando, Javier, et al.
Published: (2024)
by: Ferrando, Javier, et al.
Published: (2024)
Applying Large Language Models and Chain-of-Thought for Automatic Scoring
by: Lee, Gyeong-Geon, et al.
Published: (2023)
by: Lee, Gyeong-Geon, et al.
Published: (2023)
Automatic Input Rewriting Improves Translation with Large Language Models
by: Ki, Dayeon, et al.
Published: (2025)
by: Ki, Dayeon, et al.
Published: (2025)
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
by: Kahng, Minsuk, et al.
Published: (2024)
by: Kahng, Minsuk, et al.
Published: (2024)
Steering Language Models Before They Speak: Logit-Level Interventions
by: An, Hyeseon, et al.
Published: (2026)
by: An, Hyeseon, et al.
Published: (2026)
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models
by: Suau, Xavier, et al.
Published: (2024)
by: Suau, Xavier, et al.
Published: (2024)
Evaluating Large Language Models on Rare Disease Diagnosis: A Case Study using House M.D
by: Gupta, Arsh, et al.
Published: (2025)
by: Gupta, Arsh, et al.
Published: (2025)
The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It
by: Bertolazzi, Leonardo, et al.
Published: (2025)
by: Bertolazzi, Leonardo, et al.
Published: (2025)
Can Large Language Models Automatically Score Proficiency of Written Essays?
by: Mansour, Watheq, et al.
Published: (2024)
by: Mansour, Watheq, et al.
Published: (2024)
CausalDetox: Causal Head Selection and Intervention for Language Model Detoxification
by: Wang, Yian, et al.
Published: (2026)
by: Wang, Yian, et al.
Published: (2026)
Consistency-Aware Editing for Entity-level Unlearning in Language Models
by: Han, Xiaoqi, et al.
Published: (2025)
by: Han, Xiaoqi, et al.
Published: (2025)
Finding Answers in Thought Matters: Revisiting Evaluation on Large Language Models with Reasoning
by: Jo, Hwiyeol, et al.
Published: (2025)
by: Jo, Hwiyeol, et al.
Published: (2025)
Human-Calibrated Automated Testing and Validation of Generative Language Models
by: Sudjianto, Agus, et al.
Published: (2024)
by: Sudjianto, Agus, et al.
Published: (2024)
Automatic Question & Answer Generation Using Generative Large Language Model (LLM)
by: Ehsan, Md. Alvee, et al.
Published: (2025)
by: Ehsan, Md. Alvee, et al.
Published: (2025)
Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models
by: Na, Injae, et al.
Published: (2025)
by: Na, Injae, et al.
Published: (2025)
Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models
by: Sun, Zhouhao, et al.
Published: (2025)
by: Sun, Zhouhao, et al.
Published: (2025)
Non-linear Interventions on Large Language Models
by: Kim, Sangwoo
Published: (2026)
by: Kim, Sangwoo
Published: (2026)
Knowledge AI: Fine-tuning NLP Models for Facilitating Scientific Knowledge Extraction and Understanding
by: Muralidharan, Balaji, et al.
Published: (2024)
by: Muralidharan, Balaji, et al.
Published: (2024)
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
by: Bhambri, Siddhant, et al.
Published: (2025)
by: Bhambri, Siddhant, et al.
Published: (2025)
Measuring what Matters: Construct Validity in Large Language Model Benchmarks
by: Bean, Andrew M., et al.
Published: (2025)
by: Bean, Andrew M., et al.
Published: (2025)
Finding Culture-Sensitive Neurons in Vision-Language Models
by: Zhao, Xiutian, et al.
Published: (2025)
by: Zhao, Xiutian, et al.
Published: (2025)
Using Large Language Models to Create AI Personas for Replication, Generalization and Prediction of Media Effects: An Empirical Test of 133 Published Experimental Research Findings
by: Yeykelis, Leo, et al.
Published: (2024)
by: Yeykelis, Leo, et al.
Published: (2024)
Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models
by: Lyu, Mengxian, et al.
Published: (2026)
by: Lyu, Mengxian, et al.
Published: (2026)
Similar Items
-
Finding your MUSE: Mining Unexpected Solutions Engine
by: Sweed, Nir, et al.
Published: (2025) -
Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes
by: Myers, Quintin, et al.
Published: (2025) -
Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models
by: Ramakrishnan, Badrinath, et al.
Published: (2025) -
Large Language Models in the Task of Automatic Validation of Text Classifier Predictions
by: Tsymbalov, Aleksandr, et al.
Published: (2025) -
Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
by: Bohnet, Bernd, et al.
Published: (2024)