MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Zhang, Zhuoxuan, Duan, Jinhao, Kim, Edward, Xu, Kaidi
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Computation and Language Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2509.13664
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866911158326263808
author	Zhang, Zhuoxuan Duan, Jinhao Kim, Edward Xu, Kaidi
author_facet	Zhang, Zhuoxuan Duan, Jinhao Kim, Edward Xu, Kaidi
contents	Ambiguity is pervasive in real-world questions, yet large language models (LLMs) often respond with confident answers rather than seeking clarification. In this work, we show that question ambiguity is linearly encoded in the internal representations of LLMs and can be both detected and controlled at the neuron level. During the model's pre-filling stage, we identify that a small number of neurons, as few as one, encode question ambiguity information. Probes trained on these Ambiguity-Encoding Neurons (AENs) achieve strong performance on ambiguity detection and generalize across datasets, outperforming prompting-based and representation-based baselines. Layerwise analysis reveals that AENs emerge from shallow layers, suggesting early encoding of ambiguity signals in the model's processing pipeline. Finally, we show that through manipulating AENs, we can control LLM's behavior from direct answering to abstention. Our findings reveal that LLMs form compact internal representations of question ambiguity, enabling interpretable and controllable behavior.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_13664
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs Zhang, Zhuoxuan Duan, Jinhao Kim, Edward Xu, Kaidi Computation and Language Artificial Intelligence Ambiguity is pervasive in real-world questions, yet large language models (LLMs) often respond with confident answers rather than seeking clarification. In this work, we show that question ambiguity is linearly encoded in the internal representations of LLMs and can be both detected and controlled at the neuron level. During the model's pre-filling stage, we identify that a small number of neurons, as few as one, encode question ambiguity information. Probes trained on these Ambiguity-Encoding Neurons (AENs) achieve strong performance on ambiguity detection and generalize across datasets, outperforming prompting-based and representation-based baselines. Layerwise analysis reveals that AENs emerge from shallow layers, suggesting early encoding of ambiguity signals in the model's processing pipeline. Finally, we show that through manipulating AENs, we can control LLM's behavior from direct answering to abstention. Our findings reveal that LLMs form compact internal representations of question ambiguity, enabling interpretable and controllable behavior.
title	Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2509.13664

Documenti analoghi