Salvato in:
Dettagli Bibliografici
Autori principali: Zhang, Zhuoxuan, Duan, Jinhao, Kim, Edward, Xu, Kaidi
Natura: Preprint
Pubblicazione: 2025
Soggetti:
Accesso online:https://arxiv.org/abs/2509.13664
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866911158326263808
author Zhang, Zhuoxuan
Duan, Jinhao
Kim, Edward
Xu, Kaidi
author_facet Zhang, Zhuoxuan
Duan, Jinhao
Kim, Edward
Xu, Kaidi
contents Ambiguity is pervasive in real-world questions, yet large language models (LLMs) often respond with confident answers rather than seeking clarification. In this work, we show that question ambiguity is linearly encoded in the internal representations of LLMs and can be both detected and controlled at the neuron level. During the model's pre-filling stage, we identify that a small number of neurons, as few as one, encode question ambiguity information. Probes trained on these Ambiguity-Encoding Neurons (AENs) achieve strong performance on ambiguity detection and generalize across datasets, outperforming prompting-based and representation-based baselines. Layerwise analysis reveals that AENs emerge from shallow layers, suggesting early encoding of ambiguity signals in the model's processing pipeline. Finally, we show that through manipulating AENs, we can control LLM's behavior from direct answering to abstention. Our findings reveal that LLMs form compact internal representations of question ambiguity, enabling interpretable and controllable behavior.
format Preprint
id arxiv_https___arxiv_org_abs_2509_13664
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
Zhang, Zhuoxuan
Duan, Jinhao
Kim, Edward
Xu, Kaidi
Computation and Language
Artificial Intelligence
Ambiguity is pervasive in real-world questions, yet large language models (LLMs) often respond with confident answers rather than seeking clarification. In this work, we show that question ambiguity is linearly encoded in the internal representations of LLMs and can be both detected and controlled at the neuron level. During the model's pre-filling stage, we identify that a small number of neurons, as few as one, encode question ambiguity information. Probes trained on these Ambiguity-Encoding Neurons (AENs) achieve strong performance on ambiguity detection and generalize across datasets, outperforming prompting-based and representation-based baselines. Layerwise analysis reveals that AENs emerge from shallow layers, suggesting early encoding of ambiguity signals in the model's processing pipeline. Finally, we show that through manipulating AENs, we can control LLM's behavior from direct answering to abstention. Our findings reveal that LLMs form compact internal representations of question ambiguity, enabling interpretable and controllable behavior.
title Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2509.13664