Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lin, Yu-Xiang, Li, Chen-An, Wei, Sheng-Lun, Chen, Po-Chun, Chen, Hsin-Hsi, Lee, Hung-yi
Format:	Preprint
Published:	2025
Subjects:	Sound Computation and Language
Online Access:	https://arxiv.org/abs/2510.00628
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918352290578432
author	Lin, Yu-Xiang Li, Chen-An Wei, Sheng-Lun Chen, Po-Chun Chen, Hsin-Hsi Lee, Hung-yi
author_facet	Lin, Yu-Xiang Li, Chen-An Wei, Sheng-Lun Chen, Po-Chun Chen, Hsin-Hsi Lee, Hung-yi
contents	Large audio-language models (LALMs) are often used in tasks that involve reasoning over ordered options. An open question is whether their predictions are influenced by the order of answer choices, which would indicate a form of position bias and undermine their reliability. In this paper, we identify and analyze this problem in LALMs. We demonstrate that no model is immune to this bias through extensive experiments on six LALMs across three widely used benchmarks and their spoken counterparts. Shuffling the order of answer options can cause performance fluctuations of up to 24% and even change model rankings, raising concerns about the reliability of current evaluation practices. We also study permutation-based strategies and show that they can mitigate bias in most cases. Our work represents the first systematic investigation of this issue in LALMs, and we hope it raises awareness and motivates further research in this direction.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_00628
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Hearing the Order: Investigating Position Bias in Large Audio-Language Models Lin, Yu-Xiang Li, Chen-An Wei, Sheng-Lun Chen, Po-Chun Chen, Hsin-Hsi Lee, Hung-yi Sound Computation and Language Large audio-language models (LALMs) are often used in tasks that involve reasoning over ordered options. An open question is whether their predictions are influenced by the order of answer choices, which would indicate a form of position bias and undermine their reliability. In this paper, we identify and analyze this problem in LALMs. We demonstrate that no model is immune to this bias through extensive experiments on six LALMs across three widely used benchmarks and their spoken counterparts. Shuffling the order of answer options can cause performance fluctuations of up to 24% and even change model rankings, raising concerns about the reliability of current evaluation practices. We also study permutation-based strategies and show that they can mitigate bias in most cases. Our work represents the first systematic investigation of this issue in LALMs, and we hope it raises awareness and motivates further research in this direction.
title	Hearing the Order: Investigating Position Bias in Large Audio-Language Models
topic	Sound Computation and Language
url	https://arxiv.org/abs/2510.00628

Similar Items