Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chen, Qixian, Xu, Yuxiong, Mandelli, Sara, Li, Sheng, Li, Bin
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2503.12010
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910935418929152
author	Chen, Qixian Xu, Yuxiong Mandelli, Sara Li, Sheng Li, Bin
author_facet	Chen, Qixian Xu, Yuxiong Mandelli, Sara Li, Sheng Li, Bin
contents	In audio spoofing detection, most studies rely on clean datasets, making models susceptible to real-world post-processing attacks, such as channel compression and noise. To overcome this challenge, we propose the Adaptive MixtUre Low-rank ExperTs (AMULET) framework, which enhances resilience by leveraging attack-specific knowledge and dynamically adapting to varied attack conditions. Specifically, AMULET employs Attack-Specific Experts (ASEs) fine-tuned with Low-Rank Adaptation (LoRA), allowing each expert to focus on distinct post-processing patterns using just 1.13\% of the parameters required for full fine-tuning. Furthermore, we introduce Adaptive Expert Fusion (AEF), which adaptively selects and integrates expert knowledge to enhance the robustness of spoofing detection. Experimental results demonstrate that AMULET significantly enhances robustness by improving noise resilience and exhibiting greater adaptability to unseen post-processing methods compared to models trained with full fine-tuning. Additionally, our framework outperforms both single expert and other expert aggregation strategies under various mixed attacks, demonstrating its superior robustness and adaptability in managing complex real-world scenarios.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_12010
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection Chen, Qixian Xu, Yuxiong Mandelli, Sara Li, Sheng Li, Bin Audio and Speech Processing In audio spoofing detection, most studies rely on clean datasets, making models susceptible to real-world post-processing attacks, such as channel compression and noise. To overcome this challenge, we propose the Adaptive MixtUre Low-rank ExperTs (AMULET) framework, which enhances resilience by leveraging attack-specific knowledge and dynamically adapting to varied attack conditions. Specifically, AMULET employs Attack-Specific Experts (ASEs) fine-tuned with Low-Rank Adaptation (LoRA), allowing each expert to focus on distinct post-processing patterns using just 1.13\% of the parameters required for full fine-tuning. Furthermore, we introduce Adaptive Expert Fusion (AEF), which adaptively selects and integrates expert knowledge to enhance the robustness of spoofing detection. Experimental results demonstrate that AMULET significantly enhances robustness by improving noise resilience and exhibiting greater adaptability to unseen post-processing methods compared to models trained with full fine-tuning. Additionally, our framework outperforms both single expert and other expert aggregation strategies under various mixed attacks, demonstrating its superior robustness and adaptability in managing complex real-world scenarios.
title	Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection
topic	Audio and Speech Processing
url	https://arxiv.org/abs/2503.12010

Similar Items