Saved in:
Bibliographic Details
Main Authors: Chen, Qixian, Xu, Yuxiong, Mandelli, Sara, Li, Sheng, Li, Bin
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.12010
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910935418929152
author Chen, Qixian
Xu, Yuxiong
Mandelli, Sara
Li, Sheng
Li, Bin
author_facet Chen, Qixian
Xu, Yuxiong
Mandelli, Sara
Li, Sheng
Li, Bin
contents In audio spoofing detection, most studies rely on clean datasets, making models susceptible to real-world post-processing attacks, such as channel compression and noise. To overcome this challenge, we propose the Adaptive MixtUre Low-rank ExperTs (AMULET) framework, which enhances resilience by leveraging attack-specific knowledge and dynamically adapting to varied attack conditions. Specifically, AMULET employs Attack-Specific Experts (ASEs) fine-tuned with Low-Rank Adaptation (LoRA), allowing each expert to focus on distinct post-processing patterns using just 1.13\% of the parameters required for full fine-tuning. Furthermore, we introduce Adaptive Expert Fusion (AEF), which adaptively selects and integrates expert knowledge to enhance the robustness of spoofing detection. Experimental results demonstrate that AMULET significantly enhances robustness by improving noise resilience and exhibiting greater adaptability to unseen post-processing methods compared to models trained with full fine-tuning. Additionally, our framework outperforms both single expert and other expert aggregation strategies under various mixed attacks, demonstrating its superior robustness and adaptability in managing complex real-world scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2503_12010
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection
Chen, Qixian
Xu, Yuxiong
Mandelli, Sara
Li, Sheng
Li, Bin
Audio and Speech Processing
In audio spoofing detection, most studies rely on clean datasets, making models susceptible to real-world post-processing attacks, such as channel compression and noise. To overcome this challenge, we propose the Adaptive MixtUre Low-rank ExperTs (AMULET) framework, which enhances resilience by leveraging attack-specific knowledge and dynamically adapting to varied attack conditions. Specifically, AMULET employs Attack-Specific Experts (ASEs) fine-tuned with Low-Rank Adaptation (LoRA), allowing each expert to focus on distinct post-processing patterns using just 1.13\% of the parameters required for full fine-tuning. Furthermore, we introduce Adaptive Expert Fusion (AEF), which adaptively selects and integrates expert knowledge to enhance the robustness of spoofing detection. Experimental results demonstrate that AMULET significantly enhances robustness by improving noise resilience and exhibiting greater adaptability to unseen post-processing methods compared to models trained with full fine-tuning. Additionally, our framework outperforms both single expert and other expert aggregation strategies under various mixed attacks, demonstrating its superior robustness and adaptability in managing complex real-world scenarios.
title Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection
topic Audio and Speech Processing
url https://arxiv.org/abs/2503.12010