Enregistré dans:
Détails bibliographiques
Auteurs principaux: Tang, Chengyu, Baskiyar, Sanjeev
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2512.03563
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866914487636852736
author Tang, Chengyu
Baskiyar, Sanjeev
author_facet Tang, Chengyu
Baskiyar, Sanjeev
contents In this study, we evaluate the efficacy of the Mamba architecture bioacoustics by introducing BioMamba, a Mamba-based audio representation model for wildlife sounds. We pre-train a BioMamba using self-supervised learning on a large audio corpus and evaluate it on the BEANS benchmark across diverse classification and detection tasks. Compared to the state-of-the-art Transformer-based model (AVES), BioMamba achieves comparable performance while significantly reducing VRAM consumption. Our results demonstrate Mamba's potential as a computationally efficient alternative for real-world environmental monitoring.
format Preprint
id arxiv_https___arxiv_org_abs_2512_03563
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle State Space Models for Bioacoustics: A Comparative Evaluation with Transformers
Tang, Chengyu
Baskiyar, Sanjeev
Sound
Artificial Intelligence
In this study, we evaluate the efficacy of the Mamba architecture bioacoustics by introducing BioMamba, a Mamba-based audio representation model for wildlife sounds. We pre-train a BioMamba using self-supervised learning on a large audio corpus and evaluate it on the BEANS benchmark across diverse classification and detection tasks. Compared to the state-of-the-art Transformer-based model (AVES), BioMamba achieves comparable performance while significantly reducing VRAM consumption. Our results demonstrate Mamba's potential as a computationally efficient alternative for real-world environmental monitoring.
title State Space Models for Bioacoustics: A Comparative Evaluation with Transformers
topic Sound
Artificial Intelligence
url https://arxiv.org/abs/2512.03563