:: Library Catalog

Bibliographic Details
Main Authors:	Liang, Jinhua; Nolasco, Ines; Ghani, Burooj; Phan, Huy; Benetos, Emmanouil; Stowell, Dan
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2403.18638

Similar Items

Acoustic identification of individual animals with hierarchical contrastive learning
by: Nolasco, Ines, et al.
Published: (2024)

LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
by: Singh, Shubhr, et al.
Published: (2025)

InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
by: Faiß, Marius, et al.
Published: (2025)

From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems
by: Zhang, Huan, et al.
Published: (2025)

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities
by: Liang, Jinhua, et al.
Published: (2023)

Rank-based loss for learning hierarchical representations
by: Nolasco, Ines, et al.
Published: (2021)

Domain-Invariant Representation Learning of Bird Sounds
by: Moummad, Ilyass, et al.
Published: (2024)

Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics
by: Ghani, Burooj, et al.
Published: (2024)

WavCraft: Audio Editing and Generation with Large Language Models
by: Liang, Jinhua, et al.
Published: (2024)

Adaptive Representations of Sound for Automatic Insect Recognition
by: Faiß, Marius, et al.
Published: (2023)

Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
by: Huang, Jiawen, et al.
Published: (2024)

Few-Shot Bioacoustic Event Detection with Frame-Level Embedding Learning System
by: Zhao, PengYuan, et al.
Published: (2024)

RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
by: Chang, Sungkyun, et al.
Published: (2025)

Computational bioacoustics with deep learning: a review and roadmap
by: Stowell, Dan
Published: (2021)

Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection
by: Zhang, Shiqi, et al.
Published: (2025)

Automated data curation for self-supervised learning in underwater acoustic analysis
by: Hummel, Hilde I., et al.
Published: (2025)

LC-Protonets: Multi-Label Few-Shot Learning for World Music Audio Tagging
by: Papaioannou, Charilaos, et al.
Published: (2024)

Learning Music Audio Representations With Limited Data
by: Plachouras, Christos, et al.
Published: (2025)

Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning
by: Lindholm, Richard, et al.
Published: (2025)

Adaptive Learning via a Negative Selection Strategy for Few-Shot Bioacoustic Event Detection
by: Chen, Yaxiong, et al.
Published: (2024)

Decodable but not structured: linear probing enables Underwater Acoustic Target Recognition with pretrained audio embeddings
by: Hummel, Hilde I., et al.
Published: (2026)

Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss
by: Huang, Jiawen, et al.
Published: (2025)

Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection
by: Moummad, Ilyass, et al.
Published: (2023)

A Data-Driven Analysis of Robust Automatic Piano Transcription
by: Edwards, Drew, et al.
Published: (2024)

DG-SED: Domain Generalization for Sound Event Detection with Heterogeneous Training Data
by: Xiao, Yang, et al.
Published: (2024)

YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
by: Chang, Sungkyun, et al.
Published: (2024)

SCRAPL: Scattering Transform with Random Paths for Machine Learning
by: Mitcheltree, Christopher, et al.
Published: (2026)

Mind the Gap: Detecting Cluster Exits for Robust Local Density-Based Score Normalization in Anomalous Sound Detection
by: Wilkinghoff, Kevin, et al.
Published: (2026)

Universal Music Representations? Evaluating Foundation Models on World Music Corpora
by: Papaioannou, Charilaos, et al.
Published: (2025)

The Search for Squawk: Agile Modeling in Bioacoustics
by: Dumoulin, Vincent, et al.
Published: (2025)

Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge
by: Stowell, Dan, et al.
Published: (2018)

Robust Bioacoustic Detection via Richly Labelled Synthetic Soundscape Augmentation
by: Soltero, Kaspar, et al.
Published: (2025)

Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
by: Postolache, Emilian, et al.
Published: (2024)

Classification of Spontaneous and Scripted Speech for Multilingual Audio
by: Elisha, Shahar, et al.
Published: (2024)

Learning Domain-Robust Bioacoustic Representations for Mosquito Species Classification with Contrastive Learning and Distribution Alignment
by: Hou, Yuanbo, et al.
Published: (2025)

Class-Incremental Learning for Sound Event Localization and Detection
by: Pandey, Ruchi, et al.
Published: (2024)

Audio-JEPA: Joint-Embedding Predictive Architecture for Audio Representation Learning
by: Tuncay, Ludovic, et al.
Published: (2025)

ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization
by: Steinmetz, Christian J., et al.
Published: (2024)

Frequency Dynamic Convolutions for Sound Event Detection
by: Nam, Hyeonuk
Published: (2025)

Zero- and Few-shot Sound Event Localization and Detection
by: Shimada, Kazuki, et al.
Published: (2023)