:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Thewes, Nicolas, Steinhauer, Philipp, Trampert, Patrick, Pauly, Markus, Schneider, Georg
Format:	Preprint
Published:	2025
Subjects:	Applications Sound Audio and Speech Processing Computation 62 G.3
Online Access:	https://arxiv.org/abs/2506.21921
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Pièces de viole des Cinq Livres and their statistical signatures: the musical work of Marin Marais and Jordi Savall
by: Lugo, Igor, et al.
Published: (2024)

Complexity of frequency fluctuations and the interpretive style in the bass viola da gamba
by: Lugo, Igor, et al.
Published: (2025)

Generative AI-based data augmentation for improved bioacoustic classification in noisy environments
by: Gibbons, Anthony, et al.
Published: (2024)

The Rest is Silence: Leveraging Unseen Species Models for Computational Musicology
by: Moss, Fabian C., et al.
Published: (2025)

Deep functional multiple index models with an application to SER
by: Saumard, Matthieu, et al.
Published: (2024)

Audio signal interpolation using optimal transportation of spectrograms
by: Valdivia, David, et al.
Published: (2025)

Musical composition and 2D cellular automata based on music intervals
by: Lugo, Igor, et al.
Published: (2024)

Dirichlet process mixture model based on topologically augmented signal representation for clustering infant vocalizations
by: Bonafos, Guillem, et al.
Published: (2024)

Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition
by: Razig, Amine, et al.
Published: (2025)

Frequency-aware convolution for sound event detection
by: Song, Tao, et al.
Published: (2024)

Explainable speech emotion recognition through attentive pooling: insights from attention-based temporal localization
by: Leygue, Tahitoa, et al.
Published: (2025)

learning discriminative features from spectrograms using center loss for speech emotion recognition
by: Dai, Dongyang, et al.
Published: (2025)

Comparison of spectrogram scaling in multi-label Music Genre Recognition
by: Karpiński, Bartosz, et al.
Published: (2025)

Onset and offset weighted loss function for sound event detection
by: Song, Tao
Published: (2024)

Fine-tune the pretrained ATST model for sound event detection
by: Shao, Nian, et al.
Published: (2023)

The impact of non-target events in synthetic soundscapes for sound event detection
by: Ronchini, Francesca, et al.
Published: (2021)

Representational learning for an anomalous sound detection system with source separation model
by: Shin, Seunghyeon, et al.
Published: (2024)

The Neural-SRP method for positional sound source localization
by: Grinstein, Eric, et al.
Published: (2024)

Interaural time difference loss for binaural target sound extraction
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)

An overview of neural architectures for self-supervised audio representation learning from masked spectrograms
by: Yadav, Sarthak, et al.
Published: (2025)

Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training
by: Kuroyanagi, Ibuki, et al.
Published: (2025)

Bayesian Restoration of Audio Degraded by Low-Frequency Pulses Modeled via Gaussian Process
by: de Carvalho, Hugo Tremonte, et al.
Published: (2020)

animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics
by: Schäfer-Zimmermann, Julian C., et al.
Published: (2024)

Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
by: Yue, Haobo, et al.
Published: (2024)

Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling
by: Gao, Wenmiao, et al.
Published: (2025)

Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems
by: Ronchini, Francesca, et al.
Published: (2023)

Non-locally averaged pruned reassigned spectrograms: a tool for glottal pulse visualization and analysis
by: Griswold, Gabriel J., et al.
Published: (2025)

Resnet-conformer network with shared weights and attention mechanism for sound event localization, detection, and distance estimation
by: Vo, Quoc Thinh, et al.
Published: (2025)

Robust detection of overlapping bioacoustic sound events
by: Mahon, Louis, et al.
Published: (2025)

Learning to detect an animal sound from five examples
by: Nolasco, Inês, et al.
Published: (2023)

The role of direct sound spherical harmonics representation in externalization using binaural reproduction
by: Miller, Eran, et al.
Published: (2024)

Multispecies bird sound recognition using a fully convolutional neural network
by: García-Ordás, María Teresa, et al.
Published: (2024)

MelHuBERT: A simplified HuBERT on Mel spectrograms
by: Lin, Tzu-Quan, et al.
Published: (2022)

Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
by: Bataev, Vladimir, et al.
Published: (2023)

Binaural sound source localization using a hybrid time and frequency domain model
by: Geva, Gil, et al.
Published: (2024)

Differentiable physics for sound field reconstruction
by: Verburg, Samuel A., et al.
Published: (2025)

Synthetic data enables context-aware bioacoustic sound event detection
by: Hoffman, Benjamin, et al.
Published: (2025)

Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrograms
by: Wang, Heehwan, et al.
Published: (2024)

Some clues to build a sound analysis relevant to hearing
by: Millot, Laurent
Published: (2024)

A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes
by: Ronchini, Francesca, et al.
Published: (2022)