Saved in:
| Main Authors: | Thewes, Nicolas, Steinhauer, Philipp, Trampert, Patrick, Pauly, Markus, Schneider, Georg |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.21921 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Pièces de viole des Cinq Livres and their statistical signatures: the musical work of Marin Marais and Jordi Savall
by: Lugo, Igor, et al.
Published: (2024)
by: Lugo, Igor, et al.
Published: (2024)
Complexity of frequency fluctuations and the interpretive style in the bass viola da gamba
by: Lugo, Igor, et al.
Published: (2025)
by: Lugo, Igor, et al.
Published: (2025)
Generative AI-based data augmentation for improved bioacoustic classification in noisy environments
by: Gibbons, Anthony, et al.
Published: (2024)
by: Gibbons, Anthony, et al.
Published: (2024)
The Rest is Silence: Leveraging Unseen Species Models for Computational Musicology
by: Moss, Fabian C., et al.
Published: (2025)
by: Moss, Fabian C., et al.
Published: (2025)
Deep functional multiple index models with an application to SER
by: Saumard, Matthieu, et al.
Published: (2024)
by: Saumard, Matthieu, et al.
Published: (2024)
Audio signal interpolation using optimal transportation of spectrograms
by: Valdivia, David, et al.
Published: (2025)
by: Valdivia, David, et al.
Published: (2025)
Musical composition and 2D cellular automata based on music intervals
by: Lugo, Igor, et al.
Published: (2024)
by: Lugo, Igor, et al.
Published: (2024)
Dirichlet process mixture model based on topologically augmented signal representation for clustering infant vocalizations
by: Bonafos, Guillem, et al.
Published: (2024)
by: Bonafos, Guillem, et al.
Published: (2024)
Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition
by: Razig, Amine, et al.
Published: (2025)
by: Razig, Amine, et al.
Published: (2025)
Frequency-aware convolution for sound event detection
by: Song, Tao, et al.
Published: (2024)
by: Song, Tao, et al.
Published: (2024)
Explainable speech emotion recognition through attentive pooling: insights from attention-based temporal localization
by: Leygue, Tahitoa, et al.
Published: (2025)
by: Leygue, Tahitoa, et al.
Published: (2025)
learning discriminative features from spectrograms using center loss for speech emotion recognition
by: Dai, Dongyang, et al.
Published: (2025)
by: Dai, Dongyang, et al.
Published: (2025)
Comparison of spectrogram scaling in multi-label Music Genre Recognition
by: Karpiński, Bartosz, et al.
Published: (2025)
by: Karpiński, Bartosz, et al.
Published: (2025)
Onset and offset weighted loss function for sound event detection
by: Song, Tao
Published: (2024)
by: Song, Tao
Published: (2024)
Fine-tune the pretrained ATST model for sound event detection
by: Shao, Nian, et al.
Published: (2023)
by: Shao, Nian, et al.
Published: (2023)
The impact of non-target events in synthetic soundscapes for sound event detection
by: Ronchini, Francesca, et al.
Published: (2021)
by: Ronchini, Francesca, et al.
Published: (2021)
Representational learning for an anomalous sound detection system with source separation model
by: Shin, Seunghyeon, et al.
Published: (2024)
by: Shin, Seunghyeon, et al.
Published: (2024)
The Neural-SRP method for positional sound source localization
by: Grinstein, Eric, et al.
Published: (2024)
by: Grinstein, Eric, et al.
Published: (2024)
Interaural time difference loss for binaural target sound extraction
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)
An overview of neural architectures for self-supervised audio representation learning from masked spectrograms
by: Yadav, Sarthak, et al.
Published: (2025)
by: Yadav, Sarthak, et al.
Published: (2025)
Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training
by: Kuroyanagi, Ibuki, et al.
Published: (2025)
by: Kuroyanagi, Ibuki, et al.
Published: (2025)
Bayesian Restoration of Audio Degraded by Low-Frequency Pulses Modeled via Gaussian Process
by: de Carvalho, Hugo Tremonte, et al.
Published: (2020)
by: de Carvalho, Hugo Tremonte, et al.
Published: (2020)
animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics
by: Schäfer-Zimmermann, Julian C., et al.
Published: (2024)
by: Schäfer-Zimmermann, Julian C., et al.
Published: (2024)
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
by: Yue, Haobo, et al.
Published: (2024)
by: Yue, Haobo, et al.
Published: (2024)
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling
by: Gao, Wenmiao, et al.
Published: (2025)
by: Gao, Wenmiao, et al.
Published: (2025)
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems
by: Ronchini, Francesca, et al.
Published: (2023)
by: Ronchini, Francesca, et al.
Published: (2023)
Non-locally averaged pruned reassigned spectrograms: a tool for glottal pulse visualization and analysis
by: Griswold, Gabriel J., et al.
Published: (2025)
by: Griswold, Gabriel J., et al.
Published: (2025)
Resnet-conformer network with shared weights and attention mechanism for sound event localization, detection, and distance estimation
by: Vo, Quoc Thinh, et al.
Published: (2025)
by: Vo, Quoc Thinh, et al.
Published: (2025)
Robust detection of overlapping bioacoustic sound events
by: Mahon, Louis, et al.
Published: (2025)
by: Mahon, Louis, et al.
Published: (2025)
Learning to detect an animal sound from five examples
by: Nolasco, Inês, et al.
Published: (2023)
by: Nolasco, Inês, et al.
Published: (2023)
The role of direct sound spherical harmonics representation in externalization using binaural reproduction
by: Miller, Eran, et al.
Published: (2024)
by: Miller, Eran, et al.
Published: (2024)
Multispecies bird sound recognition using a fully convolutional neural network
by: García-Ordás, María Teresa, et al.
Published: (2024)
by: García-Ordás, María Teresa, et al.
Published: (2024)
MelHuBERT: A simplified HuBERT on Mel spectrograms
by: Lin, Tzu-Quan, et al.
Published: (2022)
by: Lin, Tzu-Quan, et al.
Published: (2022)
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
by: Bataev, Vladimir, et al.
Published: (2023)
by: Bataev, Vladimir, et al.
Published: (2023)
Binaural sound source localization using a hybrid time and frequency domain model
by: Geva, Gil, et al.
Published: (2024)
by: Geva, Gil, et al.
Published: (2024)
Differentiable physics for sound field reconstruction
by: Verburg, Samuel A., et al.
Published: (2025)
by: Verburg, Samuel A., et al.
Published: (2025)
Synthetic data enables context-aware bioacoustic sound event detection
by: Hoffman, Benjamin, et al.
Published: (2025)
by: Hoffman, Benjamin, et al.
Published: (2025)
Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrograms
by: Wang, Heehwan, et al.
Published: (2024)
by: Wang, Heehwan, et al.
Published: (2024)
Some clues to build a sound analysis relevant to hearing
by: Millot, Laurent
Published: (2024)
by: Millot, Laurent
Published: (2024)
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes
by: Ronchini, Francesca, et al.
Published: (2022)
by: Ronchini, Francesca, et al.
Published: (2022)
Similar Items
-
Pièces de viole des Cinq Livres and their statistical signatures: the musical work of Marin Marais and Jordi Savall
by: Lugo, Igor, et al.
Published: (2024) -
Complexity of frequency fluctuations and the interpretive style in the bass viola da gamba
by: Lugo, Igor, et al.
Published: (2025) -
Generative AI-based data augmentation for improved bioacoustic classification in noisy environments
by: Gibbons, Anthony, et al.
Published: (2024) -
The Rest is Silence: Leveraging Unseen Species Models for Computational Musicology
by: Moss, Fabian C., et al.
Published: (2025) -
Deep functional multiple index models with an application to SER
by: Saumard, Matthieu, et al.
Published: (2024)