Library Catalog

Bibliographic Details
Main Authors:	Sechet, Dylan; Bugiotti, Francesca; Kowalski, Matthieu; d'Hérouville, Edouard; Langiewicz, Filip
Format:	Preprint
Published:	2025
Subjects:	Sound; Artificial Intelligence; Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2506.21167

Similar Items

Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes
by: Kukanov, Ivan, et al.
Published: (2024)

Music Genre Classification: A Comparative Analysis of Classical Machine Learning and Deep Learning Approaches
by: Prajuli, Sachin, et al.
Published: (2026)

HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection
by: Liang, Zhili Nicholas, et al.
Published: (2026)

HASS: Hierarchical Simulation of Logopenic Aphasic Speech for Scalable PPA Detection
by: Li, Harrison, et al.
Published: (2026)

A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement
by: Lu, Shenghui, et al.
Published: (2025)

Joint Learning of Emotions in Music and Generalized Sounds
by: Simonetta, Federico, et al.
Published: (2024)

Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models
by: Pham, Lam, et al.
Published: (2024)

Deep Generic Representations for Domain-Generalized Anomalous Sound Detection
by: Saengthong, Phurich, et al.
Published: (2024)

Deep Neural Network for Musical Instrument Recognition using MFCCs
by: Mahanta, Saranga Kingkor, et al.
Published: (2021)

Towards Deep Active Learning in Avian Bioacoustics
by: Rauch, Lukas, et al.
Published: (2024)

A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)
by: Pham, Lam, et al.
Published: (2024)

4,500 Seconds: Small Data Training Approaches for Deep UAV Audio Classification
by: Berg, Andrew P., et al.
Published: (2025)

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
by: Kheddar, Hamza, et al.
Published: (2024)

Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
by: Khan, Haris, et al.
Published: (2025)

Heart Sound Segmentation Using Deep Learning Techniques
by: Madine, Manas
Published: (2024)

Arabic Music Classification and Generation using Deep Learning
by: Elshaarawy, Mohamed, et al.
Published: (2024)

Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments
by: Djeffal, Noussaiba, et al.
Published: (2025)

Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks
by: Atif, Youness
Published: (2025)

MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System
by: Purohit, Harsh, et al.
Published: (2024)

Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
by: Colelough, Brandon, et al.
Published: (2024)

Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
by: Guo, Zilu, et al.
Published: (2023)

Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification
by: Liu, Bei, et al.
Published: (2024)

Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
by: Li, Yuxin, et al.
Published: (2025)

Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology
by: Dai, Weinan, et al.
Published: (2024)

End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization
by: Singh, Prachi, et al.
Published: (2024)

Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation
by: Liu, Shuyang, et al.
Published: (2025)

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2024)

Contrastive Learning with Spectrum Information Augmentation in Abnormal Sound Detection
by: Meng, Xinxin, et al.
Published: (2025)

Towards Attention-based Contrastive Learning for Audio Spoof Detection
by: Goel, Chirag, et al.
Published: (2024)

Region-Based Optimization in Continual Learning for Audio Deepfake Detection
by: Chen, Yujie, et al.
Published: (2024)

Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model
by: Oluwademilade, Adelekun, et al.
Published: (2026)

Explaining Deep Learning Embeddings for Speech Emotion Recognition by Predicting Interpretable Acoustic Features
by: Dixit, Satvik, et al.
Published: (2024)

A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection
by: Lee, Kyungbok, et al.
Published: (2024)

FastSLM: Hierarchical Temporal Abstraction for Efficient Long-Form Speech Adaptation
by: Lee, Junseok, et al.
Published: (2026)

Deep Learning for Speaker Identification: Architectural Insights from AB-1 Corpus Analysis and Performance Evaluation
by: Bartolo, Matthias
Published: (2024)

Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning
by: Lachenani, Sidahmed, et al.
Published: (2025)

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control
by: Wen, Yufan, et al.
Published: (2026)

GeHirNet: A Gender-Aware Hierarchical Model for Voice Pathology Classification
by: Wu, Fan, et al.
Published: (2025)

End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)

CycleGuardian: A Framework for Automatic RespiratorySound classification Based on Improved Deep clustering and Contrastive Learning
by: Chu, Yun, et al.
Published: (2025)