Saved in:
| Main Authors: | Sechet, Dylan, Bugiotti, Francesca, Kowalski, Matthieu, d'Hérouville, Edouard, Langiewicz, Filip |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.21167 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes
by: Kukanov, Ivan, et al.
Published: (2024)
by: Kukanov, Ivan, et al.
Published: (2024)
Music Genre Classification: A Comparative Analysis of Classical Machine Learning and Deep Learning Approaches
by: Prajuli, Sachin, et al.
Published: (2026)
by: Prajuli, Sachin, et al.
Published: (2026)
HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection
by: Liang, Zhili Nicholas, et al.
Published: (2026)
by: Liang, Zhili Nicholas, et al.
Published: (2026)
HASS: Hierarchical Simulation of Logopenic Aphasic Speech for Scalable PPA Detection
by: Li, Harrison, et al.
Published: (2026)
by: Li, Harrison, et al.
Published: (2026)
A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement
by: Lu, Shenghui, et al.
Published: (2025)
by: Lu, Shenghui, et al.
Published: (2025)
Joint Learning of Emotions in Music and Generalized Sounds
by: Simonetta, Federico, et al.
Published: (2024)
by: Simonetta, Federico, et al.
Published: (2024)
Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models
by: Pham, Lam, et al.
Published: (2024)
by: Pham, Lam, et al.
Published: (2024)
Deep Generic Representations for Domain-Generalized Anomalous Sound Detection
by: Saengthong, Phurich, et al.
Published: (2024)
by: Saengthong, Phurich, et al.
Published: (2024)
Deep Neural Network for Musical Instrument Recognition using MFCCs
by: Mahanta, Saranga Kingkor, et al.
Published: (2021)
by: Mahanta, Saranga Kingkor, et al.
Published: (2021)
Towards Deep Active Learning in Avian Bioacoustics
by: Rauch, Lukas, et al.
Published: (2024)
by: Rauch, Lukas, et al.
Published: (2024)
A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)
by: Pham, Lam, et al.
Published: (2024)
by: Pham, Lam, et al.
Published: (2024)
4,500 Seconds: Small Data Training Approaches for Deep UAV Audio Classification
by: Berg, Andrew P., et al.
Published: (2025)
by: Berg, Andrew P., et al.
Published: (2025)
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
by: Kheddar, Hamza, et al.
Published: (2024)
by: Kheddar, Hamza, et al.
Published: (2024)
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
by: Khan, Haris, et al.
Published: (2025)
by: Khan, Haris, et al.
Published: (2025)
Heart Sound Segmentation Using Deep Learning Techniques
by: Madine, Manas
Published: (2024)
by: Madine, Manas
Published: (2024)
Arabic Music Classification and Generation using Deep Learning
by: Elshaarawy, Mohamed, et al.
Published: (2024)
by: Elshaarawy, Mohamed, et al.
Published: (2024)
Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments
by: Djeffal, Noussaiba, et al.
Published: (2025)
by: Djeffal, Noussaiba, et al.
Published: (2025)
Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks
by: Atif, Youness
Published: (2025)
by: Atif, Youness
Published: (2025)
MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System
by: Purohit, Harsh, et al.
Published: (2024)
by: Purohit, Harsh, et al.
Published: (2024)
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
by: Colelough, Brandon, et al.
Published: (2024)
by: Colelough, Brandon, et al.
Published: (2024)
Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
by: Guo, Zilu, et al.
Published: (2023)
by: Guo, Zilu, et al.
Published: (2023)
Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification
by: Liu, Bei, et al.
Published: (2024)
by: Liu, Bei, et al.
Published: (2024)
Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
by: Li, Yuxin, et al.
Published: (2025)
by: Li, Yuxin, et al.
Published: (2025)
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology
by: Dai, Weinan, et al.
Published: (2024)
by: Dai, Weinan, et al.
Published: (2024)
End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization
by: Singh, Prachi, et al.
Published: (2024)
by: Singh, Prachi, et al.
Published: (2024)
Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation
by: Liu, Shuyang, et al.
Published: (2025)
by: Liu, Shuyang, et al.
Published: (2025)
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2024)
by: Zhang, Xiangyu, et al.
Published: (2024)
Contrastive Learning with Spectrum Information Augmentation in Abnormal Sound Detection
by: Meng, Xinxin, et al.
Published: (2025)
by: Meng, Xinxin, et al.
Published: (2025)
Towards Attention-based Contrastive Learning for Audio Spoof Detection
by: Goel, Chirag, et al.
Published: (2024)
by: Goel, Chirag, et al.
Published: (2024)
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
by: Chen, Yujie, et al.
Published: (2024)
by: Chen, Yujie, et al.
Published: (2024)
Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model
by: Oluwademilade, Adelekun, et al.
Published: (2026)
by: Oluwademilade, Adelekun, et al.
Published: (2026)
Explaining Deep Learning Embeddings for Speech Emotion Recognition by Predicting Interpretable Acoustic Features
by: Dixit, Satvik, et al.
Published: (2024)
by: Dixit, Satvik, et al.
Published: (2024)
A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection
by: Lee, Kyungbok, et al.
Published: (2024)
by: Lee, Kyungbok, et al.
Published: (2024)
FastSLM: Hierarchical Temporal Abstraction for Efficient Long-Form Speech Adaptation
by: Lee, Junseok, et al.
Published: (2026)
by: Lee, Junseok, et al.
Published: (2026)
Deep Learning for Speaker Identification: Architectural Insights from AB-1 Corpus Analysis and Performance Evaluation
by: Bartolo, Matthias
Published: (2024)
by: Bartolo, Matthias
Published: (2024)
Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning
by: Lachenani, Sidahmed, et al.
Published: (2025)
by: Lachenani, Sidahmed, et al.
Published: (2025)
NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control
by: Wen, Yufan, et al.
Published: (2026)
by: Wen, Yufan, et al.
Published: (2026)
GeHirNet: A Gender-Aware Hierarchical Model for Voice Pathology Classification
by: Wu, Fan, et al.
Published: (2025)
by: Wu, Fan, et al.
Published: (2025)
End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)
by: Zeng, Wei, et al.
Published: (2024)
CycleGuardian: A Framework for Automatic RespiratorySound classification Based on Improved Deep clustering and Contrastive Learning
by: Chu, Yun, et al.
Published: (2025)
by: Chu, Yun, et al.
Published: (2025)
Similar Items
-
Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes
by: Kukanov, Ivan, et al.
Published: (2024) -
Music Genre Classification: A Comparative Analysis of Classical Machine Learning and Deep Learning Approaches
by: Prajuli, Sachin, et al.
Published: (2026) -
HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection
by: Liang, Zhili Nicholas, et al.
Published: (2026) -
HASS: Hierarchical Simulation of Logopenic Aphasic Speech for Scalable PPA Detection
by: Li, Harrison, et al.
Published: (2026) -
A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement
by: Lu, Shenghui, et al.
Published: (2025)