:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Ma, Chengyuan, Jia, Peng, Guo, Hongyue, Yang, Wenming
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Sound Machine Learning
Accesso online:	https://arxiv.org/abs/2509.02471
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

TLDiffGAN: A Latent Diffusion-GAN Framework with Temporal Information Fusion for Anomalous Sound Detection
di: Ma, Chengyuan, et al.
Pubblicazione: (2026)

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
di: Wang, Zehao, et al.
Pubblicazione: (2024)

An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models
di: Zhong, Guirui, et al.
Pubblicazione: (2025)

Activity-Guided Industrial Anomalous Sound Detection against Interferences
di: Lee, Yunjoo, et al.
Pubblicazione: (2024)

A Framework for Evaluating Faithfulness in Explainable AI for Machine Anomalous Sound Detection Using Frequency-Band Perturbation
di: Buck, Alexander, et al.
Pubblicazione: (2026)

MIMII-Agent: Leveraging LLMs with Function Calling for Relative Evaluation of Anomalous Sound Detection
di: Purohit, Harsh, et al.
Pubblicazione: (2025)

Dual Knowledge Distillation for Efficient Sound Event Detection
di: Xiao, Yang, et al.
Pubblicazione: (2024)

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
di: Nishida, Tomoya, et al.
Pubblicazione: (2024)

Solution for Temporal Sound Localisation Task of ECCV Second Perception Test Challenge 2024
di: Gu, Haowei, et al.
Pubblicazione: (2024)

Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding
di: Kong, Zhifeng, et al.
Pubblicazione: (2025)

Schrödinger Bridge Mamba for One-Step Speech Enhancement
di: Yang, Jing, et al.
Pubblicazione: (2025)

Advanced Framework for Animal Sound Classification With Features Optimization
di: Yang, Qiang, et al.
Pubblicazione: (2024)

SepMamba: State-space models for speaker separation using Mamba
di: Avenstrup, Thor Højhus, et al.
Pubblicazione: (2024)

Sound Event Detection and Localization with Distance Estimation
di: Krause, Daniel Aleksander, et al.
Pubblicazione: (2024)

How to Label Resynthesized Audio: The Dual Role of Neural Audio Codecs in Audio Deepfake Detection
di: Xiao, Yixuan, et al.
Pubblicazione: (2026)

H-Infinity Filter Enhanced CNN-LSTM for Arrhythmia Detection from Heart Sound Recordings
di: Kumar, Rohith Shinoj, et al.
Pubblicazione: (2025)

MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control
di: Kumar, Sahil, et al.
Pubblicazione: (2026)

Energy Consumption Trends in Sound Event Detection Systems
di: Douwes, Constance, et al.
Pubblicazione: (2024)

Uncertainty Calibration of Multi-Label Bird Sound Classifiers
di: Schwinger, Raphael, et al.
Pubblicazione: (2025)

Benchmarking LLMs on the Massive Sound Embedding Benchmark (MSEB)
di: Allauzen, Cyril, et al.
Pubblicazione: (2026)

XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
di: Amado-Caballero, Patricia, et al.
Pubblicazione: (2025)

Unleashing the Power of Natural Audio Featuring Multiple Sound Sources
di: Cheng, Xize, et al.
Pubblicazione: (2025)

Exploring Performance-Complexity Trade-Offs in Sound Event Detection Models
di: Morocutti, Tobias, et al.
Pubblicazione: (2025)

RCT: Random Consistency Training for Semi-supervised Sound Event Detection
di: Shao, Nian, et al.
Pubblicazione: (2021)

Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection
di: Moummad, Ilyass, et al.
Pubblicazione: (2023)

Sound and Music Biases in Deep Music Transcription Models: A Systematic Analysis
di: Marták, Lukáš Samuel, et al.
Pubblicazione: (2025)

Sound Signal Synthesis with Auxiliary Classifier GAN, COVID-19 cough as an example
di: Saleh, Yahya Sherif Solayman Mohamed, et al.
Pubblicazione: (2025)

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
di: Yuan, Yi, et al.
Pubblicazione: (2024)

Do Audio-Visual Segmentation Models Truly Segment Sounding Objects?
di: Li, Jia, et al.
Pubblicazione: (2025)

Improving Deep Learning-based Respiratory Sound Analysis with Frequency Selection and Attention Mechanism
di: Fraihi, Nouhaila, et al.
Pubblicazione: (2025)

CodecSep: Prompt-Driven Universal Sound Separation on Neural Audio Codec Latents
di: Banerjee, Adhiraj, et al.
Pubblicazione: (2025)

U-Mamba-Net: A highly efficient Mamba-based U-net style network for noisy and reverberant speech separation
di: Dang, Shaoxiang, et al.
Pubblicazione: (2024)

Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures
di: Park, Sangwook, et al.
Pubblicazione: (2021)

An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation
di: Dong, Yuxuan, et al.
Pubblicazione: (2025)

Real-Time Voicemail Detection in Telephony Audio Using Temporal Speech Activity Features
di: Saurav, Kumar
Pubblicazione: (2026)

Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation
di: Lu, Cheng, et al.
Pubblicazione: (2024)

SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
di: Niu, Xinlei, et al.
Pubblicazione: (2024)

Enhancing Speech Emotion Recognition using Dynamic Spectral Features and Kalman Smoothing
di: Hizabri, Marouane El, et al.
Pubblicazione: (2026)

The Solution for Temporal Sound Localisation Task of ICCV 1st Perception Test Challenge 2023
di: Huang, Yurui, et al.
Pubblicazione: (2024)

Detection of Electric Motor Damage Through Analysis of Sound Signals Using Bayesian Neural Networks
di: Bauer, Waldemar, et al.
Pubblicazione: (2024)