:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ge, Shijia, Zhang, Weixiang, Xie, Shuzhao, Yan, Baixu, Wang, Zhi
Format:	Preprint
Published:	2024
Subjects:	Sound Machine Learning Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2501.00064
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds
by: Moummad, Ilyass, et al.
Published: (2024)

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
by: Bae, Sangmin, et al.
Published: (2023)

Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
by: Ariyanti, Whenty, et al.
Published: (2024)

SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
by: Saito, Koichi, et al.
Published: (2024)

Advanced Framework for Animal Sound Classification With Features Optimization
by: Yang, Qiang, et al.
Published: (2024)

Adaptive Differential Denoising for Respiratory Sounds Classification
by: Dong, Gaoyang, et al.
Published: (2025)

XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
by: Amado-Caballero, Patricia, et al.
Published: (2025)

Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network
by: Hassanuzzaman, Md, et al.
Published: (2024)

Focal Modulation Networks for Interpretable Sound Classification
by: Della Libera, Luca, et al.
Published: (2024)

SoundReactor: Frame-level Online Video-to-Audio Generation
by: Saito, Koichi, et al.
Published: (2025)

Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers
by: Işık, Atakan, et al.
Published: (2025)

Embedding-Space Diffusion for Zero-Shot Environmental Sound Classification
by: Sims, Ysobel, et al.
Published: (2024)

Self-Supervised Learning for Few-Shot Bird Sound Classification
by: Moummad, Ilyass, et al.
Published: (2023)

Feature Aggregation in Joint Sound Classification and Localization Neural Networks
by: Healy, Brendan, et al.
Published: (2023)

Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning
by: Lindholm, Richard, et al.
Published: (2025)

SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
by: Niu, Xinlei, et al.
Published: (2024)

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks
by: Liao, Shijia, et al.
Published: (2024)

An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models
by: Zhong, Guirui, et al.
Published: (2025)

AFEN: Respiratory Disease Classification using Ensemble Learning
by: Nadkarni, Rahul, et al.
Published: (2024)

RA-QA: A Benchmarking System for Respiratory Audio Question Answering Under Real-World Heterogeneity
by: Bertolino, Gaia A., et al.
Published: (2026)

Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)

Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification
by: Wei, Peidong, et al.
Published: (2025)

Automatic Inspection Based on Switch Sounds of Electric Point Machines
by: Shibata, Ayano, et al.
Published: (2025)

Microphone Conversion: Mitigating Device Variability in Sound Event Classification
by: Ryu, Myeonghoon, et al.
Published: (2024)

Sound event localization and classification using WASN in Outdoor Environment
by: Zhang, Dongzhe, et al.
Published: (2024)

Analysis-Driven Procedural Generation of an Engine Sound Dataset with Embedded Control Annotations
by: Doerfler, Robin, et al.
Published: (2026)

SoundSculpt: Direction and Semantics Driven Ambisonic Target Sound Extraction
by: Chen, Tuochao, et al.
Published: (2025)

Multi-label Zero-Shot Audio Classification with Temporal Attention
by: Dogan, Duygu, et al.
Published: (2024)

Audio Geolocation: A Natural Sounds Benchmark
by: Chasmai, Mustafa, et al.
Published: (2025)

From Sound to Setting: AI-Based Equalizer Parameter Prediction for Piano Tone Replication
by: Yu, Song-Ze
Published: (2025)

HyperGANStrument: Instrument Sound Synthesis and Editing with Pitch-Invariant Hypernetworks
by: Zhang, Zhe, et al.
Published: (2024)

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
by: Wang, Zehao, et al.
Published: (2024)

The iNaturalist Sounds Dataset
by: Chasmai, Mustafa, et al.
Published: (2025)

Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation
by: Gauy, Marcelo Matheus, et al.
Published: (2024)

An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough Audio: A Case Study for COVID-19
by: Saeed, Tabish, et al.
Published: (2024)

Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling
by: Li, Xiaojie, et al.
Published: (2025)

Maximum Likelihood Estimation of the Direction of Sound In A Reverberant Noisy Environment
by: Mansour, Mohamed F.
Published: (2024)

A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
by: Ren, Zhao, et al.
Published: (2023)

Sound Event Detection and Localization with Distance Estimation
by: Krause, Daniel Aleksander, et al.
Published: (2024)

Sound Tagging in Infant-centric Home Soundscapes
by: Khan, Mohammad Nur Hossain, et al.
Published: (2024)