Saved in:
| Main Authors: | Ge, Shijia, Zhang, Weixiang, Xie, Shuzhao, Yan, Baixu, Wang, Zhi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.00064 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds
by: Moummad, Ilyass, et al.
Published: (2024)
by: Moummad, Ilyass, et al.
Published: (2024)
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
by: Bae, Sangmin, et al.
Published: (2023)
by: Bae, Sangmin, et al.
Published: (2023)
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
by: Ariyanti, Whenty, et al.
Published: (2024)
by: Ariyanti, Whenty, et al.
Published: (2024)
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
by: Saito, Koichi, et al.
Published: (2024)
by: Saito, Koichi, et al.
Published: (2024)
Advanced Framework for Animal Sound Classification With Features Optimization
by: Yang, Qiang, et al.
Published: (2024)
by: Yang, Qiang, et al.
Published: (2024)
Adaptive Differential Denoising for Respiratory Sounds Classification
by: Dong, Gaoyang, et al.
Published: (2025)
by: Dong, Gaoyang, et al.
Published: (2025)
XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
by: Amado-Caballero, Patricia, et al.
Published: (2025)
by: Amado-Caballero, Patricia, et al.
Published: (2025)
Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network
by: Hassanuzzaman, Md, et al.
Published: (2024)
by: Hassanuzzaman, Md, et al.
Published: (2024)
Focal Modulation Networks for Interpretable Sound Classification
by: Della Libera, Luca, et al.
Published: (2024)
by: Della Libera, Luca, et al.
Published: (2024)
SoundReactor: Frame-level Online Video-to-Audio Generation
by: Saito, Koichi, et al.
Published: (2025)
by: Saito, Koichi, et al.
Published: (2025)
Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers
by: Işık, Atakan, et al.
Published: (2025)
by: Işık, Atakan, et al.
Published: (2025)
Embedding-Space Diffusion for Zero-Shot Environmental Sound Classification
by: Sims, Ysobel, et al.
Published: (2024)
by: Sims, Ysobel, et al.
Published: (2024)
Self-Supervised Learning for Few-Shot Bird Sound Classification
by: Moummad, Ilyass, et al.
Published: (2023)
by: Moummad, Ilyass, et al.
Published: (2023)
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
by: Healy, Brendan, et al.
Published: (2023)
by: Healy, Brendan, et al.
Published: (2023)
Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning
by: Lindholm, Richard, et al.
Published: (2025)
by: Lindholm, Richard, et al.
Published: (2025)
SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
by: Niu, Xinlei, et al.
Published: (2024)
by: Niu, Xinlei, et al.
Published: (2024)
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks
by: Liao, Shijia, et al.
Published: (2024)
by: Liao, Shijia, et al.
Published: (2024)
An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models
by: Zhong, Guirui, et al.
Published: (2025)
by: Zhong, Guirui, et al.
Published: (2025)
AFEN: Respiratory Disease Classification using Ensemble Learning
by: Nadkarni, Rahul, et al.
Published: (2024)
by: Nadkarni, Rahul, et al.
Published: (2024)
RA-QA: A Benchmarking System for Respiratory Audio Question Answering Under Real-World Heterogeneity
by: Bertolino, Gaia A., et al.
Published: (2026)
by: Bertolino, Gaia A., et al.
Published: (2026)
Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)
by: Jeong, Seung Gyu, et al.
Published: (2025)
Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification
by: Wei, Peidong, et al.
Published: (2025)
by: Wei, Peidong, et al.
Published: (2025)
Automatic Inspection Based on Switch Sounds of Electric Point Machines
by: Shibata, Ayano, et al.
Published: (2025)
by: Shibata, Ayano, et al.
Published: (2025)
Microphone Conversion: Mitigating Device Variability in Sound Event Classification
by: Ryu, Myeonghoon, et al.
Published: (2024)
by: Ryu, Myeonghoon, et al.
Published: (2024)
Sound event localization and classification using WASN in Outdoor Environment
by: Zhang, Dongzhe, et al.
Published: (2024)
by: Zhang, Dongzhe, et al.
Published: (2024)
Analysis-Driven Procedural Generation of an Engine Sound Dataset with Embedded Control Annotations
by: Doerfler, Robin, et al.
Published: (2026)
by: Doerfler, Robin, et al.
Published: (2026)
SoundSculpt: Direction and Semantics Driven Ambisonic Target Sound Extraction
by: Chen, Tuochao, et al.
Published: (2025)
by: Chen, Tuochao, et al.
Published: (2025)
Multi-label Zero-Shot Audio Classification with Temporal Attention
by: Dogan, Duygu, et al.
Published: (2024)
by: Dogan, Duygu, et al.
Published: (2024)
Audio Geolocation: A Natural Sounds Benchmark
by: Chasmai, Mustafa, et al.
Published: (2025)
by: Chasmai, Mustafa, et al.
Published: (2025)
From Sound to Setting: AI-Based Equalizer Parameter Prediction for Piano Tone Replication
by: Yu, Song-Ze
Published: (2025)
by: Yu, Song-Ze
Published: (2025)
HyperGANStrument: Instrument Sound Synthesis and Editing with Pitch-Invariant Hypernetworks
by: Zhang, Zhe, et al.
Published: (2024)
by: Zhang, Zhe, et al.
Published: (2024)
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
by: Wang, Zehao, et al.
Published: (2024)
by: Wang, Zehao, et al.
Published: (2024)
The iNaturalist Sounds Dataset
by: Chasmai, Mustafa, et al.
Published: (2025)
by: Chasmai, Mustafa, et al.
Published: (2025)
Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation
by: Gauy, Marcelo Matheus, et al.
Published: (2024)
by: Gauy, Marcelo Matheus, et al.
Published: (2024)
An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough Audio: A Case Study for COVID-19
by: Saeed, Tabish, et al.
Published: (2024)
by: Saeed, Tabish, et al.
Published: (2024)
Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling
by: Li, Xiaojie, et al.
Published: (2025)
by: Li, Xiaojie, et al.
Published: (2025)
Maximum Likelihood Estimation of the Direction of Sound In A Reverberant Noisy Environment
by: Mansour, Mohamed F.
Published: (2024)
by: Mansour, Mohamed F.
Published: (2024)
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
by: Ren, Zhao, et al.
Published: (2023)
by: Ren, Zhao, et al.
Published: (2023)
Sound Event Detection and Localization with Distance Estimation
by: Krause, Daniel Aleksander, et al.
Published: (2024)
by: Krause, Daniel Aleksander, et al.
Published: (2024)
Sound Tagging in Infant-centric Home Soundscapes
by: Khan, Mohammad Nur Hossain, et al.
Published: (2024)
by: Khan, Mohammad Nur Hossain, et al.
Published: (2024)
Similar Items
-
Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds
by: Moummad, Ilyass, et al.
Published: (2024) -
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
by: Bae, Sangmin, et al.
Published: (2023) -
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
by: Ariyanti, Whenty, et al.
Published: (2024) -
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
by: Saito, Koichi, et al.
Published: (2024) -
Advanced Framework for Animal Sound Classification With Features Optimization
by: Yang, Qiang, et al.
Published: (2024)