:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Matynia, Igor, Nowak, Robert
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Sound Audio and Speech Processing 68 J.3
Accesso online:	https://arxiv.org/abs/2504.08659
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Assessing the Utility of Audio Foundation Models for Heart and Respiratory Sound Analysis
di: Niizumi, Daisuke, et al.
Pubblicazione: (2025)

Towards Pre-training an Effective Respiratory Audio Foundation Model
di: Niizumi, Daisuke, et al.
Pubblicazione: (2025)

Sound Safeguarding for Acoustic Measurement Using Any Sounds: Tools and Applications
di: Kawahara, Hideki, et al.
Pubblicazione: (2025)

A Generalist Audio Foundation Model for Comprehensive Body Sound Auscultation
di: Wang, Pingjie, et al.
Pubblicazione: (2024)

Proposal of protocols for speech materials acquisition and presentation assisted by tools based on structured test signals
di: Kawahara, Hideki, et al.
Pubblicazione: (2024)

Towards Objective Gastrointestinal Auscultation: Automated Segmentation and Annotation of Bowel Sound Patterns
di: Mansour, Zahra, et al.
Pubblicazione: (2026)

Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment
di: Gerczuk, Maurice, et al.
Pubblicazione: (2024)

A methodological framework and exemplar protocol for the collection and analysis of repeated speech samples
di: Cummins, Nicholas, et al.
Pubblicazione: (2024)

Can Sound Replace Vision in LLaVA With Token Substitution?
di: Vosoughi, Ali, et al.
Pubblicazione: (2025)

FakeSound2: A Benchmark for Explainable and Generalizable Deepfake Sound Detection
di: Xie, Zeyu, et al.
Pubblicazione: (2025)

Sound Terminology Describing Production and Perception of Sonification
di: Ziemer, Tim
Pubblicazione: (2023)

Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation
di: Zhang, Zixing, et al.
Pubblicazione: (2024)

Energy-based features and bi-LSTM neural network for EEG-based music and voice classification
di: Ariza, Isaac, et al.
Pubblicazione: (2024)

FakeSound: Deepfake General Audio Detection
di: Xie, Zeyu, et al.
Pubblicazione: (2024)

Insights on Harmonic Tones from a Generative Music Experiment
di: Deruty, Emmanuel, et al.
Pubblicazione: (2025)

Neural Proxies for Sound Synthesizers: Learning Perceptually Informed Preset Representations
di: Combes, Paolo, et al.
Pubblicazione: (2025)

Frequency Dynamic Convolutions for Sound Event Detection
di: Nam, Hyeonuk
Pubblicazione: (2025)

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
di: Niizumi, Daisuke, et al.
Pubblicazione: (2024)

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection
di: Niizumi, Daisuke, et al.
Pubblicazione: (2024)

Patient-Level Multimodal Question Answering from Multi-Site Auscultation Recordings
di: Wu, Fan, et al.
Pubblicazione: (2026)

Cervical Auscultation Machine Learning for Dysphagia Assessment
di: Chia, An An, et al.
Pubblicazione: (2024)

Foundation Model Hidden Representations for Heart Rate Estimation from Auscultation
di: Nie, Jingping, et al.
Pubblicazione: (2025)

Efficient Sound Field Reconstruction with Conditional Invertible Neural Networks
di: Karakonstantis, Xenofon, et al.
Pubblicazione: (2024)

Building music with Lego bricks and Raspberry Pi
di: Barbancho, Ana M., et al.
Pubblicazione: (2024)

Binaural Sound Event Localization and Detection Neural Network based on HRTF Localization Cues for Humanoid Robots
di: Lee, Gyeong-Tae
Pubblicazione: (2025)

Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech
di: Li, Jingyu, et al.
Pubblicazione: (2025)

Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event Detection
di: Nam, Hyeonuk, et al.
Pubblicazione: (2025)

Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection
di: Nam, Hyeonuk, et al.
Pubblicazione: (2024)

Phase Repair for Time-Domain Convolutional Neural Networks in Music Super-Resolution
di: Zhang, Yenan, et al.
Pubblicazione: (2023)

Region-Specific Audio Tagging for Spatial Sound
di: Zhao, Jinzheng, et al.
Pubblicazione: (2025)

Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic Convolution
di: Nam, Hyeonuk, et al.
Pubblicazione: (2024)

Sound Field Reconstruction Using a Compact Acoustics-informed Neural Network
di: Ma, Fei, et al.
Pubblicazione: (2024)

Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network
di: Hassanuzzaman, Md, et al.
Pubblicazione: (2024)

Generative Deep Learning and Signal Processing for Data Augmentation of Cardiac Auscultation Signals: Improving Model Robustness Using Synthetic Audio
di: Abbott, Leigh, et al.
Pubblicazione: (2024)

BUET Multi-disease Heart Sound Dataset: A Comprehensive Auscultation Dataset for Developing Computer-Aided Diagnostic Systems
di: Ali, Shams Nafisa, et al.
Pubblicazione: (2024)

Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time
di: Li, Yue, et al.
Pubblicazione: (2024)

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
di: Niizumi, Daisuke, et al.
Pubblicazione: (2024)

Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks
di: Jiang, Zifan, et al.
Pubblicazione: (2023)

ToMoBrush: Exploring Dental Health Sensing using a Sonic Toothbrush
di: Yuan, Kuang, et al.
Pubblicazione: (2024)

PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and Detection
di: Hu, Jinbo, et al.
Pubblicazione: (2024)