Saved in:
| Main Authors: | Kafentzis, George P., Tetsing, Stephane, Brew, Joe, Jover, Lola, Galvosas, Mindaugas, Chaccour, Carlos, Small, Peter M. |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2307.04842 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Leveraging cough sounds to optimize chest x-ray usage in low-resource settings
by: Philip, Alexander, et al.
Published: (2024)
by: Philip, Alexander, et al.
Published: (2024)
Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification
by: Kafentzis, George P., et al.
Published: (2026)
by: Kafentzis, George P., et al.
Published: (2026)
On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals
by: Kafentzis, George P.
Published: (2024)
by: Kafentzis, George P.
Published: (2024)
CoughViT: A Self-Supervised Vision Transformer for Cough Audio Representation Learning
by: Luong, Justin, et al.
Published: (2025)
by: Luong, Justin, et al.
Published: (2025)
Deep Learning for Tuberculosis Screening in a High-burden Setting using Cough Analysis and Speech Foundation Models
by: Ma, Ning, et al.
Published: (2025)
by: Ma, Ning, et al.
Published: (2025)
A Machine Hearing System for Robust Cough Detection Based on a High-Level Representation of Band-Specific Audio Features
by: Monge-Alvarez, Jesús, et al.
Published: (2024)
by: Monge-Alvarez, Jesús, et al.
Published: (2024)
Fusing Audio and Metadata Embeddings Improves Language-based Audio Retrieval
by: Primus, Paul, et al.
Published: (2024)
by: Primus, Paul, et al.
Published: (2024)
Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge
by: Albini, Stefano, et al.
Published: (2024)
by: Albini, Stefano, et al.
Published: (2024)
Multi-Granularity Adaptive Time-Frequency Attention Framework for Audio Deepfake Detection under Real-World Communication Degradations
by: Shi, Haohan, et al.
Published: (2025)
by: Shi, Haohan, et al.
Published: (2025)
First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation
by: Zhang, Hejing, et al.
Published: (2023)
by: Zhang, Hejing, et al.
Published: (2023)
COVID-19 Detection System: A Comparative Analysis of System Performance Based on Acoustic Features of Cough Audio Signals
by: Shati, Asmaa, et al.
Published: (2023)
by: Shati, Asmaa, et al.
Published: (2023)
COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation
by: Mahanta, Saranga Kingkor, et al.
Published: (2021)
by: Mahanta, Saranga Kingkor, et al.
Published: (2021)
BickGraphing: Web-Based Application for Visual Inspection of Audio Recordings
by: Seow, Kayley, et al.
Published: (2026)
by: Seow, Kayley, et al.
Published: (2026)
Cough activity detection for automatic tuberculosis screening
by: van Vüren, Joshua Jansen, et al.
Published: (2026)
by: van Vüren, Joshua Jansen, et al.
Published: (2026)
Comparison of Classification Algorithms for COVID19 Detection using Cough Acoustic Signals
by: Erdoğan, Yunus Emre, et al.
Published: (2022)
by: Erdoğan, Yunus Emre, et al.
Published: (2022)
Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings
by: Liu, Weixin, et al.
Published: (2026)
by: Liu, Weixin, et al.
Published: (2026)
Low-Complexity Neural Wind Noise Reduction for Audio Recordings
by: Eftekhari, Hesam, et al.
Published: (2025)
by: Eftekhari, Hesam, et al.
Published: (2025)
An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough Audio: A Case Study for COVID-19
by: Saeed, Tabish, et al.
Published: (2024)
by: Saeed, Tabish, et al.
Published: (2024)
Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline
by: Aziz, Shiran, et al.
Published: (2024)
by: Aziz, Shiran, et al.
Published: (2024)
Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset
by: Ahmad, Faheem, et al.
Published: (2026)
by: Ahmad, Faheem, et al.
Published: (2026)
XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
by: Amado-Caballero, Patricia, et al.
Published: (2025)
by: Amado-Caballero, Patricia, et al.
Published: (2025)
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings
by: Kalda, Joonas, et al.
Published: (2024)
by: Kalda, Joonas, et al.
Published: (2024)
Classical Guitar Duet Separation using GuitarDuets -- a Dataset of Real and Synthesized Guitar Recordings
by: Glytsos, Marios, et al.
Published: (2025)
by: Glytsos, Marios, et al.
Published: (2025)
POLIPHONE: A Dataset for Smartphone Model Identification from Audio Recordings
by: Salvi, Davide, et al.
Published: (2024)
by: Salvi, Davide, et al.
Published: (2024)
AudioGAN: A Compact and Efficient Framework for Real-Time High-Fidelity Text-to-Audio Generation
by: Chung, HaeChun
Published: (2025)
by: Chung, HaeChun
Published: (2025)
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios
by: Shi, Haohan, et al.
Published: (2025)
by: Shi, Haohan, et al.
Published: (2025)
Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap
by: Channing, Georgia, et al.
Published: (2024)
by: Channing, Georgia, et al.
Published: (2024)
Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models
by: Li, Longhao, et al.
Published: (2026)
by: Li, Longhao, et al.
Published: (2026)
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
by: Yang, Bing, et al.
Published: (2024)
by: Yang, Bing, et al.
Published: (2024)
Scalable Frameworks for Real-World Audio-Visual Speech Recognition
by: Kim, Sungnyun
Published: (2025)
by: Kim, Sungnyun
Published: (2025)
HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models
by: Wang, Shuiyuan, et al.
Published: (2026)
by: Wang, Shuiyuan, et al.
Published: (2026)
InfiniteAudio: Infinite-Length Audio Generation with Consistency
by: Jung, Chaeyoung, et al.
Published: (2025)
by: Jung, Chaeyoung, et al.
Published: (2025)
End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)
by: Zeng, Wei, et al.
Published: (2024)
Audiosockets: A Python socket package for Real-Time Audio Processing
by: Shu, Nicolas, et al.
Published: (2024)
by: Shu, Nicolas, et al.
Published: (2024)
Measuring Audio's Impact on Correctness: Audio-Contribution-Aware Post-Training of Large Audio Language Models
by: He, Haolin, et al.
Published: (2025)
by: He, Haolin, et al.
Published: (2025)
Aud-Sur: An Audio Analyzer Assistant for Audio Surveillance Applications
by: Lam, Phat, et al.
Published: (2025)
by: Lam, Phat, et al.
Published: (2025)
SALAD-VAE: Semantic Audio Compression with Language-Audio Distillation
by: Braun, Sebastian, et al.
Published: (2025)
by: Braun, Sebastian, et al.
Published: (2025)
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
by: Bai, Jisheng, et al.
Published: (2024)
by: Bai, Jisheng, et al.
Published: (2024)
UPV_RIR_DB: A Structured Room Impulse Response Database with Hierarchical Metadata and Acoustic Indicators
by: García-Gamborino, Jesús, et al.
Published: (2026)
by: García-Gamborino, Jesús, et al.
Published: (2026)
AudioNet: Supervised Deep Hashing for Retrieval of Similar Audio Events
by: Dutta, Sagar, et al.
Published: (2025)
by: Dutta, Sagar, et al.
Published: (2025)
Similar Items
-
Leveraging cough sounds to optimize chest x-ray usage in low-resource settings
by: Philip, Alexander, et al.
Published: (2024) -
Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification
by: Kafentzis, George P., et al.
Published: (2026) -
On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals
by: Kafentzis, George P.
Published: (2024) -
CoughViT: A Self-Supervised Vision Transformer for Cough Audio Representation Learning
by: Luong, Justin, et al.
Published: (2025) -
Deep Learning for Tuberculosis Screening in a High-burden Setting using Cough Analysis and Speech Foundation Models
by: Ma, Ning, et al.
Published: (2025)