:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kafentzis, George P., Tetsing, Stephane, Brew, Joe, Jover, Lola, Galvosas, Mindaugas, Chaccour, Carlos, Small, Peter M.
Format:	Preprint
Published:	2023
Subjects:	Audio and Speech Processing Artificial Intelligence
Online Access:	https://arxiv.org/abs/2307.04842
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Leveraging cough sounds to optimize chest x-ray usage in low-resource settings
by: Philip, Alexander, et al.
Published: (2024)

Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification
by: Kafentzis, George P., et al.
Published: (2026)

On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals
by: Kafentzis, George P.
Published: (2024)

CoughViT: A Self-Supervised Vision Transformer for Cough Audio Representation Learning
by: Luong, Justin, et al.
Published: (2025)

Deep Learning for Tuberculosis Screening in a High-burden Setting using Cough Analysis and Speech Foundation Models
by: Ma, Ning, et al.
Published: (2025)

A Machine Hearing System for Robust Cough Detection Based on a High-Level Representation of Band-Specific Audio Features
by: Monge-Alvarez, Jesús, et al.
Published: (2024)

Fusing Audio and Metadata Embeddings Improves Language-based Audio Retrieval
by: Primus, Paul, et al.
Published: (2024)

Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge
by: Albini, Stefano, et al.
Published: (2024)

Multi-Granularity Adaptive Time-Frequency Attention Framework for Audio Deepfake Detection under Real-World Communication Degradations
by: Shi, Haohan, et al.
Published: (2025)

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation
by: Zhang, Hejing, et al.
Published: (2023)

COVID-19 Detection System: A Comparative Analysis of System Performance Based on Acoustic Features of Cough Audio Signals
by: Shati, Asmaa, et al.
Published: (2023)

COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation
by: Mahanta, Saranga Kingkor, et al.
Published: (2021)

BickGraphing: Web-Based Application for Visual Inspection of Audio Recordings
by: Seow, Kayley, et al.
Published: (2026)

Cough activity detection for automatic tuberculosis screening
by: van Vüren, Joshua Jansen, et al.
Published: (2026)

Comparison of Classification Algorithms for COVID19 Detection using Cough Acoustic Signals
by: Erdoğan, Yunus Emre, et al.
Published: (2022)

Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings
by: Liu, Weixin, et al.
Published: (2026)

Low-Complexity Neural Wind Noise Reduction for Audio Recordings
by: Eftekhari, Hesam, et al.
Published: (2025)

An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough Audio: A Case Study for COVID-19
by: Saeed, Tabish, et al.
Published: (2024)

Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline
by: Aziz, Shiran, et al.
Published: (2024)

Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset
by: Ahmad, Faheem, et al.
Published: (2026)

XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
by: Amado-Caballero, Patricia, et al.
Published: (2025)

PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings
by: Kalda, Joonas, et al.
Published: (2024)

Classical Guitar Duet Separation using GuitarDuets -- a Dataset of Real and Synthesized Guitar Recordings
by: Glytsos, Marios, et al.
Published: (2025)

POLIPHONE: A Dataset for Smartphone Model Identification from Audio Recordings
by: Salvi, Davide, et al.
Published: (2024)

AudioGAN: A Compact and Efficient Framework for Real-Time High-Fidelity Text-to-Audio Generation
by: Chung, HaeChun
Published: (2025)

Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios
by: Shi, Haohan, et al.
Published: (2025)

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap
by: Channing, Georgia, et al.
Published: (2024)

Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models
by: Li, Longhao, et al.
Published: (2026)

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
by: Yang, Bing, et al.
Published: (2024)

Scalable Frameworks for Real-World Audio-Visual Speech Recognition
by: Kim, Sungnyun
Published: (2025)

HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models
by: Wang, Shuiyuan, et al.
Published: (2026)

InfiniteAudio: Infinite-Length Audio Generation with Consistency
by: Jung, Chaeyoung, et al.
Published: (2025)

End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)

Audiosockets: A Python socket package for Real-Time Audio Processing
by: Shu, Nicolas, et al.
Published: (2024)

Measuring Audio's Impact on Correctness: Audio-Contribution-Aware Post-Training of Large Audio Language Models
by: He, Haolin, et al.
Published: (2025)

Aud-Sur: An Audio Analyzer Assistant for Audio Surveillance Applications
by: Lam, Phat, et al.
Published: (2025)

SALAD-VAE: Semantic Audio Compression with Language-Audio Distillation
by: Braun, Sebastian, et al.
Published: (2025)

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
by: Bai, Jisheng, et al.
Published: (2024)

UPV_RIR_DB: A Structured Room Impulse Response Database with Hierarchical Metadata and Acoustic Indicators
by: García-Gamborino, Jesús, et al.
Published: (2026)

AudioNet: Supervised Deep Hashing for Retrieval of Similar Audio Events
by: Dutta, Sagar, et al.
Published: (2025)