:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rabuge, Miguel, Lourenço, Nuno
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Sound
Online Access:	https://arxiv.org/abs/2502.08785
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids
by: Mayuravaani, Mathuranathan, et al.
Published: (2026)

Fine-grained Soundscape Control for Augmented Hearing
by: Oh, Seunghyun, et al.
Published: (2026)

Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired
by: Jiayu, et al.
Published: (2025)

Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
by: Zezario, Ryandhimas E., et al.
Published: (2023)

Improving Machine Hearing on Limited Data Sets
by: Harar, Pavol, et al.
Published: (2019)

Text-Independent Speaker Identification Using Audio Looping With Margin Based Loss Functions
by: Garcia, Elliot Q C, et al.
Published: (2025)

HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
by: Wisnu, Dyah A. M. G., et al.
Published: (2024)

AuditoryBench++: Can Language Models Understand Auditory Knowledge without Hearing?
by: Ok, Hyunjong, et al.
Published: (2025)

Remixing Music for Hearing Aids Using Ensemble of Fine-Tuned Source Separators
by: Daly, Matthew
Published: (2024)

Hearing Your Blood Sugar: Non-Invasive Glucose Measurement Through Simple Vocal Signals, Transforming any Speech into a Sensor with Machine Learning
by: Ahmadli, Nihat, et al.
Published: (2024)

Count The Notes: Histogram-Based Supervision for Automatic Music Transcription
by: Yaffe, Jonathan, et al.
Published: (2025)

Hearing Anywhere in Any Environment
by: Liu, Xiulong, et al.
Published: (2025)

Transformer Based Machine Fault Detection From Audio Input
by: Holla, Kiran Voderhobli
Published: (2026)

EmoHRNet: High-Resolution Neural Network Based Speech Emotion Recognition
by: Muppidi, Akshay, et al.
Published: (2025)

Audio Question Answering with GRPO-Based Fine-Tuning and Calibrated Segment-Level Predictions
by: Gibier, Marcel, et al.
Published: (2025)

Revisit Modality Imbalance at the Decision Layer
by: Ma, Xiaoyu, et al.
Published: (2025)

How to Count Coughs: An Event-Based Framework for Evaluating Automatic Cough Detection Algorithm Performance
by: Orlandic, Lara, et al.
Published: (2024)

Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings
by: Wisnu, Dyah A. M. G., et al.
Published: (2025)

GE2E-AC: Generalized End-to-End Loss Training for Accent Classification
by: Watanabe, Chihiro, et al.
Published: (2024)

Improving Out-of-Domain Audio Deepfake Detection via Layer Selection and Fusion of SSL-Based Countermeasures
by: Serrano, Pierre, et al.
Published: (2025)

Automatic Identification of Samples in Hip-Hop Music via Multi-Loss Training and an Artificial Dataset
by: Cheston, Huw, et al.
Published: (2025)

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
by: Zhao, Shengkui, et al.
Published: (2021)

Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments
by: Choudhury, Ritabrata Roy
Published: (2024)

Hybrid Losses for Hierarchical Embedding Learning
by: Tian, Haokun, et al.
Published: (2025)

Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)

Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
by: Fu, Yonggan, et al.
Published: (2022)

Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models
by: Yoo, Suho, et al.
Published: (2025)

What Do Language Models Hear? Probing for Auditory Representations in Language Models
by: Ngo, Jerry, et al.
Published: (2024)

On the Condition Monitoring of Bolted Joints through Acoustic Emission and Deep Transfer Learning: Generalization, Ordinal Loss and Super-Convergence
by: Ramasso, Emmanuel, et al.
Published: (2024)

Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay
by: Hinrichs, Reemt, et al.
Published: (2025)

An Independence-promoting Loss for Music Generation with Language Models
by: Lemercier, Jean-Marie, et al.
Published: (2024)

Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2025)

Testing chatbots on the creation of encoders for audio conditioned image generation
by: León, Jorge E., et al.
Published: (2025)

What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection
by: Nguyen, Binh, et al.
Published: (2025)

An Attention Long Short-Term Memory based system for automatic classification of speech intelligibility
by: Fernández-Díaz, Miguel, et al.
Published: (2024)

Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features
by: Teixeira, Francisco, et al.
Published: (2024)

Representation-Based Data Quality Audits for Audio
by: Gonzalez-Jimenez, Alvaro, et al.
Published: (2025)

Respiratory Disease Classification and Biometric Analysis Using Biosignals from Digital Stethoscopes
by: Casado, Constantino Álvarez, et al.
Published: (2023)

Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network
by: Dissen, Yehoshua, et al.
Published: (2024)

Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition
by: Tripathi, Suraj, et al.
Published: (2019)