Saved in:
| Main Authors: | Rabuge, Miguel, Lourenço, Nuno |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.08785 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids
by: Mayuravaani, Mathuranathan, et al.
Published: (2026)
by: Mayuravaani, Mathuranathan, et al.
Published: (2026)
Fine-grained Soundscape Control for Augmented Hearing
by: Oh, Seunghyun, et al.
Published: (2026)
by: Oh, Seunghyun, et al.
Published: (2026)
Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired
by: Jiayu, et al.
Published: (2025)
by: Jiayu, et al.
Published: (2025)
Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
by: Zezario, Ryandhimas E., et al.
Published: (2023)
by: Zezario, Ryandhimas E., et al.
Published: (2023)
Improving Machine Hearing on Limited Data Sets
by: Harar, Pavol, et al.
Published: (2019)
by: Harar, Pavol, et al.
Published: (2019)
Text-Independent Speaker Identification Using Audio Looping With Margin Based Loss Functions
by: Garcia, Elliot Q C, et al.
Published: (2025)
by: Garcia, Elliot Q C, et al.
Published: (2025)
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
by: Wisnu, Dyah A. M. G., et al.
Published: (2024)
by: Wisnu, Dyah A. M. G., et al.
Published: (2024)
AuditoryBench++: Can Language Models Understand Auditory Knowledge without Hearing?
by: Ok, Hyunjong, et al.
Published: (2025)
by: Ok, Hyunjong, et al.
Published: (2025)
Remixing Music for Hearing Aids Using Ensemble of Fine-Tuned Source Separators
by: Daly, Matthew
Published: (2024)
by: Daly, Matthew
Published: (2024)
Hearing Your Blood Sugar: Non-Invasive Glucose Measurement Through Simple Vocal Signals, Transforming any Speech into a Sensor with Machine Learning
by: Ahmadli, Nihat, et al.
Published: (2024)
by: Ahmadli, Nihat, et al.
Published: (2024)
Count The Notes: Histogram-Based Supervision for Automatic Music Transcription
by: Yaffe, Jonathan, et al.
Published: (2025)
by: Yaffe, Jonathan, et al.
Published: (2025)
Hearing Anywhere in Any Environment
by: Liu, Xiulong, et al.
Published: (2025)
by: Liu, Xiulong, et al.
Published: (2025)
Transformer Based Machine Fault Detection From Audio Input
by: Holla, Kiran Voderhobli
Published: (2026)
by: Holla, Kiran Voderhobli
Published: (2026)
EmoHRNet: High-Resolution Neural Network Based Speech Emotion Recognition
by: Muppidi, Akshay, et al.
Published: (2025)
by: Muppidi, Akshay, et al.
Published: (2025)
Audio Question Answering with GRPO-Based Fine-Tuning and Calibrated Segment-Level Predictions
by: Gibier, Marcel, et al.
Published: (2025)
by: Gibier, Marcel, et al.
Published: (2025)
Revisit Modality Imbalance at the Decision Layer
by: Ma, Xiaoyu, et al.
Published: (2025)
by: Ma, Xiaoyu, et al.
Published: (2025)
How to Count Coughs: An Event-Based Framework for Evaluating Automatic Cough Detection Algorithm Performance
by: Orlandic, Lara, et al.
Published: (2024)
by: Orlandic, Lara, et al.
Published: (2024)
Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings
by: Wisnu, Dyah A. M. G., et al.
Published: (2025)
by: Wisnu, Dyah A. M. G., et al.
Published: (2025)
GE2E-AC: Generalized End-to-End Loss Training for Accent Classification
by: Watanabe, Chihiro, et al.
Published: (2024)
by: Watanabe, Chihiro, et al.
Published: (2024)
Improving Out-of-Domain Audio Deepfake Detection via Layer Selection and Fusion of SSL-Based Countermeasures
by: Serrano, Pierre, et al.
Published: (2025)
by: Serrano, Pierre, et al.
Published: (2025)
Automatic Identification of Samples in Hip-Hop Music via Multi-Loss Training and an Artificial Dataset
by: Cheston, Huw, et al.
Published: (2025)
by: Cheston, Huw, et al.
Published: (2025)
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
by: Zhao, Shengkui, et al.
Published: (2021)
by: Zhao, Shengkui, et al.
Published: (2021)
Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments
by: Choudhury, Ritabrata Roy
Published: (2024)
by: Choudhury, Ritabrata Roy
Published: (2024)
Hybrid Losses for Hierarchical Embedding Learning
by: Tian, Haokun, et al.
Published: (2025)
by: Tian, Haokun, et al.
Published: (2025)
Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)
by: Jeong, Seung Gyu, et al.
Published: (2025)
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
by: Fu, Yonggan, et al.
Published: (2022)
by: Fu, Yonggan, et al.
Published: (2022)
Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models
by: Yoo, Suho, et al.
Published: (2025)
by: Yoo, Suho, et al.
Published: (2025)
What Do Language Models Hear? Probing for Auditory Representations in Language Models
by: Ngo, Jerry, et al.
Published: (2024)
by: Ngo, Jerry, et al.
Published: (2024)
On the Condition Monitoring of Bolted Joints through Acoustic Emission and Deep Transfer Learning: Generalization, Ordinal Loss and Super-Convergence
by: Ramasso, Emmanuel, et al.
Published: (2024)
by: Ramasso, Emmanuel, et al.
Published: (2024)
Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay
by: Hinrichs, Reemt, et al.
Published: (2025)
by: Hinrichs, Reemt, et al.
Published: (2025)
An Independence-promoting Loss for Music Generation with Language Models
by: Lemercier, Jean-Marie, et al.
Published: (2024)
by: Lemercier, Jean-Marie, et al.
Published: (2024)
Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2025)
by: Lee, Junwon, et al.
Published: (2025)
Testing chatbots on the creation of encoders for audio conditioned image generation
by: León, Jorge E., et al.
Published: (2025)
by: León, Jorge E., et al.
Published: (2025)
What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection
by: Nguyen, Binh, et al.
Published: (2025)
by: Nguyen, Binh, et al.
Published: (2025)
An Attention Long Short-Term Memory based system for automatic classification of speech intelligibility
by: Fernández-Díaz, Miguel, et al.
Published: (2024)
by: Fernández-Díaz, Miguel, et al.
Published: (2024)
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features
by: Teixeira, Francisco, et al.
Published: (2024)
by: Teixeira, Francisco, et al.
Published: (2024)
Representation-Based Data Quality Audits for Audio
by: Gonzalez-Jimenez, Alvaro, et al.
Published: (2025)
by: Gonzalez-Jimenez, Alvaro, et al.
Published: (2025)
Respiratory Disease Classification and Biometric Analysis Using Biosignals from Digital Stethoscopes
by: Casado, Constantino Álvarez, et al.
Published: (2023)
by: Casado, Constantino Álvarez, et al.
Published: (2023)
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network
by: Dissen, Yehoshua, et al.
Published: (2024)
by: Dissen, Yehoshua, et al.
Published: (2024)
Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition
by: Tripathi, Suraj, et al.
Published: (2019)
by: Tripathi, Suraj, et al.
Published: (2019)
Similar Items
-
Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids
by: Mayuravaani, Mathuranathan, et al.
Published: (2026) -
Fine-grained Soundscape Control for Augmented Hearing
by: Oh, Seunghyun, et al.
Published: (2026) -
Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired
by: Jiayu, et al.
Published: (2025) -
Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
by: Zezario, Ryandhimas E., et al.
Published: (2023) -
Improving Machine Hearing on Limited Data Sets
by: Harar, Pavol, et al.
Published: (2019)