Library Catalog

Bibliographic Details
Main Authors:	Panah, Davoud Shariat; Barry, Dan; Ragano, Alessandro; Skoglund, Jan; Hines, Andrew
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing; Sound
Online Access:	https://arxiv.org/abs/2505.11915

Similar Items

Binamix -- A Python Library for Generating Binaural Audio Datasets
by: Barry, Dan, et al.
Published: (2025)

NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment
by: Ragano, Alessandro, et al.
Published: (2023)

SCOREQ: Speech Quality Assessment with Contrastive Regression
by: Ragano, Alessandro, et al.
Published: (2024)

Binaspect -- A Python Library for Binaural Audio Analysis, Visualization & Feature Generation
by: Barry, Dan, et al.
Published: (2025)

Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models
by: Ullah, Asad, et al.
Published: (2023)

Beyond Correlation: Evaluating Multimedia Quality Models with the Constrained Concordance Index
by: Ragano, Alessandro, et al.
Published: (2024)

Deep Learning for Personalized Binaural Audio Reproduction
by: Lu, Xikun, et al.
Published: (2025)

Respiratory Inhaler Sound Event Classification Using Self-Supervised Learning
by: Panah, Davoud Shariat, et al.
Published: (2025)

Lightweight Implicit Neural Network for Binaural Audio Synthesis
by: Lu, Xikun, et al.
Published: (2025)

BANC: Towards Efficient Binaural Audio Neural Codec for Overlapping Speech
by: Ratnarajah, Anton, et al.
Published: (2023)

AudioBERTScore: Objective Evaluation of Environmental Sound Synthesis Based on Similarity of Audio embedding Sequences
by: Kishi, Minoru, et al.
Published: (2025)

Neural Speech and Audio Coding: Modern AI Technology Meets Traditional Codecs
by: Kim, Minje, et al.
Published: (2024)

BAST: Binaural Audio Spectrogram Transformer for Binaural Sound Localization
by: Kuang, Sheng, et al.
Published: (2022)

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
by: Xiao, Feiyang, et al.
Published: (2024)

Binaural Sound Event Localization and Detection based on HRTF Cues for Humanoid Robots
by: Lee, Gyeong-Tae, et al.
Published: (2025)

Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?
by: Namballa, Richa, et al.
Published: (2025)

Binaural Sound Event Localization and Detection Neural Network based on HRTF Localization Cues for Humanoid Robots
by: Lee, Gyeong-Tae
Published: (2025)

Stereo Audio Rendering for Personal Sound Zones Using a Binaural Spatially Adaptive Neural Network (BSANN)
by: Jiang, Hao, et al.
Published: (2026)

AuralNet: Hierarchical Attention-based 3D Binaural Localization of Overlapping Speakers
by: Fu, Linya, et al.
Published: (2025)

TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
by: He, Yuxuan, et al.
Published: (2025)

Perceptually Transparent Binaural Auralization of Simulated Sound Fields
by: Ahrens, Jens
Published: (2024)

Binaural Target Speaker Extraction using Individualized HRTF
by: Ellinson, Yoav, et al.
Published: (2025)

Ambisonics Binaural Rendering via Masked Magnitude Least Squares
by: Berebi, Or, et al.
Published: (2025)

Binaural rendering from microphone array signals of arbitrary geometry
by: Iijima, Naoto, et al.
Published: (2021)

SHroom: A Python Framework for Ambisonics Room Acoustics Simulation and Binaural Rendering
by: Gayer, Yhonatan
Published: (2026)

Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction
by: Li, Jia, et al.
Published: (2026)

Assessing the Alignment of Audio Representations with Timbre Similarity Ratings
by: Tian, Haokun, et al.
Published: (2025)

Masked Audio Modeling with CLAP and Multi-Objective Learning
by: Xin, Yifei, et al.
Published: (2024)

HRTF-guided Binaural Target Speaker Extraction with Real-World Validation
by: Ellinson, Yoav, et al.
Published: (2026)

CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation
by: Chen, Yuanhong, et al.
Published: (2025)

A Lightweight Fourier-based Network for Binaural Speech Enhancement with Spatial Cue Preservation
by: Lu, Xikun, et al.
Published: (2025)

Evaluating Sound Similarity Metrics for Differentiable, Iterative Sound-Matching
by: Salimi, Amir, et al.
Published: (2025)

Binaural Selective Attention Model for Target Speaker Extraction
by: Meng, Hanyu, et al.
Published: (2024)

Binaural Angular Separation Network
by: Yang, Yang, et al.
Published: (2024)

Non-Intrusive Binaural Speech Intelligibility Prediction Using Mamba for Hearing-Impaired Listeners
by: Yamamoto, Katsuhiko, et al.
Published: (2025)

Binaural sound source localization using a hybrid time and frequency domain model
by: Geva, Gil, et al.
Published: (2024)

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models
by: Liang, Susan, et al.
Published: (2025)

Optimal Transport Audio Distance with Learned Riemannian Ground Metrics
by: Jeong, Wonwoo
Published: (2026)

The Extended SONICOM HRTF Dataset and Spatial Audio Metrics Toolbox
by: Poole, Katarina C., et al.
Published: (2025)

Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement
by: Chao, Rong, et al.
Published: (2025)