:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kotani, Rina, Miyazaki, Chiaki, Suzuki, Shiro
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2509.01929
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Binaural Localization Model for Speech in Noise
by: Tokala, Vikas, et al.
Published: (2025)

Non-Intrusive Binaural Speech Intelligibility Prediction Using Mamba for Hearing-Impaired Listeners
by: Yamamoto, Katsuhiko, et al.
Published: (2025)

Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
by: Tokala, Vikas, et al.
Published: (2024)

Binaural Speech Enhancement Using Complex Convolutional Recurrent Networks
by: Tokala, Vikas, et al.
Published: (2025)

Phase Aware Ear-Conditioned Learning for Multi-Channel Binaural Speaker Separation
by: Jeremiah, Ruben Johnson Robert, et al.
Published: (2025)

BANC: Towards Efficient Binaural Audio Neural Codec for Overlapping Speech
by: Ratnarajah, Anton, et al.
Published: (2023)

Zero-Shot Mono-to-Binaural Speech Synthesis
by: Levkovitch, Alon, et al.
Published: (2024)

SpatialNet with Binaural Loss Function for Correcting Binaural Signal Matching Outputs under Head Rotations
by: Shamay, Dor, et al.
Published: (2025)

A Lightweight Fourier-based Network for Binaural Speech Enhancement with Spatial Cue Preservation
by: Lu, Xikun, et al.
Published: (2025)

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models
by: Liang, Susan, et al.
Published: (2025)

Spatial Speech Translation: Translating Across Space With Binaural Hearables
by: Chen, Tuochao, et al.
Published: (2025)

Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for Speech Recognition
by: Masuyama, Yoshiki, et al.
Published: (2024)

Binaural Signal Matching with Wearable Arrays for Near-Field Sources
by: Goldring, Sapir, et al.
Published: (2025)

Unmasking Deepfakes: Leveraging Augmentations and Features Variability for Deepfake Speech Detection
by: Rimon, Inbal, et al.
Published: (2025)

Exploring the Capability of Mamba in Speech Applications
by: Miyazaki, Koichi, et al.
Published: (2024)

Deep Learning for Personalized Binaural Audio Reproduction
by: Lu, Xikun, et al.
Published: (2025)

Ambisonics Encoder for Wearable Array with Improved Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2025)

Blind Identification of Binaural Room Impulse Responses from Smart Glasses
by: Deppisch, Thomas, et al.
Published: (2024)

An Attribute Interpolation Method in Speech Synthesis by Model Merging
by: Murata, Masato, et al.
Published: (2024)

Lightweight Implicit Neural Network for Binaural Audio Synthesis
by: Lu, Xikun, et al.
Published: (2025)

Binaural Target Speaker Extraction using Individualized HRTF
by: Ellinson, Yoav, et al.
Published: (2025)

Perceptually Transparent Binaural Auralization of Simulated Sound Fields
by: Ahrens, Jens
Published: (2024)

Interpretable Binaural Deep Beamforming Guided by Time-Varying Relative Transfer Function
by: Zaidel, Ilai, et al.
Published: (2025)

Binaural Signal Matching with Wearable Arrays for Near-Field Sources and Directional Focus
by: Goldring, Sapir, et al.
Published: (2025)

Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2024)

Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)

LuSeeL: Language-queried Binaural Universal Sound Event Extraction and Localization
by: Pan, Zexu, et al.
Published: (2026)

Design and Analysis of Binaural Signal Matching with Arbitrary Microphone Arrays and Listener Head Rotations
by: Madmoni, Lior, et al.
Published: (2024)

A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation
by: Wang, Jingyuan, et al.
Published: (2024)

Ambisonics Binaural Rendering via Masked Magnitude Least Squares
by: Berebi, Or, et al.
Published: (2025)

Binamix -- A Python Library for Generating Binaural Audio Datasets
by: Barry, Dan, et al.
Published: (2025)

Binaural rendering from microphone array signals of arbitrary geometry
by: Iijima, Naoto, et al.
Published: (2021)

Binaural Selective Attention Model for Target Speaker Extraction
by: Meng, Hanyu, et al.
Published: (2024)

Binaural Angular Separation Network
by: Yang, Yang, et al.
Published: (2024)

HRTF-guided Binaural Target Speaker Extraction with Real-World Validation
by: Ellinson, Yoav, et al.
Published: (2026)

Investigation of Speech and Noise Latent Representations in Single-channel VAE-based Speech Enhancement
by: Li, Jiatong, et al.
Published: (2025)

Flexible Multichannel Speech Enhancement for Noise-Robust Frontend
by: Jukić, Ante, et al.
Published: (2024)

Noise-robust Speech Separation with Fast Generative Correction
by: Wang, Helin, et al.
Published: (2024)

Binaural Sound Event Localization and Detection based on HRTF Cues for Humanoid Robots
by: Lee, Gyeong-Tae, et al.
Published: (2025)

BINAQUAL: A Full-Reference Objective Localization Similarity Metric for Binaural Audio
by: Panah, Davoud Shariat, et al.
Published: (2025)