Saved in:
| Main Authors: | Kotani, Rina, Miyazaki, Chiaki, Suzuki, Shiro |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.01929 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Binaural Localization Model for Speech in Noise
by: Tokala, Vikas, et al.
Published: (2025)
by: Tokala, Vikas, et al.
Published: (2025)
Non-Intrusive Binaural Speech Intelligibility Prediction Using Mamba for Hearing-Impaired Listeners
by: Yamamoto, Katsuhiko, et al.
Published: (2025)
by: Yamamoto, Katsuhiko, et al.
Published: (2025)
Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
by: Tokala, Vikas, et al.
Published: (2024)
by: Tokala, Vikas, et al.
Published: (2024)
Binaural Speech Enhancement Using Complex Convolutional Recurrent Networks
by: Tokala, Vikas, et al.
Published: (2025)
by: Tokala, Vikas, et al.
Published: (2025)
Phase Aware Ear-Conditioned Learning for Multi-Channel Binaural Speaker Separation
by: Jeremiah, Ruben Johnson Robert, et al.
Published: (2025)
by: Jeremiah, Ruben Johnson Robert, et al.
Published: (2025)
BANC: Towards Efficient Binaural Audio Neural Codec for Overlapping Speech
by: Ratnarajah, Anton, et al.
Published: (2023)
by: Ratnarajah, Anton, et al.
Published: (2023)
Zero-Shot Mono-to-Binaural Speech Synthesis
by: Levkovitch, Alon, et al.
Published: (2024)
by: Levkovitch, Alon, et al.
Published: (2024)
SpatialNet with Binaural Loss Function for Correcting Binaural Signal Matching Outputs under Head Rotations
by: Shamay, Dor, et al.
Published: (2025)
by: Shamay, Dor, et al.
Published: (2025)
A Lightweight Fourier-based Network for Binaural Speech Enhancement with Spatial Cue Preservation
by: Lu, Xikun, et al.
Published: (2025)
by: Lu, Xikun, et al.
Published: (2025)
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models
by: Liang, Susan, et al.
Published: (2025)
by: Liang, Susan, et al.
Published: (2025)
Spatial Speech Translation: Translating Across Space With Binaural Hearables
by: Chen, Tuochao, et al.
Published: (2025)
by: Chen, Tuochao, et al.
Published: (2025)
Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for Speech Recognition
by: Masuyama, Yoshiki, et al.
Published: (2024)
by: Masuyama, Yoshiki, et al.
Published: (2024)
Binaural Signal Matching with Wearable Arrays for Near-Field Sources
by: Goldring, Sapir, et al.
Published: (2025)
by: Goldring, Sapir, et al.
Published: (2025)
Unmasking Deepfakes: Leveraging Augmentations and Features Variability for Deepfake Speech Detection
by: Rimon, Inbal, et al.
Published: (2025)
by: Rimon, Inbal, et al.
Published: (2025)
Exploring the Capability of Mamba in Speech Applications
by: Miyazaki, Koichi, et al.
Published: (2024)
by: Miyazaki, Koichi, et al.
Published: (2024)
Deep Learning for Personalized Binaural Audio Reproduction
by: Lu, Xikun, et al.
Published: (2025)
by: Lu, Xikun, et al.
Published: (2025)
Ambisonics Encoder for Wearable Array with Improved Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2025)
by: Gayer, Yhonatan, et al.
Published: (2025)
Blind Identification of Binaural Room Impulse Responses from Smart Glasses
by: Deppisch, Thomas, et al.
Published: (2024)
by: Deppisch, Thomas, et al.
Published: (2024)
An Attribute Interpolation Method in Speech Synthesis by Model Merging
by: Murata, Masato, et al.
Published: (2024)
by: Murata, Masato, et al.
Published: (2024)
Lightweight Implicit Neural Network for Binaural Audio Synthesis
by: Lu, Xikun, et al.
Published: (2025)
by: Lu, Xikun, et al.
Published: (2025)
Binaural Target Speaker Extraction using Individualized HRTF
by: Ellinson, Yoav, et al.
Published: (2025)
by: Ellinson, Yoav, et al.
Published: (2025)
Perceptually Transparent Binaural Auralization of Simulated Sound Fields
by: Ahrens, Jens
Published: (2024)
by: Ahrens, Jens
Published: (2024)
Interpretable Binaural Deep Beamforming Guided by Time-Varying Relative Transfer Function
by: Zaidel, Ilai, et al.
Published: (2025)
by: Zaidel, Ilai, et al.
Published: (2025)
Binaural Signal Matching with Wearable Arrays for Near-Field Sources and Directional Focus
by: Goldring, Sapir, et al.
Published: (2025)
by: Goldring, Sapir, et al.
Published: (2025)
Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2024)
by: Gayer, Yhonatan, et al.
Published: (2024)
Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)
by: Behringer, Lyonel, et al.
Published: (2026)
LuSeeL: Language-queried Binaural Universal Sound Event Extraction and Localization
by: Pan, Zexu, et al.
Published: (2026)
by: Pan, Zexu, et al.
Published: (2026)
Design and Analysis of Binaural Signal Matching with Arbitrary Microphone Arrays and Listener Head Rotations
by: Madmoni, Lior, et al.
Published: (2024)
by: Madmoni, Lior, et al.
Published: (2024)
A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation
by: Wang, Jingyuan, et al.
Published: (2024)
by: Wang, Jingyuan, et al.
Published: (2024)
Ambisonics Binaural Rendering via Masked Magnitude Least Squares
by: Berebi, Or, et al.
Published: (2025)
by: Berebi, Or, et al.
Published: (2025)
Binamix -- A Python Library for Generating Binaural Audio Datasets
by: Barry, Dan, et al.
Published: (2025)
by: Barry, Dan, et al.
Published: (2025)
Binaural rendering from microphone array signals of arbitrary geometry
by: Iijima, Naoto, et al.
Published: (2021)
by: Iijima, Naoto, et al.
Published: (2021)
Binaural Selective Attention Model for Target Speaker Extraction
by: Meng, Hanyu, et al.
Published: (2024)
by: Meng, Hanyu, et al.
Published: (2024)
Binaural Angular Separation Network
by: Yang, Yang, et al.
Published: (2024)
by: Yang, Yang, et al.
Published: (2024)
HRTF-guided Binaural Target Speaker Extraction with Real-World Validation
by: Ellinson, Yoav, et al.
Published: (2026)
by: Ellinson, Yoav, et al.
Published: (2026)
Investigation of Speech and Noise Latent Representations in Single-channel VAE-based Speech Enhancement
by: Li, Jiatong, et al.
Published: (2025)
by: Li, Jiatong, et al.
Published: (2025)
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend
by: Jukić, Ante, et al.
Published: (2024)
by: Jukić, Ante, et al.
Published: (2024)
Noise-robust Speech Separation with Fast Generative Correction
by: Wang, Helin, et al.
Published: (2024)
by: Wang, Helin, et al.
Published: (2024)
Binaural Sound Event Localization and Detection based on HRTF Cues for Humanoid Robots
by: Lee, Gyeong-Tae, et al.
Published: (2025)
by: Lee, Gyeong-Tae, et al.
Published: (2025)
BINAQUAL: A Full-Reference Objective Localization Similarity Metric for Binaural Audio
by: Panah, Davoud Shariat, et al.
Published: (2025)
by: Panah, Davoud Shariat, et al.
Published: (2025)
Similar Items
-
Binaural Localization Model for Speech in Noise
by: Tokala, Vikas, et al.
Published: (2025) -
Non-Intrusive Binaural Speech Intelligibility Prediction Using Mamba for Hearing-Impaired Listeners
by: Yamamoto, Katsuhiko, et al.
Published: (2025) -
Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
by: Tokala, Vikas, et al.
Published: (2024) -
Binaural Speech Enhancement Using Complex Convolutional Recurrent Networks
by: Tokala, Vikas, et al.
Published: (2025) -
Phase Aware Ear-Conditioned Learning for Multi-Channel Binaural Speaker Separation
by: Jeremiah, Ruben Johnson Robert, et al.
Published: (2025)