:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Fu, Linya, Liu, Yu, Liu, Zhijie, Yang, Zedong, Wang, Zhong-Qiu, Li, Youfu, Kong, He
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing Sound
Online Access:	https://arxiv.org/abs/2506.02773
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Perceptually Transparent Binaural Auralization of Simulated Sound Fields
by: Ahrens, Jens
Published: (2024)

Binaural Selective Attention Model for Target Speaker Extraction
by: Meng, Hanyu, et al.
Published: (2024)

BANC: Towards Efficient Binaural Audio Neural Codec for Overlapping Speech
by: Ratnarajah, Anton, et al.
Published: (2023)

Binaural Target Speaker Extraction using Individualized HRTF
by: Ellinson, Yoav, et al.
Published: (2025)

HRTF-guided Binaural Target Speaker Extraction with Real-World Validation
by: Ellinson, Yoav, et al.
Published: (2026)

Multi-Speaker DOA Estimation in Binaural Hearing Aids using Deep Learning and Speaker Count Fusion
by: Jazaeri, Farnaz, et al.
Published: (2025)

Listen to Extract: Onset-Prompted Target Speaker Extraction
by: Shen, Pengjie, et al.
Published: (2025)

Binaural Sound Event Localization and Detection based on HRTF Cues for Humanoid Robots
by: Lee, Gyeong-Tae, et al.
Published: (2025)

Binaural Sound Event Localization and Detection Neural Network based on HRTF Localization Cues for Humanoid Robots
by: Lee, Gyeong-Tae
Published: (2025)

PhiNet: Speaker Verification with Phonetic Interpretability
by: Ma, Yi, et al.
Published: (2026)

Lightweight Implicit Neural Network for Binaural Audio Synthesis
by: Lu, Xikun, et al.
Published: (2025)

Deep Learning for Personalized Binaural Audio Reproduction
by: Lu, Xikun, et al.
Published: (2025)

Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2024)

BINAQUAL: A Full-Reference Objective Localization Similarity Metric for Binaural Audio
by: Panah, Davoud Shariat, et al.
Published: (2025)

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor
by: Lee, Younglo, et al.
Published: (2024)

Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR
by: Pražák, Aleš, et al.
Published: (2025)

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
by: Pálka, Petr, et al.
Published: (2024)

Binaural Angular Separation Network
by: Yang, Yang, et al.
Published: (2024)

Accelerated Interactive Auralization of Highly Reverberant Spaces using Graphics Hardware
by: Rosseel, Hannes, et al.
Published: (2025)

Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction
by: Li, Jia, et al.
Published: (2026)

Exploiting an External Microphone for Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2023)

Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm
by: Li, Zhaoyang, et al.
Published: (2025)

MASV: Speaker Verification with Global and Local Context Mamba
by: Liu, Yang, et al.
Published: (2024)

Xi+: Uncertainty Supervision for Robust Speaker Embedding
by: Li, Junjie, et al.
Published: (2025)

Speaker Characterization by means of Attention Pooling
by: Costa, Federico, et al.
Published: (2024)

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
by: Liu, Tianchi, et al.
Published: (2023)

A Lightweight Fourier-based Network for Binaural Speech Enhancement with Spatial Cue Preservation
by: Lu, Xikun, et al.
Published: (2025)

Overlap-Adaptive Hybrid Speaker Diarization and ASR-Aware Observation Addition for MISP 2025 Challenge
by: Huang, Shangkun, et al.
Published: (2025)

ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation
by: Fu, Ruibo, et al.
Published: (2024)

Emotion Recognition in Multi-Speaker Conversations through Speaker Identification, Knowledge Distillation, and Hierarchical Fusion
by: Li, Xiao, et al.
Published: (2025)

Ambisonics Binaural Rendering via Masked Magnitude Least Squares
by: Berebi, Or, et al.
Published: (2025)

Binaural rendering from microphone array signals of arbitrary geometry
by: Iijima, Naoto, et al.
Published: (2021)

Binamix -- A Python Library for Generating Binaural Audio Datasets
by: Barry, Dan, et al.
Published: (2025)

Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
by: Wang, Shuai, et al.
Published: (2024)

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
by: Truong, Duc-Tuan, et al.
Published: (2023)

Beyond Speaker Identity: Text Guided Target Speech Extraction
by: Huo, Mingyue, et al.
Published: (2025)

Libri2Vox Dataset: Target Speaker Extraction with Diverse Speaker Conditions and Synthetic Data
by: Liu, Yun, et al.
Published: (2024)

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models
by: Liang, Susan, et al.
Published: (2025)

Target Speaker Extraction with Curriculum Learning
by: Liu, Yun, et al.
Published: (2024)

Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
by: Zhang, Yuewei, et al.
Published: (2025)