Saved in:
| Main Authors: | Li, Jiatong, Middelberg, Wiebke, Doclo, Simon |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.18442 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
BRUDEX Database: Binaural Room Impulse Responses with Uniformly Distributed External Microphones
by: Fejgin, Daniel, et al.
Published: (2023)
by: Fejgin, Daniel, et al.
Published: (2023)
I-DCCRN-VAE: An Improved Deep Representation Learning Framework for Complex VAE-based Single-channel Speech Enhancement
by: Li, Jiatong, et al.
Published: (2025)
by: Li, Jiatong, et al.
Published: (2025)
Investigation of Speech and Noise Latent Representations in Single-channel VAE-based Speech Enhancement
by: Li, Jiatong, et al.
Published: (2025)
by: Li, Jiatong, et al.
Published: (2025)
Coherence-Based Frequency Subset Selection For Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2022)
by: Fejgin, Daniel, et al.
Published: (2022)
MC-LExt: Multi-Channel Target Speaker Extraction with Onset-Prompted Speaker Conditioning Mechanism
by: Ling, Tongtao, et al.
Published: (2025)
by: Ling, Tongtao, et al.
Published: (2025)
GAN-Based Multi-Microphone Spatial Target Speaker Extraction
by: Shetu, Shrishti Saha, et al.
Published: (2025)
by: Shetu, Shrishti Saha, et al.
Published: (2025)
Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
by: Tammen, Marvin, et al.
Published: (2024)
by: Tammen, Marvin, et al.
Published: (2024)
Exploiting an External Microphone for Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2023)
by: Fejgin, Daniel, et al.
Published: (2023)
Completing Sets of Prototype Transfer Functions for Subspace-based Direction of Arrival Estimation of Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2025)
by: Fejgin, Daniel, et al.
Published: (2025)
Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
by: Shao, Yiwen, et al.
Published: (2024)
by: Shao, Yiwen, et al.
Published: (2024)
Multi-Level Speaker Representation for Target Speaker Extraction
by: Zhang, Ke, et al.
Published: (2024)
by: Zhang, Ke, et al.
Published: (2024)
Spatially Selective Active Noise Control for Open-fitting Hearables with Acausal Optimization
by: Xiao, Tong, et al.
Published: (2025)
by: Xiao, Tong, et al.
Published: (2025)
DNN-Based Online Source Counting Based on Spatial Generalized Magnitude Squared Coherence
by: Gode, Henri, et al.
Published: (2026)
by: Gode, Henri, et al.
Published: (2026)
Multi-Source Position and Direction-of-Arrival Estimation Based on Euclidean Distance Matrices
by: Brümann, Klaus, et al.
Published: (2025)
by: Brümann, Klaus, et al.
Published: (2025)
Multi-View Based Audio Visual Target Speaker Extraction
by: Yang, Peijun, et al.
Published: (2026)
by: Yang, Peijun, et al.
Published: (2026)
Microphone Occlusion Mitigation for Own-Voice Enhancement in Head-Worn Microphone Arrays Using Switching-Adaptive Beamforming
by: Middelberg, Wiebke, et al.
Published: (2025)
by: Middelberg, Wiebke, et al.
Published: (2025)
Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention
by: Tao, Ruijie, et al.
Published: (2024)
by: Tao, Ruijie, et al.
Published: (2024)
Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters
by: Tesch, Kristina, et al.
Published: (2023)
by: Tesch, Kristina, et al.
Published: (2023)
Closed-Form Successive Relative Transfer Function Vector Estimation based on Blind Oblique Projection Incorporating Noise Whitening
by: Gode, Henri, et al.
Published: (2025)
by: Gode, Henri, et al.
Published: (2025)
Soft-Constrained Spatially Selective Active Noise Control for Open-fitting Hearables
by: Xiao, Tong, et al.
Published: (2025)
by: Xiao, Tong, et al.
Published: (2025)
Multi-Microphone Noise Data Augmentation for DNN-based Own Voice Reconstruction for Hearables in Noisy Environments
by: Ohlenbusch, Mattes, et al.
Published: (2023)
by: Ohlenbusch, Mattes, et al.
Published: (2023)
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
by: Wang, Shuai, et al.
Published: (2024)
by: Wang, Shuai, et al.
Published: (2024)
Steering Deep Non-Linear Spatially Selective Filters for Weakly Guided Extraction of Moving Speakers in Dynamic Scenarios
by: Kienegger, Jakob, et al.
Published: (2025)
by: Kienegger, Jakob, et al.
Published: (2025)
Binaural Selective Attention Model for Target Speaker Extraction
by: Meng, Hanyu, et al.
Published: (2024)
by: Meng, Hanyu, et al.
Published: (2024)
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization
by: Cheng, Ming, et al.
Published: (2024)
by: Cheng, Ming, et al.
Published: (2024)
Libri2Vox Dataset: Target Speaker Extraction with Diverse Speaker Conditions and Synthetic Data
by: Liu, Yun, et al.
Published: (2024)
by: Liu, Yun, et al.
Published: (2024)
Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2024)
by: Fejgin, Daniel, et al.
Published: (2024)
Robust Soft-Constrained Spatially Selective Active Noise Control for Hearables Under Secondary Path Variations
by: Xiao, Tong, et al.
Published: (2026)
by: Xiao, Tong, et al.
Published: (2026)
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
by: Zeng, Bang, et al.
Published: (2024)
by: Zeng, Bang, et al.
Published: (2024)
EvoTSE: Evolving Enrollment for Target Speaker Extraction
by: Liu, Zikai, et al.
Published: (2026)
by: Liu, Zikai, et al.
Published: (2026)
Target Speaker Selection for Neural Network Beamforming in Multi-Speaker Scenarios
by: Fiorio, Luan Vinícius, et al.
Published: (2025)
by: Fiorio, Luan Vinícius, et al.
Published: (2025)
Self-Steering Deep Non-Linear Spatially Selective Filters for Efficient Extraction of Moving Speakers under Weak Guidance
by: Kienegger, Jakob, et al.
Published: (2025)
by: Kienegger, Jakob, et al.
Published: (2025)
Detect, Attend and Extract: Keyword Guided Target Speaker Extraction
by: Li, Haoyu, et al.
Published: (2026)
by: Li, Haoyu, et al.
Published: (2026)
Training Strategies for Modality Dropout Resilient Multi-Modal Target Speaker Extraction
by: Korse, Srikanth, et al.
Published: (2025)
by: Korse, Srikanth, et al.
Published: (2025)
Reference Microphone Selection for the Weighted Prediction Error Algorithm using the Normalized L-p Norm
by: Lohmann, Anselm, et al.
Published: (2024)
by: Lohmann, Anselm, et al.
Published: (2024)
Microphone Subset Selection for the Weighted Prediction Error Algorithm using a Group Sparsity Penalty
by: Lohmann, Anselm, et al.
Published: (2024)
by: Lohmann, Anselm, et al.
Published: (2024)
Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation Exploiting a Calibrated External Microphone Array
by: Fejgin, Daniel, et al.
Published: (2022)
by: Fejgin, Daniel, et al.
Published: (2022)
Target Speaker Extraction with Curriculum Learning
by: Liu, Yun, et al.
Published: (2024)
by: Liu, Yun, et al.
Published: (2024)
Enhancing Target Speaker Extraction with Explicit Speaker Consistency Modeling
by: Wu, Shu, et al.
Published: (2025)
by: Wu, Shu, et al.
Published: (2025)
Adaptive Deterministic Flow Matching for Target Speaker Extraction
by: Hsieh, Tsun-An, et al.
Published: (2025)
by: Hsieh, Tsun-An, et al.
Published: (2025)
Similar Items
-
BRUDEX Database: Binaural Room Impulse Responses with Uniformly Distributed External Microphones
by: Fejgin, Daniel, et al.
Published: (2023) -
I-DCCRN-VAE: An Improved Deep Representation Learning Framework for Complex VAE-based Single-channel Speech Enhancement
by: Li, Jiatong, et al.
Published: (2025) -
Investigation of Speech and Noise Latent Representations in Single-channel VAE-based Speech Enhancement
by: Li, Jiatong, et al.
Published: (2025) -
Coherence-Based Frequency Subset Selection For Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers
by: Fejgin, Daniel, et al.
Published: (2022) -
MC-LExt: Multi-Channel Target Speaker Extraction with Onset-Prompted Speaker Conditioning Mechanism
by: Ling, Tongtao, et al.
Published: (2025)