| Main Authors: | Mo, Kaien, Wang, Xianrui, Yang, Yichen, Makino, Shoji, Chen, Jingdong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.09821 |
Similar Items
Accelerated Convolutive Transfer Function-Based Multichannel NMF Using Iterative Source Steering
by: Xie, Xuemai, et al.
Published: (2025)
Robust Online Overdetermined Independent Vector Analysis Based on Bilinear Decomposition
by: Chen, Kang, et al.
Published: (2026)
Neural Network-Based Time-Frequency-Bin-Wise Linear Combination of Beamformers for Underdetermined Target Source Extraction
by: Chen, Changda, et al.
Published: (2026)
Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation
by: Wang, Jianyu, et al.
Published: (2024)
Speech dereverberation constrained on room impulse response characteristics
by: Bahrman, Louis, et al.
Published: (2024)
Determined blind source separation via modeling adjacent frequency band correlations in speech signals
by: Wang, Jianyu, et al.
Published: (2025)
Online neural fusion of distortionless differential beamformers for robust speech enhancement
by: Qian, Yuanhang, et al.
Published: (2025)
Entropy-Guided GRVQ for Ultra-Low Bitrate Neural Speech Codec
by: Ren, Yanzhou, et al.
Published: (2026)
Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement
by: Mullins, Sarabeth S., et al.
Published: (2025)
Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
by: Zhang, Shiqi, et al.
Published: (2024)
DNCASR: End-to-End Training for Speaker-Attributed ASR
by: Zheng, Xianrui, et al.
Published: (2025)
Multichannel blind speech source separation with a disjoint constraint source model
by: Wang, Jianyu, et al.
Published: (2024)
PlumberNet: Fixing interference leakage after GEV beamforming
by: Grondin, François, et al.
Published: (2023)
Why does music source separation benefit from cacophony?
by: Jeon, Chang-Bin, et al.
Published: (2024)
Deep learning based spatial aliasing reduction in beamforming for audio capture
by: Guzik, Mateusz, et al.
Published: (2025)
SOT Triggered Neural Clustering for Speaker Attributed ASR
by: Zheng, Xianrui, et al.
Published: (2024)
DualSep: A Light-weight dual-encoder convolutional recurrent network for real-time in-car speech separation
by: Wang, Ziqian, et al.
Published: (2024)
Representational learning for an anomalous sound detection system with source separation model
by: Shin, Seunghyeon, et al.
Published: (2024)
Determined Blind Source Separation with Sinkhorn Divergence-based Optimal Allocation of the Source Power
by: Wang, Jianyu, et al.
Published: (2025)
Adaptive high-precision sound source localization at low frequencies based on convolutional neural network
by: Ma, Wenbo, et al.
Published: (2024)
A two-step approach for speech enhancement in low-SNR scenarios using cyclostationary beamforming and DNNs
by: Bologni, Giovanni, et al.
Published: (2026)
Adaptive Federated Fine-Tuning of Self-Supervised Speech Representations
by: Guo, Xin, et al.
Published: (2026)
Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks
by: Gonzalez-Martinez, F. D., et al.
Published: (2024)
Forward Convolutive Prediction for Frame Online Monaural Speech Dereverberation Based on Kronecker Product Decomposition
by: Zhu, Yujie, et al.
Published: (2025)
Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement
by: Zheng, Tianqin, et al.
Published: (2025)
Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?
by: Hiroe, Atsuo, et al.
Published: (2024)
Direction-of-Arrival and Noise Covariance Matrix joint estimation for beamforming
by: Curtarelli, Vitor Gelsleichter Probst, et al.
Published: (2025)
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
by: Yue, Haobo, et al.
Published: (2024)
Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training
by: Feng, Jianyuan, et al.
Published: (2025)
A Directional-Derivative-Constrained Method for Continuously Steerable Differential Beamformers with Uniform Circular Arrays
by: Xiong, Tiantian, et al.
Published: (2026)
IPDnet2: an efficient and improved inter-channel phase difference estimation network for sound source localization
by: Wang, Yabo, et al.
Published: (2025)
Frequency-aware convolution for sound event detection
by: Song, Tao, et al.
Published: (2024)
Advances in Microphone Array Processing and Multichannel Speech Enhancement
by: Huang, Gongping, et al.
Published: (2025)
Rethinking the joint estimation of magnitude and phase for time-frequency domain neural vocoders
by: Dai, Lingling, et al.
Published: (2025)
What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution
by: Ho, Kuan-Hsun, et al.
Published: (2024)
Towards detecting the pathological subharmonic voicing with fully convolutional neural networks
by: Ikuma, Takeshi, et al.
Published: (2025)
Omni-directional attention mechanism based on Mamba for speech separation
by: Xue, Ke, et al.
Published: (2026)
Advancing Continual Learning for Robust Deepfake Audio Classification
by: Dong, Feiyi, et al.
Published: (2024)
Singer separation for karaoke content generation
by: Lin, Hsuan-Yu, et al.
Published: (2021)
Multispecies bird sound recognition using a fully convolutional neural network
by: García-Ordás, María Teresa, et al.
Published: (2024)