:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mo, Kaien, Wang, Xianrui, Yang, Yichen, Makino, Shoji, Chen, Jingdong
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2406.09821
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Accelerated Convolutive Transfer Function-Based Multichannel NMF Using Iterative Source Steering
by: Xie, Xuemai, et al.
Published: (2025)

Robust Online Overdetermined Independent Vector Analysis Based on Bilinear Decomposition
by: Chen, Kang, et al.
Published: (2026)

Neural Network-Based Time-Frequency-Bin-Wise Linear Combination of Beamformers for Underdetermined Target Source Extraction
by: Chen, Changda, et al.
Published: (2026)

Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation
by: Wang, Jianyu, et al.
Published: (2024)

Speech dereverberation constrained on room impulse response characteristics
by: Bahrman, Louis, et al.
Published: (2024)

Determined blind source separation via modeling adjacent frequency band correlations in speech signals
by: Wang, Jianyu, et al.
Published: (2025)

Online neural fusion of distortionless differential beamformers for robust speech enhancement
by: Qian, Yuanhang, et al.
Published: (2025)

Entropy-Guided GRVQ for Ultra-Low Bitrate Neural Speech Codec
by: Ren, Yanzhou, et al.
Published: (2026)

Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement
by: Mullins, Sarabeth S., et al.
Published: (2025)

Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
by: Zhang, Shiqi, et al.
Published: (2024)

DNCASR: End-to-End Training for Speaker-Attributed ASR
by: Zheng, Xianrui, et al.
Published: (2025)

Multichannel blind speech source separation with a disjoint constraint source model
by: Wang, Jianyu, et al.
Published: (2024)

PlumberNet: Fixing interference leakage after GEV beamforming
by: Grondin, François, et al.
Published: (2023)

Why does music source separation benefit from cacophony?
by: Jeon, Chang-Bin, et al.
Published: (2024)

Deep learning based spatial aliasing reduction in beamforming for audio capture
by: Guzik, Mateusz, et al.
Published: (2025)

SOT Triggered Neural Clustering for Speaker Attributed ASR
by: Zheng, Xianrui, et al.
Published: (2024)

DualSep: A Light-weight dual-encoder convolutional recurrent network for real-time in-car speech separation
by: Wang, Ziqian, et al.
Published: (2024)

Representational learning for an anomalous sound detection system with source separation model
by: Shin, Seunghyeon, et al.
Published: (2024)

Determined Blind Source Separation with Sinkhorn Divergence-based Optimal Allocation of the Source Power
by: Wang, Jianyu, et al.
Published: (2025)

Adaptive high-precision sound source localization at low frequencies based on convolutional neural network
by: Ma, Wenbo, et al.
Published: (2024)

A two-step approach for speech enhancement in low-SNR scenarios using cyclostationary beamforming and DNNs
by: Bologni, Giovanni, et al.
Published: (2026)

Adaptive Federated Fine-Tuning of Self-Supervised Speech Representations
by: Guo, Xin, et al.
Published: (2026)

Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks
by: Gonzalez-Martinez, F. D., et al.
Published: (2024)

Forward Convolutive Prediction for Frame Online Monaural Speech Dereverberation Based on Kronecker Product Decomposition
by: Zhu, Yujie, et al.
Published: (2025)

Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement
by: Zheng, Tianqin, et al.
Published: (2025)

Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?
by: Hiroe, Atsuo, et al.
Published: (2024)

Direction-of-Arrival and Noise Covariance Matrix joint estimation for beamforming
by: Curtarelli, Vitor Gelsleichter Probst, et al.
Published: (2025)

Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
by: Yue, Haobo, et al.
Published: (2024)

Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training
by: Feng, Jianyuan, et al.
Published: (2025)

A Directional-Derivative-Constrained Method for Continuously Steerable Differential Beamformers with Uniform Circular Arrays
by: Xiong, Tiantian, et al.
Published: (2026)

IPDnet2: an efficient and improved inter-channel phase difference estimation network for sound source localization
by: Wang, Yabo, et al.
Published: (2025)

Frequency-aware convolution for sound event detection
by: Song, Tao, et al.
Published: (2024)

Advances in Microphone Array Processing and Multichannel Speech Enhancement
by: Huang, Gongping, et al.
Published: (2025)

Rethinking the joint estimation of magnitude and phase for time-frequency domain neural vocoders
by: Dai, Lingling, et al.
Published: (2025)

What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution
by: Ho, Kuan-Hsun, et al.
Published: (2024)

Towards detecting the pathological subharmonic voicing with fully convolutional neural networks
by: Ikuma, Takeshi, et al.
Published: (2025)

Omni-directional attention mechanism based on Mamba for speech separation
by: Xue, Ke, et al.
Published: (2026)

Advancing Continual Learning for Robust Deepfake Audio Classification
by: Dong, Feiyi, et al.
Published: (2024)

Singer separation for karaoke content generation
by: Lin, Hsuan-Yu, et al.
Published: (2021)

Multispecies bird sound recognition using a fully convolutional neural network
by: García-Ordás, María Teresa, et al.
Published: (2024)