Saved in:
Bibliographic Details
Main Authors: Gode, Henri, Doclo, Simon
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.21114
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914289180213248
author Gode, Henri
Doclo, Simon
author_facet Gode, Henri
Doclo, Simon
contents The number of active sound sources is a key parameter in many acoustic signal processing tasks, such as source localization, source separation, and multi-microphone speech enhancement. This paper proposes a novel method for online source counting by detecting changes in the number of active sources based on spatial coherence. The proposed method exploits the fact that a single coherent source in spatially white background noise yields high spatial coherence, whereas only noise results in low spatial coherence. By applying a spatial whitening operation, the source counting problem is reformulated as a change detection task, aiming to identify the time frames when the number of active sources changes. The method leverages the generalized magnitude-squared coherence as a measure to quantify spatial coherence, providing features for a compact neural network trained to detect source count changes framewise. Simulation results with binaural hearing aids in reverberant acoustic scenes with up to 4 speakers and background noise demonstrate the effectiveness of the proposed method for online source counting.
format Preprint
id arxiv_https___arxiv_org_abs_2601_21114
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle DNN-Based Online Source Counting Based on Spatial Generalized Magnitude Squared Coherence
Gode, Henri
Doclo, Simon
Audio and Speech Processing
Sound
The number of active sound sources is a key parameter in many acoustic signal processing tasks, such as source localization, source separation, and multi-microphone speech enhancement. This paper proposes a novel method for online source counting by detecting changes in the number of active sources based on spatial coherence. The proposed method exploits the fact that a single coherent source in spatially white background noise yields high spatial coherence, whereas only noise results in low spatial coherence. By applying a spatial whitening operation, the source counting problem is reformulated as a change detection task, aiming to identify the time frames when the number of active sources changes. The method leverages the generalized magnitude-squared coherence as a measure to quantify spatial coherence, providing features for a compact neural network trained to detect source count changes framewise. Simulation results with binaural hearing aids in reverberant acoustic scenes with up to 4 speakers and background noise demonstrate the effectiveness of the proposed method for online source counting.
title DNN-Based Online Source Counting Based on Spatial Generalized Magnitude Squared Coherence
topic Audio and Speech Processing
Sound
url https://arxiv.org/abs/2601.21114