:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Saijo, Kohei, Bando, Yoshiaki
Format:	Preprint
Published:	2026
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2602.08671
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation
by: Saijo, Kohei, et al.
Published: (2025)

A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
by: Saijo, Kohei, et al.
Published: (2025)

Neural Blind Source Separation and Diarization for Distant Speech Recognition
by: Bando, Yoshiaki, et al.
Published: (2024)

Task-Aware Unified Source Separation
by: Saijo, Kohei, et al.
Published: (2024)

Subspace Track-before-Detect for Passive Multi-Target Tracking with Unknown Emitted Signals
by: Ito, Nobutaka, et al.
Published: (2026)

Infrastructure-less Localization from Indoor Environmental Sounds Based on Spectral Decomposition and Spatial Likelihood Model
by: Ogiso, Satoki, et al.
Published: (2024)

FasTUSS: Faster Task-Aware Unified Source Separation
by: Paissan, Francesco, et al.
Published: (2025)

TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
by: Saijo, Kohei, et al.
Published: (2024)

Toward Universal Speech Enhancement for Diverse Input Conditions
by: Zhang, Wangyou, et al.
Published: (2023)

Enhanced Reverberation as Supervision for Unsupervised Speech Separation
by: Saijo, Kohei, et al.
Published: (2024)

LEAD Dataset: How Can Labels for Sound Event Detection Vary Depending on Annotators?
by: Koga, Naoki, et al.
Published: (2024)

SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis
by: Take, Osamu, et al.
Published: (2024)

SCNet: Sparse Compression Network for Music Source Separation
by: Tong, Weinan, et al.
Published: (2024)

FlexIO: Flexible Single- and Multi-Channel Speech Separation and Enhancement
by: Masuyama, Yoshiki, et al.
Published: (2025)

Subband Splitting: Simple, Efficient and Effective Technique for Solving Block Permutation Problem in Determined Blind Source Separation
by: Matsumoto, Kazuki, et al.
Published: (2024)

Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
by: Zhang, Wangyou, et al.
Published: (2024)

Leveraging Audio-Only Data for Text-Queried Target Sound Extraction
by: Saijo, Kohei, et al.
Published: (2024)

Inference-Adaptive Neural Steering for Real-Time Area-Based Sound Source Separation
by: Strauss, Martin, et al.
Published: (2024)

Determined Multichannel Blind Source Separation with Clustered Source Model
by: Wang, Jianyu, et al.
Published: (2024)

Discrete Token Modeling for Multi-Stem Music Source Separation with Language Models
by: Lyu, Pengbo, et al.
Published: (2026)

Pre-training Music Classification Models via Music Source Separation
by: Garoufis, Christos, et al.
Published: (2023)

Geneses: Unified Generative Speech Enhancement and Separation
by: Asai, Kohei, et al.
Published: (2026)

Source Separation by Flow Matching
by: Scheibler, Robin, et al.
Published: (2025)

P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge
by: Sach, Marvin, et al.
Published: (2025)

Interspeech 2025 URGENT Speech Enhancement Challenge
by: Saijo, Kohei, et al.
Published: (2025)

On Ambisonic Source Separation with Spatially Informed Non-negative Tensor Factorization
by: Guzik, Mateusz, et al.
Published: (2025)

SELEBI: Percussion-aware Time Stretching via Selective Magnitude Spectrogram Compression by Nonstationary Gabor Transform
by: Akaishi, Natsuki, et al.
Published: (2026)

ICASSP 2026 URGENT Speech Enhancement Challenge
by: Li, Chenda, et al.
Published: (2026)

URGENT-PK: Perceptually-Aligned Ranking Model Designed for Speech Enhancement Competition
by: Wang, Jiahe, et al.
Published: (2025)

Towards Blind Data Cleaning: A Case Study in Music Source Separation
by: Gui, Azalea, et al.
Published: (2025)

Musical Source Separation Bake-Off: Comparing Objective Metrics with Human Perception
by: Jaffe, Noah, et al.
Published: (2025)

Musical Source Separation of Brazilian Percussion
by: Namballa, Richa, et al.
Published: (2025)

UniArray: Unified Spectral-Spatial Modeling for Array-Geometry-Agnostic Speech Separation
by: Chen, Weiguang, et al.
Published: (2025)

Leveraging Sound Source Trajectories for Universal Sound Separation
by: Wu, Donghang, et al.
Published: (2024)

MAPSS: Manifold-based Assessment of Perceptual Source Separation
by: Ivry, Amir, et al.
Published: (2025)

Efficient Area-based and Speaker-Agnostic Source Separation
by: Strauss, Martin, et al.
Published: (2024)

Improving Music Source Separation with Diffusion and Consistency Refinement
by: Karchkhadze, Tornike, et al.
Published: (2024)

Determined Blind Source Separation with Sinkhorn Divergence-based Optimal Allocation of the Source Power
by: Wang, Jianyu, et al.
Published: (2025)

Moving Speaker Separation via Parallel Spectral-Spatial Processing
by: Wang, Yuzhu, et al.
Published: (2026)

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection
by: Fujita, Yoto, et al.
Published: (2024)