:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guzik, Mateusz, Kowalczyk, Konrad
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2501.10305
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Clustering-based hard negative sampling for supervised contrastive speaker verification
by: Masztalski, Piotr, et al.
Published: (2025)

Deep learning based spatial aliasing reduction in beamforming for audio capture
by: Guzik, Mateusz, et al.
Published: (2025)

HeightCeleb - an enrichment of VoxCeleb dataset with speaker height information
by: Kacprzak, Stanisław, et al.
Published: (2024)

Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio
by: Barański, Mateusz, et al.
Published: (2025)

Perceptually-motivated Spatial Audio Codec for Higher-Order Ambisonics Compression
by: Hold, Christoph, et al.
Published: (2024)

Evaluation of Spherical Wavelet Framework in Comparsion with Ambisonics
by: Ekmen, Ş., et al.
Published: (2025)

Introduction to Ambisonics, Part 1: The Part With No Math
by: Ahrens, Jens
Published: (2025)

Residual Learning for Neural Ambisonics Encoders
by: Deppisch, Thomas, et al.
Published: (2026)

Spatial-Temporal Activity-Informed Diarization and Separation
by: Hsu, Yicheng, et al.
Published: (2024)

Ambisonizer: Neural Upmixing as Spherical Harmonics Generation
by: Zang, Yongyi, et al.
Published: (2024)

Ambisonics Networks -- The Effect Of Radial Functions Regularization
by: Shaybet, Bar, et al.
Published: (2024)

Ambisonics Encoder for Wearable Array with Improved Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2025)

Neural Ambisonics encoding for compact irregular microphone arrays
by: Heikkinen, Mikko, et al.
Published: (2024)

DiffAU: Diffusion-Based Ambisonics Upscaling
by: Milstein, Amit, et al.
Published: (2025)

Dynamic Real-Time Ambisonics Order Adaptation for Immersive Networked Music Performances
by: Ostan, Paolo, et al.
Published: (2025)

Perceptual implications of simplifying geometrical acoustics models for Ambisonics-based binaural reverberation
by: Martin, Vincent, et al.
Published: (2024)

Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2024)

Ambisonics Binaural Rendering via Masked Magnitude Least Squares
by: Berebi, Or, et al.
Published: (2025)

A first-order DirAC-based parametric Ambisonic coder for immersive communications
by: Fuchs, Guillaume, et al.
Published: (2025)

Neural Ambisonic Encoding For Multi-Speaker Scenarios Using A Circular Microphone Array
by: Qiao, Yue, et al.
Published: (2024)

Perceptual Compensation of Ambisonics Recordings for Reproduction in Room
by: Fallah, Ali, et al.
Published: (2025)

AmbiDrop: Array-Agnostic Speech Enhancement Using Ambisonics Encoding and Dropout-Based Learning
by: Tatarjitzky, Michael, et al.
Published: (2025)

Beyond Omnidirectional: Neural Ambisonics Encoding for Arbitrary Microphone Directivity Patterns using Cross-Attention
by: Heikkinen, Mikko, et al.
Published: (2026)

Ambisonics Super-Resolution Using A Waveform-Domain Neural Network
by: Nawfal, Ismael, et al.
Published: (2025)

SHroom: A Python Framework for Ambisonics Room Acoustics Simulation and Binaural Rendering
by: Gayer, Yhonatan
Published: (2026)

Array-Aware Ambisonics and HRTF Encoding for Binaural Reproduction With Wearable Arrays
by: Gayer, Yhonatan, et al.
Published: (2025)

Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling
by: Masuyama, Yoshiki, et al.
Published: (2026)

Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?
by: Namballa, Richa, et al.
Published: (2025)

Fast Multichannel NMF with Block-Diagonal Spatial Covariance Matrices for Efficient Blind Source Separation Using Distributed Microphone Arrays
by: Nishikori, Hirotaka, et al.
Published: (2026)

Source Separation by Flow Matching
by: Scheibler, Robin, et al.
Published: (2025)

SCNet: Sparse Compression Network for Music Source Separation
by: Tong, Weinan, et al.
Published: (2024)

Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features
by: Meng, Hanyu, et al.
Published: (2024)

DnR-nonverbal: Cinematic Audio Source Separation Dataset Containing Non-Verbal Sounds
by: Hasumi, Takuya, et al.
Published: (2025)

Determined Multichannel Blind Source Separation with Clustered Source Model
by: Wang, Jianyu, et al.
Published: (2024)

Task-Aware Unified Source Separation
by: Saijo, Kohei, et al.
Published: (2024)

Compression of Higher Order Ambisonics with Multichannel RVQGAN
by: Hirvonen, Toni, et al.
Published: (2024)

Brain-Informed Speech Separation for Cochlear Implants
by: Gajecki, Tom, et al.
Published: (2026)

Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation
by: Saijo, Kohei, et al.
Published: (2025)

Pre-training Music Classification Models via Music Source Separation
by: Garoufis, Christos, et al.
Published: (2023)

Input-Adaptive Spectral Feature Compression by Sequence Modeling for Source Separation
by: Saijo, Kohei, et al.
Published: (2026)