Saved in:
| Main Authors: | Guzik, Mateusz, Kowalczyk, Konrad |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.10305 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Clustering-based hard negative sampling for supervised contrastive speaker verification
by: Masztalski, Piotr, et al.
Published: (2025)
by: Masztalski, Piotr, et al.
Published: (2025)
Deep learning based spatial aliasing reduction in beamforming for audio capture
by: Guzik, Mateusz, et al.
Published: (2025)
by: Guzik, Mateusz, et al.
Published: (2025)
HeightCeleb - an enrichment of VoxCeleb dataset with speaker height information
by: Kacprzak, Stanisław, et al.
Published: (2024)
by: Kacprzak, Stanisław, et al.
Published: (2024)
Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio
by: Barański, Mateusz, et al.
Published: (2025)
by: Barański, Mateusz, et al.
Published: (2025)
Perceptually-motivated Spatial Audio Codec for Higher-Order Ambisonics Compression
by: Hold, Christoph, et al.
Published: (2024)
by: Hold, Christoph, et al.
Published: (2024)
Evaluation of Spherical Wavelet Framework in Comparsion with Ambisonics
by: Ekmen, Ş., et al.
Published: (2025)
by: Ekmen, Ş., et al.
Published: (2025)
Introduction to Ambisonics, Part 1: The Part With No Math
by: Ahrens, Jens
Published: (2025)
by: Ahrens, Jens
Published: (2025)
Residual Learning for Neural Ambisonics Encoders
by: Deppisch, Thomas, et al.
Published: (2026)
by: Deppisch, Thomas, et al.
Published: (2026)
Spatial-Temporal Activity-Informed Diarization and Separation
by: Hsu, Yicheng, et al.
Published: (2024)
by: Hsu, Yicheng, et al.
Published: (2024)
Ambisonizer: Neural Upmixing as Spherical Harmonics Generation
by: Zang, Yongyi, et al.
Published: (2024)
by: Zang, Yongyi, et al.
Published: (2024)
Ambisonics Networks -- The Effect Of Radial Functions Regularization
by: Shaybet, Bar, et al.
Published: (2024)
by: Shaybet, Bar, et al.
Published: (2024)
Ambisonics Encoder for Wearable Array with Improved Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2025)
by: Gayer, Yhonatan, et al.
Published: (2025)
Neural Ambisonics encoding for compact irregular microphone arrays
by: Heikkinen, Mikko, et al.
Published: (2024)
by: Heikkinen, Mikko, et al.
Published: (2024)
DiffAU: Diffusion-Based Ambisonics Upscaling
by: Milstein, Amit, et al.
Published: (2025)
by: Milstein, Amit, et al.
Published: (2025)
Dynamic Real-Time Ambisonics Order Adaptation for Immersive Networked Music Performances
by: Ostan, Paolo, et al.
Published: (2025)
by: Ostan, Paolo, et al.
Published: (2025)
Perceptual implications of simplifying geometrical acoustics models for Ambisonics-based binaural reverberation
by: Martin, Vincent, et al.
Published: (2024)
by: Martin, Vincent, et al.
Published: (2024)
Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2024)
by: Gayer, Yhonatan, et al.
Published: (2024)
Ambisonics Binaural Rendering via Masked Magnitude Least Squares
by: Berebi, Or, et al.
Published: (2025)
by: Berebi, Or, et al.
Published: (2025)
A first-order DirAC-based parametric Ambisonic coder for immersive communications
by: Fuchs, Guillaume, et al.
Published: (2025)
by: Fuchs, Guillaume, et al.
Published: (2025)
Neural Ambisonic Encoding For Multi-Speaker Scenarios Using A Circular Microphone Array
by: Qiao, Yue, et al.
Published: (2024)
by: Qiao, Yue, et al.
Published: (2024)
Perceptual Compensation of Ambisonics Recordings for Reproduction in Room
by: Fallah, Ali, et al.
Published: (2025)
by: Fallah, Ali, et al.
Published: (2025)
AmbiDrop: Array-Agnostic Speech Enhancement Using Ambisonics Encoding and Dropout-Based Learning
by: Tatarjitzky, Michael, et al.
Published: (2025)
by: Tatarjitzky, Michael, et al.
Published: (2025)
Beyond Omnidirectional: Neural Ambisonics Encoding for Arbitrary Microphone Directivity Patterns using Cross-Attention
by: Heikkinen, Mikko, et al.
Published: (2026)
by: Heikkinen, Mikko, et al.
Published: (2026)
Ambisonics Super-Resolution Using A Waveform-Domain Neural Network
by: Nawfal, Ismael, et al.
Published: (2025)
by: Nawfal, Ismael, et al.
Published: (2025)
SHroom: A Python Framework for Ambisonics Room Acoustics Simulation and Binaural Rendering
by: Gayer, Yhonatan
Published: (2026)
by: Gayer, Yhonatan
Published: (2026)
Array-Aware Ambisonics and HRTF Encoding for Binaural Reproduction With Wearable Arrays
by: Gayer, Yhonatan, et al.
Published: (2025)
by: Gayer, Yhonatan, et al.
Published: (2025)
Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling
by: Masuyama, Yoshiki, et al.
Published: (2026)
by: Masuyama, Yoshiki, et al.
Published: (2026)
Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?
by: Namballa, Richa, et al.
Published: (2025)
by: Namballa, Richa, et al.
Published: (2025)
Fast Multichannel NMF with Block-Diagonal Spatial Covariance Matrices for Efficient Blind Source Separation Using Distributed Microphone Arrays
by: Nishikori, Hirotaka, et al.
Published: (2026)
by: Nishikori, Hirotaka, et al.
Published: (2026)
Source Separation by Flow Matching
by: Scheibler, Robin, et al.
Published: (2025)
by: Scheibler, Robin, et al.
Published: (2025)
SCNet: Sparse Compression Network for Music Source Separation
by: Tong, Weinan, et al.
Published: (2024)
by: Tong, Weinan, et al.
Published: (2024)
Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features
by: Meng, Hanyu, et al.
Published: (2024)
by: Meng, Hanyu, et al.
Published: (2024)
DnR-nonverbal: Cinematic Audio Source Separation Dataset Containing Non-Verbal Sounds
by: Hasumi, Takuya, et al.
Published: (2025)
by: Hasumi, Takuya, et al.
Published: (2025)
Determined Multichannel Blind Source Separation with Clustered Source Model
by: Wang, Jianyu, et al.
Published: (2024)
by: Wang, Jianyu, et al.
Published: (2024)
Task-Aware Unified Source Separation
by: Saijo, Kohei, et al.
Published: (2024)
by: Saijo, Kohei, et al.
Published: (2024)
Compression of Higher Order Ambisonics with Multichannel RVQGAN
by: Hirvonen, Toni, et al.
Published: (2024)
by: Hirvonen, Toni, et al.
Published: (2024)
Brain-Informed Speech Separation for Cochlear Implants
by: Gajecki, Tom, et al.
Published: (2026)
by: Gajecki, Tom, et al.
Published: (2026)
Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation
by: Saijo, Kohei, et al.
Published: (2025)
by: Saijo, Kohei, et al.
Published: (2025)
Pre-training Music Classification Models via Music Source Separation
by: Garoufis, Christos, et al.
Published: (2023)
by: Garoufis, Christos, et al.
Published: (2023)
Input-Adaptive Spectral Feature Compression by Sequence Modeling for Source Separation
by: Saijo, Kohei, et al.
Published: (2026)
by: Saijo, Kohei, et al.
Published: (2026)
Similar Items
-
Clustering-based hard negative sampling for supervised contrastive speaker verification
by: Masztalski, Piotr, et al.
Published: (2025) -
Deep learning based spatial aliasing reduction in beamforming for audio capture
by: Guzik, Mateusz, et al.
Published: (2025) -
HeightCeleb - an enrichment of VoxCeleb dataset with speaker height information
by: Kacprzak, Stanisław, et al.
Published: (2024) -
Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio
by: Barański, Mateusz, et al.
Published: (2025) -
Perceptually-motivated Spatial Audio Codec for Higher-Order Ambisonics Compression
by: Hold, Christoph, et al.
Published: (2024)