| Main Authors: | Qiao, Yue, Kothapally, Vinay, Yu, Meng, Yu, Dong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2409.06954 |
Similar Items
Gen-A: Generalizing Ambisonics Neural Encoding to Unseen Microphone Arrays
by: Heikkinen, Mikko, et al.
Published: (2025)
Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2024)
Beyond Omnidirectional: Neural Ambisonics Encoding for Arbitrary Microphone Directivity Patterns using Cross-Attention
by: Heikkinen, Mikko, et al.
Published: (2026)
SpatialCodec: Neural Spatial Speech Coding
by: Xu, Zhongweiyang, et al.
Published: (2023)
Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
by: Wen, Wen, et al.
Published: (2024)
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems
by: Zhang, Hao, et al.
Published: (2025)
Array-Aware Ambisonics and HRTF Encoding for Binaural Reproduction With Wearable Arrays
by: Gayer, Yhonatan, et al.
Published: (2025)
AmbiDrop: Array-Agnostic Speech Enhancement Using Ambisonics Encoding and Dropout-Based Learning
by: Tatarjitzky, Michael, et al.
Published: (2025)
Neural Directional Filtering Using a Compact Microphone Array
by: Huang, Weilong, et al.
Published: (2025)
Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
by: Shao, Yiwen, et al.
Published: (2024)
SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays
by: Shao, Yiwen, et al.
Published: (2026)
GAN-Based Multi-Microphone Spatial Target Speaker Extraction
by: Shetu, Shrishti Saha, et al.
Published: (2025)
Target Speaker Selection for Neural Network Beamforming in Multi-Speaker Scenarios
by: Fiorio, Luan Vinícius, et al.
Published: (2025)
Microphone Occlusion Mitigation for Own-Voice Enhancement in Head-Worn Microphone Arrays Using Switching-Adaptive Beamforming
by: Middelberg, Wiebke, et al.
Published: (2025)
Ambisonics Encoder for Wearable Array with Improved Binaural Reproduction
by: Gayer, Yhonatan, et al.
Published: (2025)
What Does the Speaker Embedding Encode?
by: Wang, Shuai, et al.
Published: (2025)
Your Microphone Array Retains Your Identity: A Robust Voice Liveness Detection System for Smart Speakers
by: Meng, Yan, et al.
Published: (2025)
Ambisonics Super-Resolution Using A Waveform-Domain Neural Network
by: Nawfal, Ismael, et al.
Published: (2025)
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
by: Liu, Shupei, et al.
Published: (2022)
Residual Learning for Neural Ambisonics Encoders
by: Deppisch, Thomas, et al.
Published: (2026)
Advances in Microphone Array Processing and Multichannel Speech Enhancement
by: Huang, Gongping, et al.
Published: (2025)
SonicBoom: Contact Localization Using Array of Microphones
by: Lee, Moonyoung, et al.
Published: (2024)
Ambisonizer: Neural Upmixing as Spherical Harmonics Generation
by: Zang, Yongyi, et al.
Published: (2024)
Linearly Constrained Deep Beamformer for Multi-Speaker Scenarios
by: Zaidel, Ilai, et al.
Published: (2026)
Neural Directional Filtering: Far-Field Directivity Control With a Small Microphone Array
by: Wechsler, Julian, et al.
Published: (2024)
Hierarchical Sparse Sound Field Reconstruction with Spherical and Linear Microphone Arrays
by: Xu, Shunxi, et al.
Published: (2025)
Neural Ambisonics encoding for compact irregular microphone arrays
by: Heikkinen, Mikko, et al.
Published: (2024)
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios
by: Shao, Yiwen, et al.
Published: (2023)
VM-UNSSOR: Unsupervised Neural Speech Separation Enhanced by Higher-SNR Virtual Microphone Arrays
by: He, Shulin, et al.
Published: (2025)
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments
by: Opochinsky, Renana, et al.
Published: (2024)
Applying Automatic Differentiation to Optimize Differential Microphone Array Designs
by: Galougah, Siminfar Samakoush, et al.
Published: (2024)
Asynchronous Microphone Array Calibration using Hybrid TDOA Information
by: Zhang, Chengjie, et al.
Published: (2024)
Blind Localization of Early Room Reflections with Arbitrary Microphone Array
by: Hadadi, Yogev, et al.
Published: (2024)
Impact of Microphone Array Mismatches to Learning-based Replay Speech Detection
by: Neri, Michael, et al.
Published: (2025)
Direction of Arrival Estimation Using Microphone Array Processing for Moving Humanoid Robots
by: Tourbabin, Vladimir, et al.
Published: (2024)
Design and Analysis of Binaural Signal Matching with Arbitrary Microphone Arrays and Listener Head Rotations
by: Madmoni, Lior, et al.
Published: (2024)
Evaluation of Spherical Wavelet Framework in Comparsion with Ambisonics
by: Ekmen, Ş., et al.
Published: (2025)
Introduction to Ambisonics, Part 1: The Part With No Math
by: Ahrens, Jens
Published: (2025)
Microphone Array Signal Processing and Deep Learning for Speech Enhancement
by: Haeb-Umbach, Reinhold, et al.
Published: (2025)
HyBeam: Hybrid Microphone-Beamforming Array-Agnostic Speech Enhancement for Wearables
by: Ilan, Yuval Bar, et al.
Published: (2025)