| Main Authors: | Zang, Yongyi, Kong, Qiuqiang |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2503.17866 |
Similar Items
Training-Free Multi-Step Audio Source Separation
by: Zang, Yongyi, et al.
Published: (2025)
Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
by: Li, Dichucheng, et al.
Published: (2025)
Ambisonizer: Neural Upmixing as Spherical Harmonics Generation
by: Zang, Yongyi, et al.
Published: (2024)
Music Source Restoration
by: Zang, Yongyi, et al.
Published: (2025)
HiFi-HARP: A High-Fidelity 7th-Order Ambisonic Room Impulse Response Dataset
by: Saini, Shivam, et al.
Published: (2025)
Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling
by: Masuyama, Yoshiki, et al.
Published: (2026)
Voices of Civilizations: A Multilingual QA Benchmark for Global Music Understanding
by: Wu, Shangda, et al.
Published: (2026)
MSRBench: A Benchmarking Dataset for Music Source Restoration
by: Zang, Yongyi, et al.
Published: (2025)
HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset
by: Saini, Shivam, et al.
Published: (2024)
PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching
by: Vosoughi, Ali, et al.
Published: (2025)
Summary of The Inaugural Music Source Restoration Challenge
by: Zang, Yongyi, et al.
Published: (2026)
Direction-Aware Neural Acoustic Fields for Few-Shot Interpolation of Ambisonic Impulse Responses
by: Ick, Christopher, et al.
Published: (2025)
SHroom: A Python Framework for Ambisonics Room Acoustics Simulation and Binaural Rendering
by: Gayer, Yhonatan
Published: (2026)
FlowSynth: Instrument Generation Through Distributional Flow Matching and Test-Time Search
by: Yang, Qihui, et al.
Published: (2025)
Efficient Vocal Source Separation Through Windowed Sink Attention
by: Benetatos, Christodoulos, et al.
Published: (2025)
The Interpretation Gap in Text-to-Music Generation Models
by: Zang, Yongyi, et al.
Published: (2024)
Direct and Residual Subspace Decomposition of Spatial Room Impulse Responses
by: Deppisch, Thomas, et al.
Published: (2022)
MusicScore: A Dataset for Music Score Modeling and Generation
by: Lin, Yuheng, et al.
Published: (2024)
FOA Tokenizer: Low-bitrate Neural Codec for First Order Ambisonics with Spatial Consistency Loss
by: Sudarsanam, Parthasaarathy, et al.
Published: (2025)
Accelerated Interactive Auralization of Highly Reverberant Spaces using Graphics Hardware
by: Rosseel, Hannes, et al.
Published: (2025)
Room Impulse Responses help attackers to evade Deep Fake Detection
by: Luong, Hieu-Thi, et al.
Published: (2024)
AuralNet: Hierarchical Attention-based 3D Binaural Localization of Overlapping Speakers
by: Fu, Linya, et al.
Published: (2025)
Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders
by: Paek, Nathan, et al.
Published: (2025)
Residual Learning for Neural Ambisonics Encoders
by: Deppisch, Thomas, et al.
Published: (2026)
Perceptually Transparent Binaural Auralization of Simulated Sound Fields
by: Ahrens, Jens
Published: (2024)
Region-Specific Audio Tagging for Spatial Sound
by: Zhao, Jinzheng, et al.
Published: (2025)
Ambisonics Networks -- The Effect Of Radial Functions Regularization
by: Shaybet, Bar, et al.
Published: (2024)
Blind Spatial Impulse Response Generation from Separate Room- and Scene-Specific Information
by: Lluís, Francesc, et al.
Published: (2024)
Room Impulse Response Synthesis via Differentiable Feedback Delay Networks for Efficient Spatial Audio Rendering
by: Gerami, Armin, et al.
Published: (2025)
Neural Ambisonics encoding for compact irregular microphone arrays
by: Heikkinen, Mikko, et al.
Published: (2024)
FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pretraining
by: Li, Xiquan, et al.
Published: (2026)
On the Usefulness of Diffusion-Based Room Impulse Response Interpolation to Microphone Array Processing
by: Della Torre, Sagi, et al.
Published: (2026)
SingFake: Singing Voice Deepfake Detection
by: Zang, Yongyi, et al.
Published: (2023)
Spatial Analysis and Synthesis Methods: Subjective and Objective Evaluations Using Various Microphone Arrays in the Auralization of a Critical Listening Room
by: Pawlak, Alan, et al.
Published: (2024)
Sensitivity of Room Impulse Responses in Changing Acoustic Environment
by: Prawda, Karolina
Published: (2025)
Room Impulse Response Generation Conditioned on Acoustic Parameters
by: Arellano, Silvia, et al.
Published: (2025)
Acoustic Volume Rendering for Neural Impulse Response Fields
by: Lan, Zitong, et al.
Published: (2024)
Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks
by: Zang, Yongyi, et al.
Published: (2025)
DiffAU: Diffusion-Based Ambisonics Upscaling
by: Milstein, Amit, et al.
Published: (2025)
Ambisonics Super-Resolution Using A Waveform-Domain Neural Network
by: Nawfal, Ismael, et al.
Published: (2025)