:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zang, Yongyi, Kong, Qiuqiang
Format:	Preprint
Published:	2025
Subjects:	Sound
Online Access:	https://arxiv.org/abs/2503.17866
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Training-Free Multi-Step Audio Source Separation
by: Zang, Yongyi, et al.
Published: (2025)

Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
by: Li, Dichucheng, et al.
Published: (2025)

Ambisonizer: Neural Upmixing as Spherical Harmonics Generation
by: Zang, Yongyi, et al.
Published: (2024)

Music Source Restoration
by: Zang, Yongyi, et al.
Published: (2025)

HiFi-HARP: A High-Fidelity 7th-Order Ambisonic Room Impulse Response Dataset
by: Saini, Shivam, et al.
Published: (2025)

Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling
by: Masuyama, Yoshiki, et al.
Published: (2026)

Voices of Civilizations: A Multilingual QA Benchmark for Global Music Understanding
by: Wu, Shangda, et al.
Published: (2026)

MSRBench: A Benchmarking Dataset for Music Source Restoration
by: Zang, Yongyi, et al.
Published: (2025)

HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset
by: Saini, Shivam, et al.
Published: (2024)

PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching
by: Vosoughi, Ali, et al.
Published: (2025)

Summary of The Inaugural Music Source Restoration Challenge
by: Zang, Yongyi, et al.
Published: (2026)

Direction-Aware Neural Acoustic Fields for Few-Shot Interpolation of Ambisonic Impulse Responses
by: Ick, Christopher, et al.
Published: (2025)

SHroom: A Python Framework for Ambisonics Room Acoustics Simulation and Binaural Rendering
by: Gayer, Yhonatan
Published: (2026)

FlowSynth: Instrument Generation Through Distributional Flow Matching and Test-Time Search
by: Yang, Qihui, et al.
Published: (2025)

Efficient Vocal Source Separation Through Windowed Sink Attention
by: Benetatos, Christodoulos, et al.
Published: (2025)

The Interpretation Gap in Text-to-Music Generation Models
by: Zang, Yongyi, et al.
Published: (2024)

Direct and Residual Subspace Decomposition of Spatial Room Impulse Responses
by: Deppisch, Thomas, et al.
Published: (2022)

MusicScore: A Dataset for Music Score Modeling and Generation
by: Lin, Yuheng, et al.
Published: (2024)

FOA Tokenizer: Low-bitrate Neural Codec for First Order Ambisonics with Spatial Consistency Loss
by: Sudarsanam, Parthasaarathy, et al.
Published: (2025)

Accelerated Interactive Auralization of Highly Reverberant Spaces using Graphics Hardware
by: Rosseel, Hannes, et al.
Published: (2025)

Room Impulse Responses help attackers to evade Deep Fake Detection
by: Luong, Hieu-Thi, et al.
Published: (2024)

AuralNet: Hierarchical Attention-based 3D Binaural Localization of Overlapping Speakers
by: Fu, Linya, et al.
Published: (2025)

Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders
by: Paek, Nathan, et al.
Published: (2025)

Residual Learning for Neural Ambisonics Encoders
by: Deppisch, Thomas, et al.
Published: (2026)

Perceptually Transparent Binaural Auralization of Simulated Sound Fields
by: Ahrens, Jens
Published: (2024)

Region-Specific Audio Tagging for Spatial Sound
by: Zhao, Jinzheng, et al.
Published: (2025)

Ambisonics Networks -- The Effect Of Radial Functions Regularization
by: Shaybet, Bar, et al.
Published: (2024)

Blind Spatial Impulse Response Generation from Separate Room- and Scene-Specific Information
by: Lluís, Francesc, et al.
Published: (2024)

Room Impulse Response Synthesis via Differentiable Feedback Delay Networks for Efficient Spatial Audio Rendering
by: Gerami, Armin, et al.
Published: (2025)

Neural Ambisonics encoding for compact irregular microphone arrays
by: Heikkinen, Mikko, et al.
Published: (2024)

FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pretraining
by: Li, Xiquan, et al.
Published: (2026)

On the Usefulness of Diffusion-Based Room Impulse Response Interpolation to Microphone Array Processing
by: Della Torre, Sagi, et al.
Published: (2026)

SingFake: Singing Voice Deepfake Detection
by: Zang, Yongyi, et al.
Published: (2023)

Spatial Analysis and Synthesis Methods: Subjective and Objective Evaluations Using Various Microphone Arrays in the Auralization of a Critical Listening Room
by: Pawlak, Alan, et al.
Published: (2024)

Sensitivity of Room Impulse Responses in Changing Acoustic Environment
by: Prawda, Karolina
Published: (2025)

Room Impulse Response Generation Conditioned on Acoustic Parameters
by: Arellano, Silvia, et al.
Published: (2025)

Acoustic Volume Rendering for Neural Impulse Response Fields
by: Lan, Zitong, et al.
Published: (2024)

Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks
by: Zang, Yongyi, et al.
Published: (2025)

DiffAU: Diffusion-Based Ambisonics Upscaling
by: Milstein, Amit, et al.
Published: (2025)

Ambisonics Super-Resolution Using A Waveform-Domain Neural Network
by: Nawfal, Ismael, et al.
Published: (2025)