Bibliographic Details
Main Authors: Lan, Zitong; Zheng, Chenhao; Zheng, Zhiwei; Zhao, Mingmin
Format: Preprint
Published: 2024
Subjects:
Online Access: https://arxiv.org/abs/2411.06307
contents Realistic audio synthesis that captures accurate acoustic phenomena is essential for creating immersive experiences in virtual and augmented reality. Synthesizing the sound received at any position relies on estimating the impulse response (IR), which characterizes how sound propagates through a scene along different paths before arriving at the listener's position. In this paper, we present Acoustic Volume Rendering (AVR), a novel approach that adapts volume rendering techniques to model acoustic impulse responses. While volume rendering has been successful in modeling radiance fields for images and neural scene representations, IRs present unique challenges as time-series signals. To address these challenges, we introduce frequency-domain volume rendering and use spherical integration to fit the IR measurements. Our method constructs an impulse response field that inherently encodes wave propagation principles and achieves state-of-the-art performance in synthesizing impulse responses for novel poses. Experiments show that AVR surpasses current leading methods by a substantial margin. Additionally, we develop an acoustic simulation platform, AcoustiX, which provides more accurate and realistic IR simulations than existing simulators. Code for AVR and AcoustiX is available at https://zitonglan.github.io/avr.
id arxiv_https___arxiv_org_abs_2411_06307
institution arXiv
record_format arxiv
title Acoustic Volume Rendering for Neural Impulse Response Fields
topic Sound
Audio and Speech Processing