Bibliographic Details
Main Authors: Ostan, Paolo, Centofanti, Carlo, Pezzoli, Mirco, Bernardini, Alberto, Rinaldi, Claudia, Antonacci, Fabio
Format: Preprint
Published: 2025
Subjects: Audio and Speech Processing
Online Access: https://arxiv.org/abs/2508.00509
author Ostan, Paolo
Centofanti, Carlo
Pezzoli, Mirco
Bernardini, Alberto
Rinaldi, Claudia
Antonacci, Fabio
contents Advanced remote applications such as Networked Music Performance (NMP) require solutions to guarantee immersive real-world-like interaction among users. Therefore, the adoption of spatial audio formats, such as Ambisonics, is fundamental to let the user experience an immersive acoustic scene. The accuracy of the sound scene reproduction increases with the order of the Ambisonics encoding, resulting in improved immersion at the cost of a greater number of audio channels, which in turn increases both bandwidth requirements and susceptibility to network impairments (e.g., latency, jitter, and packet loss). These factors pose a significant challenge for interactive music sessions, which demand high spatial fidelity and low end-to-end delay. We propose a real-time adaptive higher-order Ambisonics strategy that continuously monitors network throughput and dynamically scales the Ambisonics order. When available bandwidth drops below a preset threshold, the order is lowered to prevent audio dropouts; it then reverts to higher orders once conditions recover, thus balancing immersion and reliability. A MUSHRA-based evaluation indicates that this adaptive approach shows promise for guaranteeing the user experience in bandwidth-limited NMP scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2508_00509
institution arXiv
publishDate 2025
record_format arxiv
title Dynamic Real-Time Ambisonics Order Adaptation for Immersive Networked Music Performances
topic Audio and Speech Processing
url https://arxiv.org/abs/2508.00509
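
The abstract describes lowering the Ambisonics order when throughput falls below a threshold and raising it again once conditions recover. The following is a minimal sketch of that idea, not the authors' implementation. It relies on the standard fact that a full-sphere Ambisonics signal of order N has (N + 1)^2 channels. Everything else is an assumption made for illustration: uncompressed PCM at 48 kHz and 16 bits, a maximum order of 3, a headroom factor of 0.8, and the hypothetical function names.

```python
def ambisonics_channels(order: int) -> int:
    """Channel count of a full-sphere Ambisonics signal of the given order."""
    if order < 0:
        raise ValueError("order must be non-negative")
    return (order + 1) ** 2


def required_kbps(order: int, sample_rate_hz: int = 48_000, bit_depth: int = 16) -> float:
    """Bitrate of uncompressed PCM for this order (assumed codec-free stream)."""
    return ambisonics_channels(order) * sample_rate_hz * bit_depth / 1000


def select_order(throughput_kbps: float, max_order: int = 3, headroom: float = 0.8) -> int:
    """Pick the highest order whose bitrate fits in the measured throughput.

    headroom is an assumed safety margin, so that only a fraction of the
    measured throughput is treated as usable budget.
    """
    budget_kbps = throughput_kbps * headroom
    for order in range(max_order, -1, -1):
        if required_kbps(order) <= budget_kbps:
            return order
    return 0  # fall back to the omnidirectional channel only


if __name__ == "__main__":
    # Simulated throughput measurements in kbit/s (illustrative values only).
    for measured in (20_000, 10_000, 4_000, 10_000, 20_000):
        print(measured, "kbit/s -> order", select_order(measured))
```

A real system would likely smooth the throughput estimate and add hysteresis so the order does not oscillate around a threshold; this sketch omits both for brevity.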