Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sagasti, Amaia, Scaini, Davide, Arteaga, Daniel
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2405.04471
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866914857729654784
author Sagasti, Amaia
Scaini, Davide
Arteaga, Daniel
author_facet Sagasti, Amaia
Scaini, Davide
Arteaga, Daniel
contents This paper addresses the challenges associated with both the conversion between different spatial audio formats and the decoding of a spatial audio format to a specific loudspeaker layout. Existing approaches often rely on layout remapping tools, which may not guarantee optimal conversion from a psychoacoustic perspective. To overcome these challenges, we present the Universal Spatial Audio Transcoder (USAT) method and its corresponding open source implementation. USAT generates an optimal decoder or transcoder for any input spatial audio format, adapting it to any output format or 2D/3D loudspeaker configuration. Drawing upon optimization techniques based on psychoacoustic principles, the algorithm maximizes the preservation of spatial information. We present examples of the decoding and transcoding of several audio formats, and show that USAT approach is advantageous compared to the most common methods in the field.
format Preprint
id arxiv_https___arxiv_org_abs_2405_04471
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Universal Spatial Audio Transcoder
Sagasti, Amaia
Scaini, Davide
Arteaga, Daniel
Sound
Audio and Speech Processing
This paper addresses the challenges associated with both the conversion between different spatial audio formats and the decoding of a spatial audio format to a specific loudspeaker layout. Existing approaches often rely on layout remapping tools, which may not guarantee optimal conversion from a psychoacoustic perspective. To overcome these challenges, we present the Universal Spatial Audio Transcoder (USAT) method and its corresponding open source implementation. USAT generates an optimal decoder or transcoder for any input spatial audio format, adapting it to any output format or 2D/3D loudspeaker configuration. Drawing upon optimization techniques based on psychoacoustic principles, the algorithm maximizes the preservation of spatial information. We present examples of the decoding and transcoding of several audio formats, and show that USAT approach is advantageous compared to the most common methods in the field.
title Universal Spatial Audio Transcoder
topic Sound
Audio and Speech Processing
url https://arxiv.org/abs/2405.04471