Staff View :: Library Catalog


Bibliographic Details
Main Authors:	Sagasti, Amaia; Scaini, Davide; Arteaga, Daniel
Format:	Preprint
Published:	2024
Subjects:	Sound; Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2405.04471

_version_	1866914857729654784
author	Sagasti, Amaia; Scaini, Davide; Arteaga, Daniel
author_facet	Sagasti, Amaia; Scaini, Davide; Arteaga, Daniel
contents	This paper addresses the challenges associated with both the conversion between different spatial audio formats and the decoding of a spatial audio format to a specific loudspeaker layout. Existing approaches often rely on layout remapping tools, which may not guarantee optimal conversion from a psychoacoustic perspective. To overcome these challenges, we present the Universal Spatial Audio Transcoder (USAT) method and its corresponding open-source implementation. USAT generates an optimal decoder or transcoder for any input spatial audio format, adapting it to any output format or 2D/3D loudspeaker configuration. Drawing upon optimization techniques based on psychoacoustic principles, the algorithm maximizes the preservation of spatial information. We present examples of the decoding and transcoding of several audio formats, and show that the USAT approach is advantageous compared to the most common methods in the field.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_04471
institution	arXiv
publishDate	2024
record_format	arxiv
title	Universal Spatial Audio Transcoder
topic	Sound; Audio and Speech Processing
url	https://arxiv.org/abs/2405.04471
