Enregistré dans:
Détails bibliographiques
Auteurs principaux: Fuchs, Guillaume, Ghido, Florin, Weckbecker, Dominik, Thiergart, Oliver
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2503.23586
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866910899199016960
author Fuchs, Guillaume
Ghido, Florin
Weckbecker, Dominik
Thiergart, Oliver
author_facet Fuchs, Guillaume
Ghido, Florin
Weckbecker, Dominik
Thiergart, Oliver
contents Directional Audio Coding (DirAC) is a proven method for parametrically representing a 3D audio scene in B-format and is capable of reproducing it on arbitrary loudspeaker layouts. Although such a method seems well suited for low bitrate Ambisonic transmission, little work has been done on the feasibility of building a real system upon it. In this paper, we present a DirAC-based coding for Higher-Order Ambisonics (HOA), developed as part of a standardisation effort to extend the 3GPP EVS codec to immersive communications. Starting from the first-order DirAC model, we show how to reduce algorithmic delay, the bitrate required for the parameters and complexity by bringing the full synthesis in the spherical harmonic domain. The evaluation of the proposed technique for coding 3\textsuperscript{rd} order Ambisonics at bitrates from 32 to 128 kbps shows the relevance of the parametric approach compared with existing solutions.
format Preprint
id arxiv_https___arxiv_org_abs_2503_23586
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A first-order DirAC-based parametric Ambisonic coder for immersive communications
Fuchs, Guillaume
Ghido, Florin
Weckbecker, Dominik
Thiergart, Oliver
Audio and Speech Processing
Directional Audio Coding (DirAC) is a proven method for parametrically representing a 3D audio scene in B-format and is capable of reproducing it on arbitrary loudspeaker layouts. Although such a method seems well suited for low bitrate Ambisonic transmission, little work has been done on the feasibility of building a real system upon it. In this paper, we present a DirAC-based coding for Higher-Order Ambisonics (HOA), developed as part of a standardisation effort to extend the 3GPP EVS codec to immersive communications. Starting from the first-order DirAC model, we show how to reduce algorithmic delay, the bitrate required for the parameters and complexity by bringing the full synthesis in the spherical harmonic domain. The evaluation of the proposed technique for coding 3\textsuperscript{rd} order Ambisonics at bitrates from 32 to 128 kbps shows the relevance of the parametric approach compared with existing solutions.
title A first-order DirAC-based parametric Ambisonic coder for immersive communications
topic Audio and Speech Processing
url https://arxiv.org/abs/2503.23586