Saved in:
Bibliographic Details
Main Authors: Sun, Haoran, Fourer, Dominique, Maaref, Hichem
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2411.04337
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908528251240448
author Sun, Haoran
Fourer, Dominique
Maaref, Hichem
author_facet Sun, Haoran
Fourer, Dominique
Maaref, Hichem
contents Dynamic Range Compression (DRC) is a widely used audio effect that adjusts signal dynamics for applications in music production, broadcasting, and speech processing. Inverting DRC is of broad importance for restoring the original dynamics, enabling remixing, and enhancing the overall audio quality. Existing DRC inversion methods either overlook key parameters or rely on precise parameter values, which can be challenging to estimate accurately. To address this limitation, we introduce a hybrid approach that combines model-based DRC inversion with neural networks to achieve robust DRC parameter estimation and audio restoration simultaneously. Our method uses tailored neural network architectures (classification and regression), which are then integrated into a model-based inversion framework to reconstruct the original signal. Experimental evaluations on various music and speech datasets confirm the effectiveness and robustness of our approach, outperforming several state-of-the-art techniques.
format Preprint
id arxiv_https___arxiv_org_abs_2411_04337
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Neural-Enhanced Dynamic Range Compression Inversion: A Hybrid Approach for Restoring Audio Dynamics
Sun, Haoran
Fourer, Dominique
Maaref, Hichem
Sound
Artificial Intelligence
Audio and Speech Processing
Dynamic Range Compression (DRC) is a widely used audio effect that adjusts signal dynamics for applications in music production, broadcasting, and speech processing. Inverting DRC is of broad importance for restoring the original dynamics, enabling remixing, and enhancing the overall audio quality. Existing DRC inversion methods either overlook key parameters or rely on precise parameter values, which can be challenging to estimate accurately. To address this limitation, we introduce a hybrid approach that combines model-based DRC inversion with neural networks to achieve robust DRC parameter estimation and audio restoration simultaneously. Our method uses tailored neural network architectures (classification and regression), which are then integrated into a model-based inversion framework to reconstruct the original signal. Experimental evaluations on various music and speech datasets confirm the effectiveness and robustness of our approach, outperforming several state-of-the-art techniques.
title Neural-Enhanced Dynamic Range Compression Inversion: A Hybrid Approach for Restoring Audio Dynamics
topic Sound
Artificial Intelligence
Audio and Speech Processing
url https://arxiv.org/abs/2411.04337