Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sun, Haoran, Fourer, Dominique, Maaref, Hichem
Format:	Preprint
Published:	2024
Subjects:	Sound Artificial Intelligence Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2411.04337
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908528251240448
author	Sun, Haoran Fourer, Dominique Maaref, Hichem
author_facet	Sun, Haoran Fourer, Dominique Maaref, Hichem
contents	Dynamic Range Compression (DRC) is a widely used audio effect that adjusts signal dynamics for applications in music production, broadcasting, and speech processing. Inverting DRC is of broad importance for restoring the original dynamics, enabling remixing, and enhancing the overall audio quality. Existing DRC inversion methods either overlook key parameters or rely on precise parameter values, which can be challenging to estimate accurately. To address this limitation, we introduce a hybrid approach that combines model-based DRC inversion with neural networks to achieve robust DRC parameter estimation and audio restoration simultaneously. Our method uses tailored neural network architectures (classification and regression), which are then integrated into a model-based inversion framework to reconstruct the original signal. Experimental evaluations on various music and speech datasets confirm the effectiveness and robustness of our approach, outperforming several state-of-the-art techniques.
format	Preprint
id	arxiv_https___arxiv_org_abs_2411_04337
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Neural-Enhanced Dynamic Range Compression Inversion: A Hybrid Approach for Restoring Audio Dynamics Sun, Haoran Fourer, Dominique Maaref, Hichem Sound Artificial Intelligence Audio and Speech Processing Dynamic Range Compression (DRC) is a widely used audio effect that adjusts signal dynamics for applications in music production, broadcasting, and speech processing. Inverting DRC is of broad importance for restoring the original dynamics, enabling remixing, and enhancing the overall audio quality. Existing DRC inversion methods either overlook key parameters or rely on precise parameter values, which can be challenging to estimate accurately. To address this limitation, we introduce a hybrid approach that combines model-based DRC inversion with neural networks to achieve robust DRC parameter estimation and audio restoration simultaneously. Our method uses tailored neural network architectures (classification and regression), which are then integrated into a model-based inversion framework to reconstruct the original signal. Experimental evaluations on various music and speech datasets confirm the effectiveness and robustness of our approach, outperforming several state-of-the-art techniques.
title	Neural-Enhanced Dynamic Range Compression Inversion: A Hybrid Approach for Restoring Audio Dynamics
topic	Sound Artificial Intelligence Audio and Speech Processing
url	https://arxiv.org/abs/2411.04337

Similar Items