Saved in:
Bibliographic Details
Main Authors: Haider, Daniel, Perfler, Felix, Balazs, Peter, Hollomey, Clara, Holighaus, Nicki
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.07709
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This paper introduces ISAC, an invertible and stable, perceptually-motivated filter bank that is specifically designed to be integrated into machine learning paradigms. More precisely, the center frequencies and bandwidths of the filters are chosen to follow a non-linear, auditory frequency scale, the filter kernels have user-defined maximum temporal support and may serve as learnable convolutional kernels, and there exists a corresponding filter bank such that both form a perfect reconstruction pair. ISAC provides a powerful and user-friendly audio front-end suitable for any application, including analysis-synthesis schemes.