:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cheddad, Zohra Adila, Cheddad, Abbas
Format:	Preprint
Published:	2021
Subjects:	Audio and Speech Processing Machine Learning Sound
Online Access:	https://arxiv.org/abs/2111.10891
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Diffusion Models for Audio Restoration
by: Lemercier, Jean-Marie, et al.
Published: (2024)

Learning to Upsample and Upmix Audio in the Latent Domain
by: Bralios, Dimitrios, et al.
Published: (2025)

Music2Latent: Consistency Autoencoders for Latent Audio Compression
by: Pasini, Marco, et al.
Published: (2024)

Fast Timing-Conditioned Latent Audio Diffusion
by: Evans, Zach, et al.
Published: (2024)

The Rarity of Musical Audio Signals Within the Space of Possible Audio Generation
by: Collins, Nick
Published: (2024)

Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders
by: Bralios, Dimitrios, et al.
Published: (2025)

Voice Signal Processing for Machine Learning. The Case of Speaker Isolation
by: Ganchev, Radan
Published: (2024)

LiLAC: A Lightweight Latent ControlNet for Musical Audio Generation
by: Baker, Tom, et al.
Published: (2025)

Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description
by: Kilickaya, Sertac, et al.
Published: (2024)

Latent Granular Resynthesis using Neural Audio Codecs
by: Tokui, Nao, et al.
Published: (2025)

Synthesizer Sound Matching Using Audio Spectrogram Transformers
by: Bruford, Fred, et al.
Published: (2024)

Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning
by: Lindholm, Richard, et al.
Published: (2025)

Detecting Throat Cancer from Speech Signals using Machine Learning: A Scoping Literature Review
by: Paterson, Mary, et al.
Published: (2023)

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
by: Kong, Zhifeng, et al.
Published: (2024)

Learning Spatially-Aware Language and Audio Embeddings
by: Devnani, Bhavika, et al.
Published: (2024)

Learning Music Audio Representations With Limited Data
by: Plachouras, Christos, et al.
Published: (2025)

Contrastive Learning from Synthetic Audio Doppelgängers
by: Cherep, Manuel, et al.
Published: (2024)

Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
by: Ariyanti, Whenty, et al.
Published: (2024)

COVID-19 Detection System: A Comparative Analysis of System Performance Based on Acoustic Features of Cough Audio Signals
by: Shati, Asmaa, et al.
Published: (2023)

Learning Disentangled Audio Representations through Controlled Synthesis
by: Brima, Yusuf, et al.
Published: (2024)

Exploring Meta Information for Audio-based Zero-shot Bird Classification
by: Gebhard, Alexander, et al.
Published: (2023)

High-Resolution Speech Restoration with Latent Diffusion Model
by: Dhyani, Tushar, et al.
Published: (2024)

Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
by: Manor, Hila, et al.
Published: (2024)

A Data-Centric Framework for Machine Listening Projects: Addressing Large-Scale Data Acquisition and Labeling through Active Learning
by: Naranjo-Alcazar, Javier, et al.
Published: (2024)

A2SB: Audio-to-Audio Schrodinger Bridges
by: Kong, Zhifeng, et al.
Published: (2025)

Sound Source Separation Using Latent Variational Block-Wise Disentanglement
by: Helwani, Karim, et al.
Published: (2024)

COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
by: Ciranni, Ruben, et al.
Published: (2024)

CAK: Emergent Audio Effects from Minimal Deep Learning
by: Rockman, Austin
Published: (2025)

TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
by: Primus, Paul, et al.
Published: (2025)

Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
by: Dutta, Soumya, et al.
Published: (2024)

Text-Independent Speaker Identification Using Audio Looping With Margin Based Loss Functions
by: Garcia, Elliot Q C, et al.
Published: (2025)

Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models
by: Nercessian, Shahan, et al.
Published: (2024)

Estimated Audio-Caption Correspondences Improve Language-Based Audio Retrieval
by: Primus, Paul, et al.
Published: (2024)

Fusing Audio and Metadata Embeddings Improves Language-based Audio Retrieval
by: Primus, Paul, et al.
Published: (2024)

On Class Separability Pitfalls In Audio-Text Contrastive Zero-Shot Learning
by: Tavares, Tiago, et al.
Published: (2024)

Meta-Learning in Audio and Speech Processing: An End to End Comprehensive Review
by: Raimon, Athul, et al.
Published: (2024)

Comparative Analysis of Mel-Frequency Cepstral Coefficients and Wavelet Based Audio Signal Processing for Emotion Detection and Mental Health Assessment in Spoken Speech
by: Agbo, Idoko, et al.
Published: (2024)

Parametric Neural Amp Modeling with Active Learning
by: Grötschla, Florian, et al.
Published: (2025)

Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos
by: Fedorishin, Dennis, et al.
Published: (2024)

CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer
by: Takeuchi, Daiki, et al.
Published: (2025)