:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Guinot, Julien, Quinton, Elio, Fazekas, György
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Audio and Speech Processing
Online-Zugang:	https://arxiv.org/abs/2407.13840
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models
von: Guinot, Julien, et al.
Veröffentlicht: (2025)

Leave-One-EquiVariant: Alleviating invariance-related information loss in contrastive music representations
von: Guinot, Julien, et al.
Veröffentlicht: (2024)

SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding
von: Guinot, Julien, et al.
Veröffentlicht: (2025)

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
von: Weck, Benno, et al.
Veröffentlicht: (2024)

Exploring Transformer-Based Music Overpainting for Jazz Piano Variations
von: Row, Eleanor, et al.
Veröffentlicht: (2024)

Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation
von: Zhang, Jincheng, et al.
Veröffentlicht: (2025)

Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models
von: Zhang, Jincheng, et al.
Veröffentlicht: (2023)

Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2024)

Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2023)

Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval
von: Stewart, Shanti, et al.
Veröffentlicht: (2024)

Composers' Evaluations of an AI Music Tool: Insights for Human-Centred Design
von: Row, Eleanor, et al.
Veröffentlicht: (2024)

Accelerating Automatic Differentiation of Direct Form Digital Filters
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2025)

Robust Lossy Audio Compression Identification
von: Koops, Hendrik Vincent, et al.
Veröffentlicht: (2024)

Sound Matching an Analogue Levelling Amplifier Using the Newton-Raphson Method
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2025)

Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2024)

Zero-Shot Duet Singing Voices Separation with Diffusion Models
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2023)

Audio synthesizer inversion in symmetric parameter spaces with approximately equivariant flow matching
von: Hayes, Ben, et al.
Veröffentlicht: (2025)

Exploring trends in audio mixes and masters: Insights from a dataset analysis
von: Mourgela, Angeliki, et al.
Veröffentlicht: (2024)

Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2022)

Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning
von: Wilson, Elizabeth, et al.
Veröffentlicht: (2024)

Self-Supervised Multi-View Learning for Disentangled Music Audio Representations
von: Wilkins, Julia, et al.
Veröffentlicht: (2024)

Balancing Information Preservation and Disentanglement in Self-Supervised Music Representation Learning
von: Wilkins, Julia, et al.
Veröffentlicht: (2025)

Semi-Supervised Self-Learning Enhanced Music Emotion Recognition
von: Sun, Yifu, et al.
Veröffentlicht: (2024)

Towards An Integrated Approach for Expressive Piano Performance Synthesis from Music Scores
von: Tang, Jingjing, et al.
Veröffentlicht: (2025)

Music2Latent: Consistency Autoencoders for Latent Audio Compression
von: Pasini, Marco, et al.
Veröffentlicht: (2024)

COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
von: Ciranni, Ruben, et al.
Veröffentlicht: (2024)

JAZZVAR: A Dataset of Variations found within Solo Piano Performances of Jazz Standards for Music Overpainting
von: Row, Eleanor, et al.
Veröffentlicht: (2023)

DiffVox: A Differentiable Model for Capturing and Analysing Vocal Effects Distributions
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2025)

Learning Separated Representations for Instrument-based Music Similarity
von: Hashizume, Yuka, et al.
Veröffentlicht: (2025)

SongFormer: Scaling Music Structure Analysis with Heterogeneous Supervision
von: Hao, Chunbo, et al.
Veröffentlicht: (2025)

The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning
von: Vaessen, Nik, et al.
Veröffentlicht: (2024)

Emotion-Aligned Contrastive Learning Between Images and Music
von: Stewart, Shanti, et al.
Veröffentlicht: (2023)

Differentiable All-pole Filters for Time-varying Audio Systems
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2024)

On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation
von: Baroudi, Séverin, et al.
Veröffentlicht: (2025)

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding
von: Pasini, Marco, et al.
Veröffentlicht: (2025)

Multi-Distillation from Speech and Music Representation Models
von: Wei, Jui-Chiang, et al.
Veröffentlicht: (2025)

Music Era Recognition Using Supervised Contrastive Learning and Artist Information
von: He, Qiqi, et al.
Veröffentlicht: (2024)

Additive Margin in Contrastive Self-Supervised Frameworks to Learn Discriminative Speaker Representations
von: Lepage, Theo, et al.
Veröffentlicht: (2024)

Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
von: Hashizume, Yuka, et al.
Veröffentlicht: (2024)

Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning
von: Zhu, Xinfa, et al.
Veröffentlicht: (2023)