:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Plachouras, Christos, Guinot, Julien, Fazekas, George, Quinton, Elio, Benetos, Emmanouil, Pauwels, Johan
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Machine Learning
Online-Zugang:	https://arxiv.org/abs/2505.06224
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Learning Music Audio Representations With Limited Data
von: Plachouras, Christos, et al.
Veröffentlicht: (2025)

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
von: Weck, Benno, et al.
Veröffentlicht: (2024)

Semi-Supervised Contrastive Learning of Musical Representations
von: Guinot, Julien, et al.
Veröffentlicht: (2024)

GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models
von: Guinot, Julien, et al.
Veröffentlicht: (2025)

Leave-One-EquiVariant: Alleviating invariance-related information loss in contrastive music representations
von: Guinot, Julien, et al.
Veröffentlicht: (2024)

SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding
von: Guinot, Julien, et al.
Veröffentlicht: (2025)

Universal Music Representations? Evaluating Foundation Models on World Music Corpora
von: Papaioannou, Charilaos, et al.
Veröffentlicht: (2025)

RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
von: Chang, Sungkyun, et al.
Veröffentlicht: (2025)

CoDiCodec: Unifying Continuous and Discrete Compressed Representations of Audio
von: Pasini, Marco, et al.
Veröffentlicht: (2025)

Audio-JEPA: Joint-Embedding Predictive Architecture for Audio Representation Learning
von: Tuncay, Ludovic, et al.
Veröffentlicht: (2025)

LC-Protonets: Multi-Label Few-Shot Learning for World Music Audio Tagging
von: Papaioannou, Charilaos, et al.
Veröffentlicht: (2024)

Towards Effective Negation Modeling in Joint Audio-Text Models for Music
von: Vasilakis, Yannis, et al.
Veröffentlicht: (2026)

CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following
von: Ma, Yinghao, et al.
Veröffentlicht: (2025)

Acoustic identification of individual animals with hierarchical contrastive learning
von: Nolasco, Ines, et al.
Veröffentlicht: (2024)

YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
von: Chang, Sungkyun, et al.
Veröffentlicht: (2024)

SCRAPL: Scattering Transform with Random Paths for Machine Learning
von: Mitcheltree, Christopher, et al.
Veröffentlicht: (2026)

Evaluation of pretrained language models on music understanding
von: Vasilakis, Yannis, et al.
Veröffentlicht: (2024)

Comparison of Autoencoder Encodings for ECG Representation in Downstream Prediction Tasks
von: Harvey, Christopher J., et al.
Veröffentlicht: (2024)

Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
von: Postolache, Emilian, et al.
Veröffentlicht: (2024)

A Data-Driven Analysis of Robust Automatic Piano Transcription
von: Edwards, Drew, et al.
Veröffentlicht: (2024)

Generalized Graph Prompt: Toward a Unification of Pre-Training and Downstream Tasks on Graphs
von: Yu, Xingtong, et al.
Veröffentlicht: (2023)

Task Priors: Enhancing Model Evaluation by Considering the Entire Space of Downstream Tasks
von: Patel, Niket, et al.
Veröffentlicht: (2025)

Smoke and Mirrors in Causal Downstream Tasks
von: Cadei, Riccardo, et al.
Veröffentlicht: (2024)

Dataset Representativeness and Downstream Task Fairness
von: Borza, Victor, et al.
Veröffentlicht: (2024)

I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition
von: Vasilakis, Yannis, et al.
Veröffentlicht: (2024)

Coupling Speech Encoders with Downstream Text Models
von: Chelba, Ciprian, et al.
Veröffentlicht: (2024)

Towards a Unified Framework for Evaluating Explanations
von: Pinto, Juan D., et al.
Veröffentlicht: (2024)

Panprediction: Optimal Predictions for Any Downstream Task and Loss
von: Balakrishnan, Sivaraman, et al.
Veröffentlicht: (2025)

Pretraining Induces a Reusable Spectral Basis for Downstream Task Adaptation
von: Yu, Junjie, et al.
Veröffentlicht: (2026)

JAZZVAR: A Dataset of Variations found within Solo Piano Performances of Jazz Standards for Music Overpainting
von: Row, Eleanor, et al.
Veröffentlicht: (2023)

Music2Latent: Consistency Autoencoders for Latent Audio Compression
von: Pasini, Marco, et al.
Veröffentlicht: (2024)

ECG Latent Feature Extraction with Autoencoders for Downstream Prediction Tasks
von: Harvey, Christopher, et al.
Veröffentlicht: (2025)

Handling Missing Data in Downstream Tasks With Distribution-Preserving Guarantees
von: Bordoloi, Rahul, et al.
Veröffentlicht: (2025)

Learning Treatment Representations for Downstream Instrumental Variable Regression
von: Lin, Shiangyi, et al.
Veröffentlicht: (2025)

Attacking Attention of Foundation Models Disrupts Downstream Tasks
von: Silva, Hondamunige Prasanna, et al.
Veröffentlicht: (2025)

Aligning the Evaluation of Probabilistic Predictions with Downstream Value
von: Shahroudi, Novin, et al.
Veröffentlicht: (2025)

Quality Audio Prototyping: a prototype system for unified sound retrieval and procedural generation
von: Garcia, Nelly, et al.
Veröffentlicht: (2026)

Task-tailored Pre-processing: Fair Downstream Supervised Learning
von: Sohn, Jinwon, et al.
Veröffentlicht: (2026)

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding
von: Pasini, Marco, et al.
Veröffentlicht: (2025)

Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming
von: Yu, Chin-Yun, et al.
Veröffentlicht: (2024)