Enregistré dans:
| Auteurs principaux: | Krishnan, Venkatakrishnan Vaidyanathapuram, Alben, Noel, Nair, Anish, Condit-Schultz, Nathaniel |
|---|---|
| Format: | Preprint |
| Publié: |
2025
|
| Sujets: | |
| Accès en ligne: | https://arxiv.org/abs/2501.06959 |
| Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
Documents similaires
The Perception of Phase Intercept Distortion and its Application in Data Augmentation
par: Krishnan, Venkatakrishnan Vaidyanathapuram, et autres
Publié: (2025)
par: Krishnan, Venkatakrishnan Vaidyanathapuram, et autres
Publié: (2025)
An Open Research Dataset of the 1932 Cairo Congress of Arab Music
par: Bozkurt, Baris
Publié: (2025)
par: Bozkurt, Baris
Publié: (2025)
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search
par: McCallum, Matthew C., et autres
Publié: (2024)
par: McCallum, Matthew C., et autres
Publié: (2024)
GraphMuse: A Library for Symbolic Music Graph Processing
par: Karystinaios, Emmanouil, et autres
Publié: (2024)
par: Karystinaios, Emmanouil, et autres
Publié: (2024)
The GigaMIDI Dataset with Features for Expressive Music Performance Detection
par: Lee, Keon Ju Maverick, et autres
Publié: (2025)
par: Lee, Keon Ju Maverick, et autres
Publié: (2025)
Revisiting Meter Tracking in Carnatic Music using Deep Learning Approaches
par: Prabhu, Satyajeet
Publié: (2025)
par: Prabhu, Satyajeet
Publié: (2025)
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation
par: Doukhan, David, et autres
Publié: (2024)
par: Doukhan, David, et autres
Publié: (2024)
A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification
par: Uro, Rémi, et autres
Publié: (2024)
par: Uro, Rémi, et autres
Publié: (2024)
KuiSCIMA v2.0: Improved Baselines, Calibration, and Cross-Notation Generalization for Historical Chinese Music Notations in Jiang Kui's Baishidaoren Gequ
par: Repolusk, Tristan, et autres
Publié: (2025)
par: Repolusk, Tristan, et autres
Publié: (2025)
Style-based Composer Identification and Attribution of Symbolic Music Scores: a Systematic Survey
par: Simonetta, Federico
Publié: (2025)
par: Simonetta, Federico
Publié: (2025)
DAIRHuM: A Platform for Directly Aligning AI Representations with Human Musical Judgments applied to Carnatic Music
par: Ravikumar, Prashanth Thattai
Publié: (2024)
par: Ravikumar, Prashanth Thattai
Publié: (2024)
The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?
par: Ramoneda, Pedro, et autres
Publié: (2024)
par: Ramoneda, Pedro, et autres
Publié: (2024)
CloserMusicDB: A Modern Multipurpose Dataset of High Quality Music
par: Piekarzewicz, Aleksandra, et autres
Publié: (2024)
par: Piekarzewicz, Aleksandra, et autres
Publié: (2024)
Carnatic Raga Identification System using Rigorous Time-Delay Neural Network
par: Natesan, Sanjay, et autres
Publié: (2024)
par: Natesan, Sanjay, et autres
Publié: (2024)
Automatic Identification of Samples in Hip-Hop Music via Multi-Loss Training and an Artificial Dataset
par: Cheston, Huw, et autres
Publié: (2025)
par: Cheston, Huw, et autres
Publié: (2025)
SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation
par: Garcia-Martinez, Jaime, et autres
Publié: (2024)
par: Garcia-Martinez, Jaime, et autres
Publié: (2024)
MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
par: Huang, Yu-Fen, et autres
Publié: (2024)
par: Huang, Yu-Fen, et autres
Publié: (2024)
Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
par: Tjandra, Andros, et autres
Publié: (2025)
par: Tjandra, Andros, et autres
Publié: (2025)
The Spheres Dataset: Multitrack Orchestral Recordings for Music Source Separation and Information Retrieval
par: Garcia-Martinez, Jaime, et autres
Publié: (2025)
par: Garcia-Martinez, Jaime, et autres
Publié: (2025)
Multi-Source Music Generation with Latent Diffusion
par: Xu, Zhongweiyang, et autres
Publié: (2024)
par: Xu, Zhongweiyang, et autres
Publié: (2024)
MusicEval: A Generative Music Dataset with Expert Ratings for Automatic Text-to-Music Evaluation
par: Liu, Cheng, et autres
Publié: (2025)
par: Liu, Cheng, et autres
Publié: (2025)
Development of Large Annotated Music Datasets using HMM-based Forced Viterbi Alignment
par: Joysingh, S. Johanan, et autres
Publié: (2024)
par: Joysingh, S. Johanan, et autres
Publié: (2024)
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
par: Bai, Ye, et autres
Publié: (2024)
par: Bai, Ye, et autres
Publié: (2024)
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
par: Mariani, Giorgio, et autres
Publié: (2023)
par: Mariani, Giorgio, et autres
Publié: (2023)
FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models
par: Comanducci, Luca, et autres
Publié: (2024)
par: Comanducci, Luca, et autres
Publié: (2024)
MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision
par: Chen, Jiatao, et autres
Publié: (2024)
par: Chen, Jiatao, et autres
Publié: (2024)
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
par: Wisnu, Dyah A. M. G., et autres
Publié: (2024)
par: Wisnu, Dyah A. M. G., et autres
Publié: (2024)
JAZZVAR: A Dataset of Variations found within Solo Piano Performances of Jazz Standards for Music Overpainting
par: Row, Eleanor, et autres
Publié: (2023)
par: Row, Eleanor, et autres
Publié: (2023)
ACMID: Automatic Curation of Musical Instrument Dataset for 7-Stem Music Source Separation
par: Yu, Ji, et autres
Publié: (2025)
par: Yu, Ji, et autres
Publié: (2025)
Multi-Microphone and Multi-Modal Emotion Recognition in Reverberant Environment
par: Cohen, Ohad, et autres
Publié: (2024)
par: Cohen, Ohad, et autres
Publié: (2024)
MusicRL: Aligning Music Generation to Human Preferences
par: Cideron, Geoffrey, et autres
Publié: (2024)
par: Cideron, Geoffrey, et autres
Publié: (2024)
Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
par: Postolache, Emilian, et autres
Publié: (2024)
par: Postolache, Emilian, et autres
Publié: (2024)
An Experimental Comparison Of Multi-view Self-supervised Methods For Music Tagging
par: Meseguer-Brocal, Gabriel, et autres
Publié: (2024)
par: Meseguer-Brocal, Gabriel, et autres
Publié: (2024)
Multimodal Dataset Normalization and Perceptual Validation for Music-Taste Correspondences
par: Spanio, Matteo, et autres
Publié: (2026)
par: Spanio, Matteo, et autres
Publié: (2026)
ClearerVoice-Studio: Bridging Advanced Speech Processing Research and Practical Deployment
par: Zhao, Shengkui, et autres
Publié: (2025)
par: Zhao, Shengkui, et autres
Publié: (2025)
PoolingVQ: A VQVAE Variant for Reducing Audio Redundancy and Boosting Multi-Modal Fusion in Music Emotion Analysis
par: Zou, Dinghao, et autres
Publié: (2025)
par: Zou, Dinghao, et autres
Publié: (2025)
Anticipatory Music Transformer
par: Thickstun, John, et autres
Publié: (2023)
par: Thickstun, John, et autres
Publié: (2023)
LC-Protonets: Multi-Label Few-Shot Learning for World Music Audio Tagging
par: Papaioannou, Charilaos, et autres
Publié: (2024)
par: Papaioannou, Charilaos, et autres
Publié: (2024)
ProGress: Structured Music Generation via Graph Diffusion and Hierarchical Music Analysis
par: Ni-Hahn, Stephen, et autres
Publié: (2025)
par: Ni-Hahn, Stephen, et autres
Publié: (2025)
Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music
par: Tunturi, Eetu, et autres
Publié: (2025)
par: Tunturi, Eetu, et autres
Publié: (2025)
Documents similaires
-
The Perception of Phase Intercept Distortion and its Application in Data Augmentation
par: Krishnan, Venkatakrishnan Vaidyanathapuram, et autres
Publié: (2025) -
An Open Research Dataset of the 1932 Cairo Congress of Arab Music
par: Bozkurt, Baris
Publié: (2025) -
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search
par: McCallum, Matthew C., et autres
Publié: (2024) -
GraphMuse: A Library for Symbolic Music Graph Processing
par: Karystinaios, Emmanouil, et autres
Publié: (2024) -
The GigaMIDI Dataset with Features for Expressive Music Performance Detection
par: Lee, Keon Ju Maverick, et autres
Publié: (2025)