| Main Authors: | Collins, Nick; Grierson, Mick |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2402.14589 |
Similar Items
The Rarity of Musical Audio Signals Within the Space of Possible Audio Generation
by: Collins, Nick
Published: (2024)
Local deployment of large-scale music AI models on commodity hardware
by: Zhou, Xun, et al.
Published: (2024)
Improving AI-generated music with user-guided training
by: Singh, Vishwa Mohan, et al.
Published: (2025)
Detecting music deepfakes is easy but actually hard
by: Afchar, Darius, et al.
Published: (2024)
Long-form music generation with latent diffusion
by: Evans, Zach, et al.
Published: (2024)
Computational music analysis from first principles
by: Tymoczko, Dmitri, et al.
Published: (2024)
StemGen: A music generation model that listens
by: Parker, Julian D., et al.
Published: (2023)
Unsupervised outlier detection to improve bird audio dataset labels
by: Collins, Bruce
Published: (2025)
Musical composition and 2D cellular automata based on music intervals
by: Lugo, Igor, et al.
Published: (2024)
Learning and composing of classical music using restricted Boltzmann machines
by: Kobayashi, Mutsumi, et al.
Published: (2025)
Towards measuring fairness in speech recognition: Fair-Speech dataset
by: Veliche, Irina-Elena, et al.
Published: (2024)
Emotion Manipulation Through Music -- A Deep Learning Interactive Visual Approach
by: Abdalla, Adel N., et al.
Published: (2024)
The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing loss
by: Dabike, Gerardo Roa, et al.
Published: (2024)
SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling
by: Ahmed, Tawsif, et al.
Published: (2025)
On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures
by: Hilmes, Benedikt, et al.
Published: (2024)
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition
by: Rossenbach, Nick, et al.
Published: (2024)
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation
by: Hilmes, Benedikt, et al.
Published: (2025)
Melody predominates over harmony in the evolution of musical scales across 96 countries
by: McBride, John M, et al.
Published: (2024)
Joint sentiment analysis of lyrics and audio in music
by: Schaab, Lea, et al.
Published: (2024)
Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation
by: Pasini, Marco, et al.
Published: (2024)
Symbotunes: unified hub for symbolic music generative models
by: Skierś, Paweł, et al.
Published: (2024)
Linear RNNs for autoregressive generation of long music samples
by: Szewczyk, Konrad, et al.
Published: (2025)
Who Gets Heard? Rethinking Fairness in AI for Music Systems
by: Mehta, Atharva, et al.
Published: (2025)
Improving Generalization for AI-Synthesized Voice Detection
by: Ren, Hainan, et al.
Published: (2024)
Music Genre Classification: Training an AI model
by: Mogonediwa, Keoikantse
Published: (2024)
Unified AI for Accurate Audio Anomaly Detection
by: Khaleghpour, Hamideh, et al.
Published: (2025)
Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
by: Perera, David, et al.
Published: (2024)
Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired
by: Jiayu, et al.
Published: (2025)
WikiMuTe: A web-sourced dataset of semantic descriptions for music audio
by: Weck, Benno, et al.
Published: (2023)
Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise
by: Karchkhadze, Tornike, et al.
Published: (2024)
Emergent musical properties of a transformer under contrastive self-supervised learning
by: Kong, Yuexuan, et al.
Published: (2025)
Listening for Expert Identified Linguistic Features: Assessment of Audio Deepfake Discernment among Undergraduate Students
by: Bhalli, Noshaba N., et al.
Published: (2024)
Towards better visualizations of urban sound environments: insights from interviews
by: Tailleur, Modan, et al.
Published: (2024)
Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration
by: Huang, Peng, et al.
Published: (2025)
Best Practices and Considerations for Child Speech Corpus Collection and Curation in Educational, Clinical, and Forensic Scenarios
by: Hansen, John, et al.
Published: (2025)
Music and Artificial Intelligence: Artistic Trends
by: Pons, Jordi, et al.
Published: (2025)
Combining Textual and Spectral Features for Robust Classification of Pilot Communications
by: Tanvir, Abdullah All, et al.
Published: (2025)
Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances
by: Ding, Ziqi, et al.
Published: (2026)
An AI-Driven Approach to Wind Turbine Bearing Fault Diagnosis from Acoustic Signals
by: Wang, Zhao, et al.
Published: (2024)
From Sound to Setting: AI-Based Equalizer Parameter Prediction for Piano Tone Replication
by: Yu, Song-Ze
Published: (2025)