:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Collins, Nick, Grierson, Mick
Format:	Preprint
Published:	2024
Subjects:	Computers and Society Machine Learning Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2402.14589
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Rarity of Musical Audio Signals Within the Space of Possible Audio Generation
by: Collins, Nick
Published: (2024)

Local deployment of large-scale music AI models on commodity hardware
by: Zhou, Xun, et al.
Published: (2024)

Improving AI-generated music with user-guided training
by: Singh, Vishwa Mohan, et al.
Published: (2025)

Detecting music deepfakes is easy but actually hard
by: Afchar, Darius, et al.
Published: (2024)

Long-form music generation with latent diffusion
by: Evans, Zach, et al.
Published: (2024)

Computational music analysis from first principles
by: Tymoczko, Dmitri, et al.
Published: (2024)

StemGen: A music generation model that listens
by: Parker, Julian D., et al.
Published: (2023)

Unsupervised outlier detection to improve bird audio dataset labels
by: Collins, Bruce
Published: (2025)

Musical composition and 2D cellular automata based on music intervals
by: Lugo, Igor, et al.
Published: (2024)

Learning and composing of classical music using restricted Boltzmann machines
by: Kobayashi, Mutsumi, et al.
Published: (2025)

Towards measuring fairness in speech recognition: Fair-Speech dataset
by: Veliche, Irina-Elena, et al.
Published: (2024)

Emotion Manipulation Through Music -- A Deep Learning Interactive Visual Approach
by: Abdalla, Adel N., et al.
Published: (2024)

The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing loss
by: Dabike, Gerardo Roa, et al.
Published: (2024)

SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling
by: Ahmed, Tawsif, et al.
Published: (2025)

On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures
by: Hilmes, Benedikt, et al.
Published: (2024)

On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition
by: Rossenbach, Nick, et al.
Published: (2024)

Analyzing the Importance of Blank for CTC-Based Knowledge Distillation
by: Hilmes, Benedikt, et al.
Published: (2025)

Melody predominates over harmony in the evolution of musical scales across 96 countries
by: McBride, John M, et al.
Published: (2024)

Joint sentiment analysis of lyrics and audio in music
by: Schaab, Lea, et al.
Published: (2024)

Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation
by: Pasini, Marco, et al.
Published: (2024)

Symbotunes: unified hub for symbolic music generative models
by: Skierś, Paweł, et al.
Published: (2024)

Linear RNNs for autoregressive generation of long music samples
by: Szewczyk, Konrad, et al.
Published: (2025)

Who Gets Heard? Rethinking Fairness in AI for Music Systems
by: Mehta, Atharva, et al.
Published: (2025)

Improving Generalization for AI-Synthesized Voice Detection
by: Ren, Hainan, et al.
Published: (2024)

Music Genre Classification: Training an AI model
by: Mogonediwa, Keoikantse
Published: (2024)

Unified AI for Accurate Audio Anomaly Detection
by: Khaleghpour, Hamideh, et al.
Published: (2025)

Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
by: Perera, David, et al.
Published: (2024)

Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired
by: Jiayu, et al.
Published: (2025)

WikiMuTe: A web-sourced dataset of semantic descriptions for music audio
by: Weck, Benno, et al.
Published: (2023)

Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise
by: Karchkhadze, Tornike, et al.
Published: (2024)

Emergent musical properties of a transformer under contrastive self-supervised learning
by: Kong, Yuexuan, et al.
Published: (2025)

Listening for Expert Identified Linguistic Features: Assessment of Audio Deepfake Discernment among Undergraduate Students
by: Bhalli, Noshaba N., et al.
Published: (2024)

Towards better visualizations of urban sound environments: insights from interviews
by: Tailleur, Modan, et al.
Published: (2024)

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration
by: Huang, Peng, et al.
Published: (2025)

Best Practices and Considerations for Child Speech Corpus Collection and Curation in Educational, Clinical, and Forensic Scenarios
by: Hansen, John, et al.
Published: (2025)

Music and Artificial Intelligence: Artistic Trends
by: Pons, Jordi, et al.
Published: (2025)

Combining Textual and Spectral Features for Robust Classification of Pilot Communications
by: Tanvir, Abdullah All, et al.
Published: (2025)

Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances
by: Ding, Ziqi, et al.
Published: (2026)

An AI-Driven Approach to Wind Turbine Bearing Fault Diagnosis from Acoustic Signals
by: Wang, Zhao, et al.
Published: (2024)

From Sound to Setting: AI-Based Equalizer Parameter Prediction for Piano Tone Replication
by: Yu, Song-Ze
Published: (2025)