| Main Authors: | Wilson, Elizabeth; Fazekas, György; Wiggins, Geraint |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.07918 |
Similar Items
Composers' Evaluations of an AI Music Tool: Insights for Human-Centred Design
by: Row, Eleanor, et al.
Published: (2024)
GMM-ResNext: Combining Generative and Discriminative Models for Speaker Verification
by: Yan, Hui, et al.
Published: (2024)
MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling
by: Tang, Jingjing, et al.
Published: (2025)
A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition
by: Benster, Tyler, et al.
Published: (2024)
Exploring Situated Stabilities of a Rhythm Generation System through Variational Cross-Examination
by: Kotowski, Błażej, et al.
Published: (2025)
A Theory-Based Explainable Deep Learning Architecture for Music Emotion
by: Fong, Hortense, et al.
Published: (2024)
Music Generation using Human-In-The-Loop Reinforcement Learning
by: Justus, Aju Ani
Published: (2025)
Open vocabulary keyword spotting through transfer learning from speech synthesis
by: V, Kesavaraj, et al.
Published: (2024)
A General Close-loop Predictive Coding Framework for Auditory Working Memory
by: Yuan, Zhongju, et al.
Published: (2025)
Cervical Auscultation Machine Learning for Dysphagia Assessment
by: Chia, An An, et al.
Published: (2024)
A cross-talk robust multichannel VAD model for multiparty agent interactions trained using synthetic re-recordings
by: Han, Hyewon, et al.
Published: (2024)
Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models
by: Zhang, Jincheng, et al.
Published: (2023)
Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation
by: Zhang, Jincheng, et al.
Published: (2025)
DeformTune: A Deformable XAI Music Prototype for Non-Musicians
by: Xu, Ziqing, et al.
Published: (2025)
Human Perception of Audio Deepfakes
by: Müller, Nicolas M., et al.
Published: (2021)
CabinSep: IR-Augmented Mask-Based MVDR for Real-Time In-Car Speech Separation with Distributed Heterogeneous Arrays
by: Han, Runduo, et al.
Published: (2025)
SynthScribe: Deep Multimodal Tools for Synthesizer Sound Retrieval and Exploration
by: Brade, Stephen, et al.
Published: (2023)
Revisiting Your Memory: Reconstruction of Affect-Contextualized Memory via EEG-guided Audiovisual Generation
by: Kwon, Joonwoo, et al.
Published: (2024)
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
by: Guo, Yiwei, et al.
Published: (2023)
EvolveCaptions: Empowering DHH Users Through Real-Time Collaborative Captioning
by: Wu, Liang-Yuan, et al.
Published: (2025)
Reimagining Dance: Real-time Music Co-creation between Dancers and AI
by: Vechtomova, Olga, et al.
Published: (2025)
LSTM-CNN Network for Audio Signature Analysis in Noisy Environments
by: Damacharla, Praveen, et al.
Published: (2023)
MCP2OSC: Parametric Control by Natural Language
by: Fan, Yuan-Yi
Published: (2025)
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
by: Chang, Yi, et al.
Published: (2024)
Interactive Melody Generation System for Enhancing the Creativity of Musicians
by: Hirawata, So, et al.
Published: (2024)
Tipping Points, Pulse Elasticity and Tonal Tension: An Empirical Study on What Generates Tipping Points
by: Naik, Canishk, et al.
Published: (2024)
Between the AI and Me: Analysing Listeners' Perspectives on AI- and Human-Composed Progressive Metal Music
by: Sarmento, Pedro, et al.
Published: (2024)
Sound2Hap: Learning Audio-to-Vibrotactile Haptic Generation from Human Ratings
by: Li, Yinan, et al.
Published: (2026)
Capturing Cancer as Music: Cancer Mechanisms Expressed through Musification
by: Hnatyshyn, Rostyslav, et al.
Published: (2024)
NeuroIncept Decoder for High-Fidelity Speech Reconstruction from Neural Activity
by: Khanday, Owais Mujtaba, et al.
Published: (2025)
Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings
by: Khanday, Owais Mujtaba, et al.
Published: (2025)
A Mapping Strategy for Interacting with Latent Audio Synthesis Using Artistic Materials
by: Zheng, Shuoyang, et al.
Published: (2024)
Enhancing DMI Interactions by Integrating Haptic Feedback for Intricate Vibrato Technique
by: Piao, Ziyue, et al.
Published: (2024)
Towards Temporally Explainable Dysarthric Speech Clarity Assessment
by: Park, Seohyun, et al.
Published: (2025)
Interactive Sonification for Health and Energy using ChucK and Unity
by: Zhao, Yichun, et al.
Published: (2024)
A Near-Real-Time Processing Ego Speech Filtering Pipeline Designed for Speech Interruption During Human-Robot Interaction
by: Li, Yue, et al.
Published: (2024)
USpeech: Ultrasound-Enhanced Speech with Minimal Human Effort via Cross-Modal Synthesis
by: Yu, Luca Jiang-Tao, et al.
Published: (2024)
Seeing Beyond Sound: Visualization and Abstraction in Audio Data Representation
by: Blumé, Ashlae
Published: (2025)
Early Detection of Furniture-Infesting Wood-Boring Beetles Using CNN-LSTM Networks and MFCC-Based Acoustic Features
by: Manukalpa, J. M. Chan Sri, et al.
Published: (2025)
Interfacing with history: Curating with audio augmented objects
by: Cliffe, Laurence
Published: (2024)