:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wilson, Elizabeth, Fazekas, György, Wiggins, Geraint
Format:	Preprint
Published:	2024
Subjects:	Human-Computer Interaction Artificial Intelligence Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2409.07918
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Composers' Evaluations of an AI Music Tool: Insights for Human-Centred Design
by: Row, Eleanor, et al.
Published: (2024)

GMM-ResNext: Combining Generative and Discriminative Models for Speaker Verification
by: Yan, Hui, et al.
Published: (2024)

MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling
by: Tang, Jingjing, et al.
Published: (2025)

A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition
by: Benster, Tyler, et al.
Published: (2024)

Exploring Situated Stabilities of a Rhythm Generation System through Variational Cross-Examination
by: Kotowski, Błażej, et al.
Published: (2025)

A Theory-Based Explainable Deep Learning Architecture for Music Emotion
by: Fong, Hortense, et al.
Published: (2024)

Music Generation using Human-In-The-Loop Reinforcement Learning
by: Justus, Aju Ani
Published: (2025)

Open vocabulary keyword spotting through transfer learning from speech synthesis
by: V, Kesavaraj, et al.
Published: (2024)

A General Close-loop Predictive Coding Framework for Auditory Working Memory
by: Yuan, Zhongju, et al.
Published: (2025)

Cervical Auscultation Machine Learning for Dysphagia Assessment
by: Chia, An An, et al.
Published: (2024)

A cross-talk robust multichannel VAD model for multiparty agent interactions trained using synthetic re-recordings
by: Han, Hyewon, et al.
Published: (2024)

Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models
by: Zhang, Jincheng, et al.
Published: (2023)

Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation
by: Zhang, Jincheng, et al.
Published: (2025)

DeformTune: A Deformable XAI Music Prototype for Non-Musicians
by: Xu, Ziqing, et al.
Published: (2025)

Human Perception of Audio Deepfakes
by: Müller, Nicolas M., et al.
Published: (2021)

CabinSep: IR-Augmented Mask-Based MVDR for Real-Time In-Car Speech Separation with Distributed Heterogeneous Arrays
by: Han, Runduo, et al.
Published: (2025)

SynthScribe: Deep Multimodal Tools for Synthesizer Sound Retrieval and Exploration
by: Brade, Stephen, et al.
Published: (2023)

Revisiting Your Memory: Reconstruction of Affect-Contextualized Memory via EEG-guided Audiovisual Generation
by: Kwon, Joonwoo, et al.
Published: (2024)

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
by: Guo, Yiwei, et al.
Published: (2023)

EvolveCaptions: Empowering DHH Users Through Real-Time Collaborative Captioning
by: Wu, Liang-Yuan, et al.
Published: (2025)

Reimagining Dance: Real-time Music Co-creation between Dancers and AI
by: Vechtomova, Olga, et al.
Published: (2025)

LSTM-CNN Network for Audio Signature Analysis in Noisy Environments
by: Damacharla, Praveen, et al.
Published: (2023)

MCP2OSC: Parametric Control by Natural Language
by: Fan, Yuan-Yi
Published: (2025)

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
by: Chang, Yi, et al.
Published: (2024)

Interactive Melody Generation System for Enhancing the Creativity of Musicians
by: Hirawata, So, et al.
Published: (2024)

Tipping Points, Pulse Elasticity and Tonal Tension: An Empirical Study on What Generates Tipping Points
by: Naik, Canishk, et al.
Published: (2024)

Between the AI and Me: Analysing Listeners' Perspectives on AI- and Human-Composed Progressive Metal Music
by: Sarmento, Pedro, et al.
Published: (2024)

Sound2Hap: Learning Audio-to-Vibrotactile Haptic Generation from Human Ratings
by: Li, Yinan, et al.
Published: (2026)

Capturing Cancer as Music: Cancer Mechanisms Expressed through Musification
by: Hnatyshyn, Rostyslav, et al.
Published: (2024)

NeuroIncept Decoder for High-Fidelity Speech Reconstruction from Neural Activity
by: Khanday, Owais Mujtaba, et al.
Published: (2025)

Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings
by: Khanday, Owais Mujtaba, et al.
Published: (2025)

A Mapping Strategy for Interacting with Latent Audio Synthesis Using Artistic Materials
by: Zheng, Shuoyang, et al.
Published: (2024)

Enhancing DMI Interactions by Integrating Haptic Feedback for Intricate Vibrato Technique
by: Piao, Ziyue, et al.
Published: (2024)

Towards Temporally Explainable Dysarthric Speech Clarity Assessment
by: Park, Seohyun, et al.
Published: (2025)

Interactive Sonification for Health and Energy using ChucK and Unity
by: Zhao, Yichun, et al.
Published: (2024)

A Near-Real-Time Processing Ego Speech Filtering Pipeline Designed for Speech Interruption During Human-Robot Interaction
by: Li, Yue, et al.
Published: (2024)

USpeech: Ultrasound-Enhanced Speech with Minimal Human Effort via Cross-Modal Synthesis
by: Yu, Luca Jiang-Tao, et al.
Published: (2024)

Seeing Beyond Sound: Visualization and Abstraction in Audio Data Representation
by: Blum'e, Ashlae
Published: (2025)

Early Detection of Furniture-Infesting Wood-Boring Beetles Using CNN-LSTM Networks and MFCC-Based Acoustic Features
by: Manukalpa, J. M. Chan Sri, et al.
Published: (2025)

Interfacing with history: Curating with audio augmented objects
by: Cliffe, Laurence
Published: (2024)