:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Vuilliomenet, Aude, Balvanera, Santiago Martínez, Mac Aodha, Oisin, Jones, Kate E., Wilson, Duncan
Format:	Preprint
Published:	2025
Subjects:	Sound Machine Learning Audio and Speech Processing H.5.5; I.2.m; J.3
Online Access:	https://arxiv.org/abs/2501.17841
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Dichotic harmony for the musical practice
by: Madgazin, Vadim R.
Published: (2010)

Compositional Phoneme Approximation for L1-Grounded L2 Pronunciation Training
by: Park, Jisang, et al.
Published: (2024)

Generation of Musical Timbres using a Text-Guided Diffusion Model
by: Yuan, Weixuan, et al.
Published: (2025)

Self-Improvement for Audio Large Language Model using Unlabeled Speech
by: Wang, Shaowen, et al.
Published: (2025)

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion
by: Li, Pengcheng, et al.
Published: (2024)

OBHS: An Optimized Block Huffman Scheme for Real-Time Audio Compression
by: Mahfi, Muntahi Safwan, et al.
Published: (2025)

Embodied Exploration of Latent Spaces and Explainable AI
by: Wilson, Elizabeth, et al.
Published: (2024)

Audio Foundation Models Outperform Symbolic Representations for Piano Performance Evaluation
by: Dhiman, Jai
Published: (2026)

Can pre-trained Deep Learning models predict groove ratings?
by: Marmoret, Axel, et al.
Published: (2026)

Acoustic Wave Modeling Using 2D FDTD: Applications in Unreal Engine For Dynamic Sound Rendering
by: Samsurya, Bilkent
Published: (2025)

AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
by: Woodard, Brandon, et al.
Published: (2025)

Quantum-Enhanced Analysis and Grading of Vocal Performance
by: Agarwal, Rohan
Published: (2025)

ParaNoise-SV: Integrated Approach for Noise-Robust Speaker Verification with Parallel Joint Learning of Speech Enhancement and Noise Extraction
by: Kim, Minu, et al.
Published: (2025)

SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement
by: Chen, Kuan-Yu, et al.
Published: (2025)

Less Stress, More Privacy: Stress Detection on Anonymized Speech of Air Traffic Controllers
by: Viswanathan, Janaki, et al.
Published: (2025)

SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution
by: Donepudi, Dharma Teja
Published: (2025)

REMAST: Real-time Emotion-based Music Arrangement with Soft Transition
by: Wang, Zihao, et al.
Published: (2023)

GraFPrint: A GNN-Based Approach for Audio Identification
by: Bhattacharjee, Aditya, et al.
Published: (2024)

Two Sonification Methods for the MindCube
by: Liu, Fangzheng, et al.
Published: (2025)

AI Harmonizer: Expanding Vocal Expression with a Generative Neurosymbolic Music AI System
by: Blanchard, Lancelot, et al.
Published: (2025)

Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation
by: Bhattacharjee, Aditya, et al.
Published: (2025)

Taming Audio VAEs via Target-KL Regularization
by: Seetharaman, Prem, et al.
Published: (2026)

Real-Time Emergency Vehicle Detection using Mel Spectrograms and Regular Expressions
by: Pacheco-Gonzalez, Alberto, et al.
Published: (2023)

PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation
by: Yi, Yungang, et al.
Published: (2024)

Neural Proxies for Sound Synthesizers: Learning Perceptually Informed Preset Representations
by: Combes, Paolo, et al.
Published: (2025)

Auptimize: Optimal Placement of Spatial Audio Cues for Extended Reality
by: Cho, Hyunsung, et al.
Published: (2024)

Machine Learning Framework for Audio-Based Content Evaluation using MFCC, Chroma, Spectral Contrast, and Temporal Feature Engineering
by: Aristorenas, Aris J.
Published: (2024)

The evolution of inharmonicity and noisiness in contemporary popular music
by: Deruty, Emmanuel, et al.
Published: (2024)

Revisiting SSL for sound event detection: complementary fusion and adaptive post-processing
by: Cui, Hanfang, et al.
Published: (2025)

Joint Estimation of Piano Dynamics and Metrical Structure with a Multi-task Multi-Scale Network
by: He, Zhanhong, et al.
Published: (2025)

Dereverberation Using Binary Residual Masking with Time-Domain Consistency
by: Williams, Daniel G.
Published: (2025)

Window Size Versus Accuracy Experiments in Voice Activity Detectors
by: McKinnon, Max, et al.
Published: (2026)

Improving Cross-Lingual Phonetic Representation of Low-Resource Languages Through Language Similarity Analysis
by: Kim, Minu, et al.
Published: (2025)

Reciprocal Latent Fields for Precomputed Sound Propagation
by: Seuté, Hugo, et al.
Published: (2026)

Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond
by: Richter-Powell, Jessie, et al.
Published: (2025)

Enhanced DareFightingICE Competitions: Sound Design and AI Competitions
by: Khan, Ibrahim, et al.
Published: (2024)

A Framework for Multimodal Medical Image Interaction
by: Schütz, Laura, et al.
Published: (2024)

STRUM: A Spectral Transcription and Rhythm Understanding Model for End-to-End Generation of Playable Rhythm-Game Charts
by: Opria, Joshua
Published: (2026)

Adaptable Symbolic Music Infilling with MIDI-RWKV
by: Zhou-Zheng, Christian, et al.
Published: (2025)

Understanding the Algorithm Behind Audio Key Detection
by: Silva, Henrique Perez G.
Published: (2025)