:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Park, Jiyun, Cancino-Chacón, Carlos, Chiruthapudi, Suhit, Nam, Juhan
Format:	Preprint
Published:	2025
Subjects:	Sound
Online Access:	https://arxiv.org/abs/2510.10087
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Musically Informed Evaluation of Piano Transcription Models
by: Hu, Patricia, et al.
Published: (2024)

A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance
by: Park, Jiyun, et al.
Published: (2024)

Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models
by: Kwon, Taegyun, et al.
Published: (2024)

Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System
by: Bang, Hayeon, et al.
Published: (2025)

PianoVAM: A Multimodal Piano Performance Dataset
by: Kim, Yonghyun, et al.
Published: (2025)

PianoBind: A Multimodal Joint Embedding Model for Pop-piano Music
by: Bang, Hayeon, et al.
Published: (2025)

D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription
by: Kim, Hounsu, et al.
Published: (2025)

PiAnnotate: A Web Annotation Tool for Piano Fingering, with a Diagnostic Probe
by: Bae, Joonhyung, et al.
Published: (2026)

D3PIA: A Discrete Denoising Diffusion Model for Piano Accompaniment Generation From Lead sheet
by: Choi, Eunjin, et al.
Published: (2026)

Pairing Real-Time Piano Transcription with Symbol-level Tracking for Precise and Robust Score Following
by: Peter, Silvan, et al.
Published: (2025)

Two Web Toolkits for Multimodal Piano Performance Dataset Acquisition and Fingering Annotation
by: Park, Junhyung, et al.
Published: (2025)

PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
by: Bang, Hayeon, et al.
Published: (2024)

FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
by: Im, Jaekwon, et al.
Published: (2025)

DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
by: Im, Jaekwon, et al.
Published: (2024)

Difficulty-Aware Score Generation for Piano Sight-Reading
by: Ramoneda, Pedro, et al.
Published: (2025)

Sounding Out Reconstruction Error-Based Evaluation of Generative Models of Expressive Performance
by: Peter, Silvan David, et al.
Published: (2023)

WaveRoll: JavaScript Library for Comparative MIDI Piano-Roll Visualization
by: Park, Hannah, et al.
Published: (2025)

Motive-level Analysis of Form-functions Association in Korean Folk song
by: Han, Danbinaerin, et al.
Published: (2025)

On Every Note a Griff: Looking for a Useful Representation of Basso Continuo Performance Style
by: Štefunko, Adam, et al.
Published: (2026)

Difficulty-Controlled Simplification of Piano Scores with Synthetic Data for Inclusive Music Education
by: Ramoneda, Pedro, et al.
Published: (2025)

End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)

AImoclips: A Benchmark for Evaluating Emotion Conveyance in Text-to-Music Generation
by: Go, Gyehun, et al.
Published: (2025)

TALKPLAY: Multimodal Music Recommendation with Large Language Models
by: Doh, Seungheon, et al.
Published: (2025)

PBSCR: The Piano Bootleg Score Composer Recognition Dataset
by: Jain, Arhan, et al.
Published: (2024)

TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling
by: Doh, Seungheon, et al.
Published: (2025)

End-to-end Piano Performance-MIDI to Score Conversion with Transformers
by: Beyer, Tim, et al.
Published: (2024)

Towards An Integrated Approach for Expressive Piano Performance Synthesis from Music Scores
by: Tang, Jingjing, et al.
Published: (2025)

Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges
by: Peeters, Geoffroy, et al.
Published: (2025)

PianoCoRe: Combined and Refined Piano MIDI Dataset
by: Borovik, Ilya
Published: (2026)

TalkPlayData 2: An Agentic Synthetic Data Pipeline for Multimodal Conversational Music Recommendation
by: Choi, Keunwoo, et al.
Published: (2025)

Musical Word Embedding for Music Tagging and Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)

CONMOD: Controllable Neural Frame-based Modulation Effects
by: Lee, Gyubin, et al.
Published: (2024)

T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis
by: Chung, Yoonjin, et al.
Published: (2024)

Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting
by: Kim, Hounsu, et al.
Published: (2024)

Predicting User Intents and Musical Attributes from Music Discovery Conversations
by: Kwon, Daeyong, et al.
Published: (2024)

Disentangling Score Content and Performance Style for Joint Piano Rendering and Transcription
by: Zeng, Wei, et al.
Published: (2025)

Scoring Time Intervals using Non-Hierarchical Transformer For Automatic Piano Transcription
by: Yan, Yujia, et al.
Published: (2024)

A Holistic Evaluation of Piano Sound Quality
by: Zhou, Monan, et al.
Published: (2023)

Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2025)

CounterFlow: A Two-Phase Inference-Time Sampling for Counterfactual Video Foley Generation
by: Lee, Gyubin, et al.
Published: (2026)