:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Han, Danbinaerin, Jeong, Dasaem, Nam, Juhan
Format:	Preprint
Published:	2025
Subjects:	Sound Computers and Society
Online Access:	https://arxiv.org/abs/2508.10472
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Computational Analysis of Pansori Singing
by: Park, Sangheon, et al.
Published: (2024)

Six Dragons Fly Again: Reviving 15th-Century Korean Court Music with Transformers and Novel Encoding
by: Han, Danbinaerin, et al.
Published: (2024)

Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models
by: Kwon, Taegyun, et al.
Published: (2024)

Musical Word Embedding for Music Tagging and Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)

On the de-duplication of the Lakh MIDI dataset
by: Choi, Eunjin, et al.
Published: (2025)

WaveRoll: JavaScript Library for Comparative MIDI Piano-Roll Visualization
by: Park, Hannah, et al.
Published: (2025)

K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling
by: Kim, Haven, et al.
Published: (2023)

LAV: Audio-Driven Dynamic Visual Generation with Neural Compression and StyleGAN2
by: Jung, Jongmin, et al.
Published: (2025)

Difficulty-Controlled Simplification of Piano Scores with Synthetic Data for Inclusive Music Education
by: Ramoneda, Pedro, et al.
Published: (2025)

Boundary Regression for Leitmotif Detection in Music Audio
by: Lee, Sihun, et al.
Published: (2025)

FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
by: Im, Jaekwon, et al.
Published: (2025)

DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
by: Im, Jaekwon, et al.
Published: (2024)

Predicting User Intents and Musical Attributes from Music Discovery Conversations
by: Kwon, Daeyong, et al.
Published: (2024)

MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI
by: Jung, Jongmin, et al.
Published: (2024)

Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2025)

CounterFlow: A Two-Phase Inference-Time Sampling for Counterfactual Video Foley Generation
by: Lee, Gyubin, et al.
Published: (2026)

When Pamplona sounds different: the soundscape transformation of San Fermin through intelligent acoustic sensors and a sound repository
by: Sagasti, Amaia, et al.
Published: (2025)

Ethics Statements in AI Music Papers: The Effective and the Ineffective
by: Barnett, Julia, et al.
Published: (2025)

AImoclips: A Benchmark for Evaluating Emotion Conveyance in Text-to-Music Generation
by: Go, Gyehun, et al.
Published: (2025)

Matchmaker: An Open-source Library for Real-time Piano Score Following and Systematic Evaluation
by: Park, Jiyun, et al.
Published: (2025)

ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning
by: Kim, Daewoong, et al.
Published: (2024)

ARCADE: A City-Scale Corpus for Fine-Grained Arabic Dialect Tagging
by: Nacar, Omer, et al.
Published: (2026)

Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
by: Lee, Junwon, et al.
Published: (2024)

Automatic Speech Recognition Biases in Newcastle English: an Error Analysis
by: Serditova, Dana, et al.
Published: (2025)

Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System
by: Bang, Hayeon, et al.
Published: (2025)

TALKPLAY: Multimodal Music Recommendation with Large Language Models
by: Doh, Seungheon, et al.
Published: (2025)

Abusive music and song transformation using GenAI and LLMs
by: Choi, Jiyang, et al.
Published: (2026)

TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling
by: Doh, Seungheon, et al.
Published: (2025)

PianoBind: A Multimodal Joint Embedding Model for Pop-piano Music
by: Bang, Hayeon, et al.
Published: (2025)

An investigation of AI integration in sound designer workflows and experiences
by: Garcia, Nelly, et al.
Published: (2026)

Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
by: Ramoneda, Pedro, et al.
Published: (2024)

Decoding Musical Evolution Through Network Science
by: Di Marco, Niccolo', et al.
Published: (2025)

D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription
by: Kim, Hounsu, et al.
Published: (2025)

Instantaneous Spectra Analysis of Pulse Series -- Application to Lung Sounds with Abnormalities
by: Ishiyama, Fumihiko
Published: (2026)

Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation
by: Yoo, HaeJun, et al.
Published: (2024)

TalkPlayData 2: An Agentic Synthetic Data Pipeline for Multimodal Conversational Music Recommendation
by: Choi, Keunwoo, et al.
Published: (2025)

Music of Changing Lines: Toward a Culturally Situated Approach to the I-Ching
by: Qi, Ling, et al.
Published: (2026)

Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios
by: Li, Yuanchao, et al.
Published: (2024)

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models
by: Doh, SeungHeon, et al.
Published: (2024)