| Main Authors: | Han, Danbinaerin, Jeong, Dasaem, Nam, Juhan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.10472 |
Similar Items
Towards Computational Analysis of Pansori Singing
by: Park, Sangheon, et al.
Published: (2024)
Six Dragons Fly Again: Reviving 15th-Century Korean Court Music with Transformers and Novel Encoding
by: Han, Danbinaerin, et al.
Published: (2024)
Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models
by: Kwon, Taegyun, et al.
Published: (2024)
Musical Word Embedding for Music Tagging and Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)
On the de-duplication of the Lakh MIDI dataset
by: Choi, Eunjin, et al.
Published: (2025)
WaveRoll: JavaScript Library for Comparative MIDI Piano-Roll Visualization
by: Park, Hannah, et al.
Published: (2025)
K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling
by: Kim, Haven, et al.
Published: (2023)
LAV: Audio-Driven Dynamic Visual Generation with Neural Compression and StyleGAN2
by: Jung, Jongmin, et al.
Published: (2025)
Difficulty-Controlled Simplification of Piano Scores with Synthetic Data for Inclusive Music Education
by: Ramoneda, Pedro, et al.
Published: (2025)
Boundary Regression for Leitmotif Detection in Music Audio
by: Lee, Sihun, et al.
Published: (2025)
FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
by: Im, Jaekwon, et al.
Published: (2025)
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
by: Im, Jaekwon, et al.
Published: (2024)
Predicting User Intents and Musical Attributes from Music Discovery Conversations
by: Kwon, Daeyong, et al.
Published: (2024)
MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI
by: Jung, Jongmin, et al.
Published: (2024)
Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2025)
CounterFlow: A Two-Phase Inference-Time Sampling for Counterfactual Video Foley Generation
by: Lee, Gyubin, et al.
Published: (2026)
When Pamplona sounds different: the soundscape transformation of San Fermin through intelligent acoustic sensors and a sound repository
by: Sagasti, Amaia, et al.
Published: (2025)
Ethics Statements in AI Music Papers: The Effective and the Ineffective
by: Barnett, Julia, et al.
Published: (2025)
AImoclips: A Benchmark for Evaluating Emotion Conveyance in Text-to-Music Generation
by: Go, Gyehun, et al.
Published: (2025)
Matchmaker: An Open-source Library for Real-time Piano Score Following and Systematic Evaluation
by: Park, Jiyun, et al.
Published: (2025)
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning
by: Kim, Daewoong, et al.
Published: (2024)
ARCADE: A City-Scale Corpus for Fine-Grained Arabic Dialect Tagging
by: Nacar, Omer, et al.
Published: (2026)
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
by: Lee, Junwon, et al.
Published: (2024)
Automatic Speech Recognition Biases in Newcastle English: an Error Analysis
by: Serditova, Dana, et al.
Published: (2025)
Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System
by: Bang, Hayeon, et al.
Published: (2025)
TALKPLAY: Multimodal Music Recommendation with Large Language Models
by: Doh, Seungheon, et al.
Published: (2025)
Abusive music and song transformation using GenAI and LLMs
by: Choi, Jiyang, et al.
Published: (2026)
TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling
by: Doh, Seungheon, et al.
Published: (2025)
PianoBind: A Multimodal Joint Embedding Model for Pop-piano Music
by: Bang, Hayeon, et al.
Published: (2025)
An investigation of AI integration in sound designer workflows and experiences
by: Garcia, Nelly, et al.
Published: (2026)
Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
by: Ramoneda, Pedro, et al.
Published: (2024)
Decoding Musical Evolution Through Network Science
by: Di Marco, Niccolo', et al.
Published: (2025)
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription
by: Kim, Hounsu, et al.
Published: (2025)
Instantaneous Spectra Analysis of Pulse Series -- Application to Lung Sounds with Abnormalities
by: Ishiyama, Fumihiko
Published: (2026)
Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation
by: Yoo, HaeJun, et al.
Published: (2024)
TalkPlayData 2: An Agentic Synthetic Data Pipeline for Multimodal Conversational Music Recommendation
by: Choi, Keunwoo, et al.
Published: (2025)
Music of Changing Lines: Toward a Culturally Situated Approach to the I-Ching
by: Qi, Ling, et al.
Published: (2026)
Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios
by: Li, Yuanchao, et al.
Published: (2024)
Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models
by: Doh, SeungHeon, et al.
Published: (2024)