Saved in:
| Main Authors: | Li, Sifei, Li, Yang, Wang, Zizhou, Zhang, Yuxin, Wu, Fuzhang, Deussen, Oliver, Lee, Tong-Yee, Dong, Weiming |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.19976 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Dance-to-Music Generation with Encoder-based Textual Inversion
by: Li, Sifei, et al.
Published: (2024)
by: Li, Sifei, et al.
Published: (2024)
SongSong: A Time Phonograph for Chinese SongCi Music from Thousand of Years Away
by: Li, Jiajia, et al.
Published: (2026)
by: Li, Jiajia, et al.
Published: (2026)
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
by: Liu, Zihan, et al.
Published: (2025)
by: Liu, Zihan, et al.
Published: (2025)
SongCreator: Lyrics-based Universal Song Generation
by: Lei, Shun, et al.
Published: (2024)
by: Lei, Shun, et al.
Published: (2024)
Self-Reasoning Agentic Framework for Narrative Product Grid-Collage Generation
by: Luo, Minyan, et al.
Published: (2026)
by: Luo, Minyan, et al.
Published: (2026)
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
by: Yang, Chenyu, et al.
Published: (2024)
by: Yang, Chenyu, et al.
Published: (2024)
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition
by: Ding, Shuangrui, et al.
Published: (2024)
by: Ding, Shuangrui, et al.
Published: (2024)
VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features
by: Li, Sifei, et al.
Published: (2024)
by: Li, Sifei, et al.
Published: (2024)
SegTune: Structured and Fine-Grained Control for Song Generation
by: Cai, Pengfei, et al.
Published: (2025)
by: Cai, Pengfei, et al.
Published: (2025)
SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment
by: Wu, Dapeng, et al.
Published: (2026)
by: Wu, Dapeng, et al.
Published: (2026)
Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
by: Jiang, Changhao, et al.
Published: (2026)
by: Jiang, Changhao, et al.
Published: (2026)
Versatile Framework for Song Generation with Prompt-based Control
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
IdolSongsJp Corpus: A Multi-Singer Song Corpus in the Style of Japanese Idol Groups
by: Suda, Hitoshi, et al.
Published: (2025)
by: Suda, Hitoshi, et al.
Published: (2025)
Towards Hallucination-Free Music: A Reinforcement Learning Preference Optimization Framework for Reliable Song Generation
by: Zhang, Huaicheng, et al.
Published: (2025)
by: Zhang, Huaicheng, et al.
Published: (2025)
MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline
by: Tsai, Fang-Duo, et al.
Published: (2026)
by: Tsai, Fang-Duo, et al.
Published: (2026)
Analyzing Pitch Content in Traditional Ghanaian Seperewa Songs
by: Walls, Kelvin L, et al.
Published: (2024)
by: Walls, Kelvin L, et al.
Published: (2024)
Music Style Transfer with Time-Varying Inversion of Diffusion Models
by: Li, Sifei, et al.
Published: (2024)
by: Li, Sifei, et al.
Published: (2024)
AI-Generated Song Detection via Lyrics Transcripts
by: Frohmann, Markus, et al.
Published: (2025)
by: Frohmann, Markus, et al.
Published: (2025)
SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers
by: Chen, Guangke, et al.
Published: (2024)
by: Chen, Guangke, et al.
Published: (2024)
The Florence Price Art Song Dataset and Piano Accompaniment Generator
by: He, Tao-Tao, et al.
Published: (2025)
by: He, Tao-Tao, et al.
Published: (2025)
An End-to-End Approach for Chord-Conditioned Song Generation
by: Gao, Shuochen, et al.
Published: (2024)
by: Gao, Shuochen, et al.
Published: (2024)
LeVo: High-Quality Song Generation with Multi-Preference Alignment
by: Lei, Shun, et al.
Published: (2025)
by: Lei, Shun, et al.
Published: (2025)
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
by: Liu, Renhang, et al.
Published: (2025)
by: Liu, Renhang, et al.
Published: (2025)
Come Together: Analyzing Popular Songs Through Statistical Embeddings
by: Mallory, Matthew Esmaili, et al.
Published: (2026)
by: Mallory, Matthew Esmaili, et al.
Published: (2026)
PhraseVAE and PhraseLDM: Latent Diffusion for Full-Song Multitrack Symbolic Music Generation
by: Ou, Longshen, et al.
Published: (2025)
by: Ou, Longshen, et al.
Published: (2025)
The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge
by: Ma, Guobin, et al.
Published: (2026)
by: Ma, Guobin, et al.
Published: (2026)
DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization
by: Chen, Huakang, et al.
Published: (2025)
by: Chen, Huakang, et al.
Published: (2025)
Segment-Factorized Full-Song Generation on Symbolic Piano Music
by: Chen, Ping-Yi, et al.
Published: (2025)
by: Chen, Ping-Yi, et al.
Published: (2025)
SongTrans: An unified song transcription and alignment method for lyrics and notes
by: Wu, Siwei, et al.
Published: (2024)
by: Wu, Siwei, et al.
Published: (2024)
EMO100DB: An Open Dataset of Improvised Songs with Emotion Data
by: Hwang, Daeun, et al.
Published: (2025)
by: Hwang, Daeun, et al.
Published: (2025)
SONICS: Synthetic Or Not -- Identifying Counterfeit Songs
by: Rahman, Md Awsafur, et al.
Published: (2024)
by: Rahman, Md Awsafur, et al.
Published: (2024)
Song Aesthetics Evaluation with Multi-Stem Attention and Hierarchical Uncertainty Modeling
by: Lv, Yishan, et al.
Published: (2026)
by: Lv, Yishan, et al.
Published: (2026)
FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs
by: Suda, Hitoshi, et al.
Published: (2024)
by: Suda, Hitoshi, et al.
Published: (2024)
CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions
by: Kantarelis, Spyridon, et al.
Published: (2024)
by: Kantarelis, Spyridon, et al.
Published: (2024)
SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription
by: Tan, Wei, et al.
Published: (2025)
by: Tan, Wei, et al.
Published: (2025)
Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models
by: Wang, Ziyu, et al.
Published: (2024)
by: Wang, Ziyu, et al.
Published: (2024)
Via Score to Performance: Efficient Human-Controllable Long Song Generation with Bar-Level Symbolic Notation
by: Wang, Tongxi, et al.
Published: (2025)
by: Wang, Tongxi, et al.
Published: (2025)
SongGLM: Lyric-to-Melody Generation with 2D Alignment Encoding and Multi-Task Pre-Training
by: Yu, Jiaxing, et al.
Published: (2024)
by: Yu, Jiaxing, et al.
Published: (2024)
Automatic Live Music Song Identification Using Multi-level Deep Sequence Similarity Learning
by: Hakala, Aapo, et al.
Published: (2025)
by: Hakala, Aapo, et al.
Published: (2025)
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
by: Li, Sifei, et al.
Published: (2025)
by: Li, Sifei, et al.
Published: (2025)
Similar Items
-
Dance-to-Music Generation with Encoder-based Textual Inversion
by: Li, Sifei, et al.
Published: (2024) -
SongSong: A Time Phonograph for Chinese SongCi Music from Thousand of Years Away
by: Li, Jiajia, et al.
Published: (2026) -
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
by: Liu, Zihan, et al.
Published: (2025) -
SongCreator: Lyrics-based Universal Song Generation
by: Lei, Shun, et al.
Published: (2024) -
Self-Reasoning Agentic Framework for Narrative Product Grid-Collage Generation
by: Luo, Minyan, et al.
Published: (2026)