:: Library Catalog

Bibliographic Details
Main Authors:	Wang, Tongxi; Yu, Yang; Wang, Qing; Qian, Junlang
Format:	Preprint
Published:	2025
Subjects:	Sound; Artificial Intelligence; Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2508.01394

Similar Items

Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation
by: Lin, Zhiwei, et al.
Published: (2024)

MuseBarControl: Enhancing Fine-Grained Control in Symbolic Music Generation through Pre-Training and Counterfactual Loss
by: Shu, Yangyang, et al.
Published: (2024)

SongCreator: Lyrics-based Universal Song Generation
by: Lei, Shun, et al.
Published: (2024)

CSL-L2M: Controllable Song-Level Lyric-to-Melody Generation Based on Conditional Transformer with Fine-Grained Lyric and Musical Controls
by: Chai, Li, et al.
Published: (2024)

LeVo: High-Quality Song Generation with Multi-Preference Alignment
by: Lei, Shun, et al.
Published: (2025)

YNote: A Novel Music Notation for Fine-Tuning LLMs in Music Generation
by: Lu, Shao-Chien, et al.
Published: (2025)

Segment-Factorized Full-Song Generation on Symbolic Piano Music
by: Chen, Ping-Yi, et al.
Published: (2025)

SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition
by: Ding, Shuangrui, et al.
Published: (2024)

SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment
by: Wu, Dapeng, et al.
Published: (2026)

SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
by: Yang, Chenyu, et al.
Published: (2024)

MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing
by: Wu, Shangda, et al.
Published: (2024)

Towards Hallucination-Free Music: A Reinforcement Learning Preference Optimization Framework for Reliable Song Generation
by: Zhang, Huaicheng, et al.
Published: (2025)

An End-to-End Approach for Chord-Conditioned Song Generation
by: Gao, Shuochen, et al.
Published: (2024)

Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models
by: Wang, Ziyu, et al.
Published: (2024)

DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization
by: Chen, Huakang, et al.
Published: (2025)

ONOTE: Benchmarking Omnimodal Notation Processing for Expert-level Music Intelligence
by: Ma, Menghe, et al.
Published: (2026)

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
by: Deng, Wei, et al.
Published: (2025)

Pianoroll-Event: A Novel Score Representation for Symbolic Music
by: Qian, Lekai, et al.
Published: (2026)

XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework
by: Tian, Sida, et al.
Published: (2025)

SongGLM: Lyric-to-Melody Generation with 2D Alignment Encoding and Multi-Task Pre-Training
by: Yu, Jiaxing, et al.
Published: (2024)

TOMI: Transforming and Organizing Music Ideas for Multi-Track Compositions with Full-Song Structure
by: He, Qi, et al.
Published: (2025)

EMelodyGen: Emotion-Conditioned Melody Generation in ABC Notation with the Musical Feature Template
by: Zhou, Monan, et al.
Published: (2023)

MuSpike: A Benchmark and Evaluation Framework for Symbolic Music Generation with Spiking Neural Networks
by: Liang, Qian, et al.
Published: (2025)

Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation
by: Zhang, Jincheng, et al.
Published: (2025)

Generating High-quality Symbolic Music Using Fine-grained Discriminators
by: Zhang, Zhedong, et al.
Published: (2024)

MFHCA: Enhancing Speech Emotion Recognition Via Multi-Spatial Fusion and Hierarchical Cooperative Attention
by: Jiao, Xinxin, et al.
Published: (2024)

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
by: Wang, Yashan, et al.
Published: (2025)

SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription
by: Tan, Wei, et al.
Published: (2025)

Versatile Framework for Song Generation with Prompt-based Control
by: Zhang, Yu, et al.
Published: (2025)

PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training
by: Liang, Xiao, et al.
Published: (2024)

MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline
by: Tsai, Fang-Duo, et al.
Published: (2026)

Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings
by: Rhyu, Seungyeon, et al.
Published: (2024)

ML-SAN: Multi-Level Speaker-Adaptive Network for Emotion Recognition in Conversations
by: Wang, Kexue, et al.
Published: (2026)

Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks
by: Salhab, Mahmoud, et al.
Published: (2024)

MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit
by: Wang, Yutian, et al.
Published: (2024)

Disentangling Score Content and Performance Style for Joint Piano Rendering and Transcription
by: Zeng, Wei, et al.
Published: (2025)

The Florence Price Art Song Dataset and Piano Accompaniment Generator
by: He, Tao-Tao, et al.
Published: (2025)

Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis
by: Hu, Xintong, et al.
Published: (2025)

MuseTok: Symbolic Music Tokenization for Generation and Semantic Understanding
by: Huang, Jingyue, et al.
Published: (2025)

MuPT: A Generative Symbolic Music Pretrained Transformer
by: Qu, Xingwei, et al.
Published: (2024)