Saved in:
Bibliographic Details
Main Authors: Wang, Hongrui, Zhang, Fan, Yu, Zhiyuan, Zhou, Ziya, Chen, Xi, Yang, Can, Wang, Yang
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.01101
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912934683213824
author Wang, Hongrui
Zhang, Fan
Yu, Zhiyuan
Zhou, Ziya
Chen, Xi
Yang, Can
Wang, Yang
author_facet Wang, Hongrui
Zhang, Fan
Yu, Zhiyuan
Zhou, Ziya
Chen, Xi
Yang, Can
Wang, Yang
contents Multi-track music generation has garnered significant research interest due to its precise mixing and remixing capabilities. However, existing models often overlook essential attributes such as rhythmic stability and synchronization, leading to a focus on differences between tracks rather than their inherent properties. In this paper, we introduce SyncTrack, a synchronous multi-track waveform music generation model designed to capture the unique characteristics of multi-track music. SyncTrack features a novel architecture that includes track-shared modules to establish a common rhythm across all tracks and track-specific modules to accommodate diverse timbres and pitch ranges. Each track-shared module employs two cross-track attention mechanisms to synchronize rhythmic information, while each track-specific module utilizes learnable instrument priors to better represent timbre and other unique features. Additionally, we enhance the evaluation of multi-track music quality by introducing rhythmic consistency through three novel metrics: Inner-track Rhythmic Stability (IRS), Cross-track Beat Synchronization (CBS), and Cross-track Beat Dispersion (CBD). Experiments demonstrate that SyncTrack significantly improves the multi-track music quality by enhancing rhythmic consistency.
format Preprint
id arxiv_https___arxiv_org_abs_2603_01101
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle SyncTrack: Rhythmic Stability and Synchronization in Multi-Track Music Generation
Wang, Hongrui
Zhang, Fan
Yu, Zhiyuan
Zhou, Ziya
Chen, Xi
Yang, Can
Wang, Yang
Sound
Artificial Intelligence
Multi-track music generation has garnered significant research interest due to its precise mixing and remixing capabilities. However, existing models often overlook essential attributes such as rhythmic stability and synchronization, leading to a focus on differences between tracks rather than their inherent properties. In this paper, we introduce SyncTrack, a synchronous multi-track waveform music generation model designed to capture the unique characteristics of multi-track music. SyncTrack features a novel architecture that includes track-shared modules to establish a common rhythm across all tracks and track-specific modules to accommodate diverse timbres and pitch ranges. Each track-shared module employs two cross-track attention mechanisms to synchronize rhythmic information, while each track-specific module utilizes learnable instrument priors to better represent timbre and other unique features. Additionally, we enhance the evaluation of multi-track music quality by introducing rhythmic consistency through three novel metrics: Inner-track Rhythmic Stability (IRS), Cross-track Beat Synchronization (CBS), and Cross-track Beat Dispersion (CBD). Experiments demonstrate that SyncTrack significantly improves the multi-track music quality by enhancing rhythmic consistency.
title SyncTrack: Rhythmic Stability and Synchronization in Multi-Track Music Generation
topic Sound
Artificial Intelligence
url https://arxiv.org/abs/2603.01101