Saved in:
Bibliographic Details
Main Author: Chawla, Varun
Format: Recurso digital
Language:
Published: Zenodo 2026
Online Access:https://doi.org/10.5281/zenodo.18811689
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • <p><span dir="ltr">Abstract—Text-to-music models today are trained almost </span><span dir="ltr">entirely on Western music. They do not understand ragas,</span><br><span dir="ltr">talas, or the ornamentation central to Indian traditions, and </span><span dir="ltr">they produce generic output when prompted for Indian genres.</span><br><span dir="ltr">This paper describes RagaLoRA, a LoRA adapter that tunes </span><span dir="ltr">ACE-Step 1.5’s Diffusion Transformer decoder to generate</span><br><span dir="ltr">Indian music across ten genres—Hindustani classical, Carnatic, </span><span dir="ltr">Bollywood, qawwali, ghazal, bhajan, and four more. The </span><span dir="ltr">adapter is rank-16, weighs 44MB, touches 11M of the model’s </span><span dir="ltr">2.4B parameters (0.46%), and trains on 250 balanced samples </span><span dir="ltr">drawn from a 2,787-segment corpus of Creative Commons </span><span dir="ltr">and public domain Indian recordings. In A/B tests against </span><span dir="ltr">the unmodified base model, the adapter shifts outputs in </span><span dir="ltr">genre-appropriate directions: classical and devotional genres </span><span dir="ltr">get darker and slower (spectral centroid drops up to 16%, </span><span dir="ltr">tempo drops up to 35%), while dance and rock genres get louder </span><span dir="ltr">(RMS energy rises up to 38%). Five of the ten evaluated genres </span><span dir="ltr">had zero dedicated training data, yet still show coherent shifts, </span><span dir="ltr">pointing to transfer across related Indian styles. No prior LoRA </span><span dir="ltr">adapter for a diffusion music model has targeted Indian music </span><span dir="ltr">at this breadth.</span></p> <p><br><span dir="ltr">Index Terms—music generation, Indian music, LoRA, diffu</span><span dir="ltr">sion transformer, raga, fine-tuning</span></p>