| Main Authors: | Serrà, Joan, Goswami, Dipam, Morreale, Fabio, Liao, Wei-Hsiang, Mitsufuji, Yuki |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.17938 |
Similar Items
Emergent, not Immanent: A Baradian Reading of Explainable AI
by: Morreale, Fabio, et al.
Published: (2026)
Attribution-by-design: Ensuring Inference-Time Provenance in Generative Music Systems
by: Morreale, Fabio, et al.
Published: (2025)
Automatic Music Sample Identification with Multi-Track Contrastive Learning
by: Riou, Alain, et al.
Published: (2025)
A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?
by: Özer, Yigitcan, et al.
Published: (2025)
Leveraging Whisper Embeddings for Audio-based Lyrics Matching
by: Mancini, Eleonora, et al.
Published: (2025)
Supervised contrastive learning from weakly-labeled audio segments for musical version matching
by: Serrà, Joan, et al.
Published: (2025)
Provable unlearning in topic modeling and downstream tasks
by: Wei, Stanley, et al.
Published: (2024)
Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
by: Batlle-Roca, Roser, et al.
Published: (2024)
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage
by: Tan, Hao Hao, et al.
Published: (2024)
Applying sparse autoencoders to unlearn knowledge in language models
by: Farrell, Eoin, et al.
Published: (2024)
Machine unlearning through fine-grained model parameters perturbation
by: Zuo, Zhiwei, et al.
Published: (2024)
GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning
by: Murata, Naoki, et al.
Published: (2026)
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher
by: Kim, Dongjun, et al.
Published: (2024)
Woosh: A Sound Effects Foundation Model
by: Hadjeres, Gaëtan, et al.
Published: (2026)
CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models
by: Hu, Zheyuan, et al.
Published: (2025)
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
by: Kim, Dongjun, et al.
Published: (2023)
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
by: Zhang, Yixiao, et al.
Published: (2024)
Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution
by: Park, Yonghyun, et al.
Published: (2025)
Opening Musical Creativity? Embedded Ideologies in Generative-AI Music Systems
by: Pram, Liam, et al.
Published: (2025)
On the limitation of evaluating machine unlearning using only a single training seed
by: Lanyon, Jamie, et al.
Published: (2025)
Distill, Forget, Repeat: A Framework for Continual Unlearning in Text-to-Image Diffusion Models
by: George, Naveen, et al.
Published: (2025)
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
by: Takida, Yuhta, et al.
Published: (2023)
Understanding and Accelerating the Training of Masked Diffusion Language Models
by: Hong, Chunsan, et al.
Published: (2026)
It's All About Speed: AI's Impact on Workflow in Music Production
by: McClellan, Finn, et al.
Published: (2026)
Edge-preserving noise for diffusion models
by: Vandersanden, Jente, et al.
Published: (2024)
The Principles of Diffusion Models
by: Lai, Chieh-Hsin, et al.
Published: (2025)
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
by: Hiranaka, Ayano, et al.
Published: (2024)
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
by: Luo, Yin-Jyun, et al.
Published: (2024)
Denoising Multi-Beta VAE: Representation Learning for Disentanglement and Generation
by: Uppal, Anshuk, et al.
Published: (2025)
Improving Classifier-Free Guidance in Masked Diffusion: Low-Dim Theoretical Insights with High-Dim Impact
by: Rojas, Kevin, et al.
Published: (2025)
Routing without Forgetting
by: Masano, Alessio, et al.
Published: (2026)
Step-resolved data attribution for looped transformers
by: Kaissis, Georgios, et al.
Published: (2026)
Cross-Modal Learning for Music-to-Music-Video Description Generation
by: Mao, Zhuoyuan, et al.
Published: (2025)
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
by: Zhang, Yixiao, et al.
Published: (2024)
ROCM: RLHF on consistency models
by: Shekhar, Shivanshu, et al.
Published: (2025)
Federated Learning for distribution skewed data using sample weights
by: Nguyen, Hung, et al.
Published: (2024)
Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models
by: Merizzi, Fabio, et al.
Published: (2024)
MeanFlow Transformers with Representation Autoencoders
by: Hu, Zheyuan, et al.
Published: (2025)
SpecMaskFoley: Steering Pretrained Spectral Masked Generative Transformer Toward Synchronized Video-to-audio Synthesis via ControlNet
by: Zhong, Zhi, et al.
Published: (2025)
Noise Scheduling as Information-Guided Allocation in Diffusion Training
by: Raya, Gabriel, et al.
Published: (2026)