:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Serrà, Joan, Goswami, Dipam, Morreale, Fabio, Liao, Wei-Hsiang, Mitsufuji, Yuki
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.17938
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Emergent, not Immanent: A Baradian Reading of Explainable AI
by: Morreale, Fabio, et al.
Published: (2026)

Attribution-by-design: Ensuring Inference-Time Provenance in Generative Music Systems
by: Morreale, Fabio, et al.
Published: (2025)

Automatic Music Sample Identification with Multi-Track Contrastive Learning
by: Riou, Alain, et al.
Published: (2025)

A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?
by: Özer, Yigitcan, et al.
Published: (2025)

Leveraging Whisper Embeddings for Audio-based Lyrics Matching
by: Mancini, Eleonora, et al.
Published: (2025)

Supervised contrastive learning from weakly-labeled audio segments for musical version matching
by: Serrà, Joan, et al.
Published: (2025)

Provable unlearning in topic modeling and downstream tasks
by: Wei, Stanley, et al.
Published: (2024)

Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
by: Batlle-Roca, Roser, et al.
Published: (2024)

MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage
by: Tan, Hao Hao, et al.
Published: (2024)

Applying sparse autoencoders to unlearn knowledge in language models
by: Farrell, Eoin, et al.
Published: (2024)

Machine unlearning through fine-grained model parameters perturbation
by: Zuo, Zhiwei, et al.
Published: (2024)

GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning
by: Murata, Naoki, et al.
Published: (2026)

PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher
by: Kim, Dongjun, et al.
Published: (2024)

Woosh: A Sound Effects Foundation Model
by: Hadjeres, Gaëtan, et al.
Published: (2026)

CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models
by: Hu, Zheyuan, et al.
Published: (2025)

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
by: Kim, Dongjun, et al.
Published: (2023)

Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
by: Zhang, Yixiao, et al.
Published: (2024)

Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution
by: Park, Yonghyun, et al.
Published: (2025)

Opening Musical Creativity? Embedded Ideologies in Generative-AI Music Systems
by: Pram, Liam, et al.
Published: (2025)

On the limitation of evaluating machine unlearning using only a single training seed
by: Lanyon, Jamie, et al.
Published: (2025)

Distill, Forget, Repeat: A Framework for Continual Unlearning in Text-to-Image Diffusion Models
by: George, Naveen, et al.
Published: (2025)

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
by: Takida, Yuhta, et al.
Published: (2023)

Understanding and Accelerating the Training of Masked Diffusion Language Models
by: Hong, Chunsan, et al.
Published: (2026)

It`s All About Speed: AI`s Impact on Workflow in Music Production
by: McClellan, Finn, et al.
Published: (2026)

Edge-preserving noise for diffusion models
by: Vandersanden, Jente, et al.
Published: (2024)

The Principles of Diffusion Models
by: Lai, Chieh-Hsin, et al.
Published: (2025)

HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
by: Hiranaka, Ayano, et al.
Published: (2024)

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
by: Luo, Yin-Jyun, et al.
Published: (2024)

Denoising Multi-Beta VAE: Representation Learning for Disentanglement and Generation
by: Uppal, Anshuk, et al.
Published: (2025)

Improving Classifier-Free Guidance in Masked Diffusion: Low-Dim Theoretical Insights with High-Dim Impact
by: Rojas, Kevin, et al.
Published: (2025)

Routing without Forgetting
by: Masano, Alessio, et al.
Published: (2026)

Step-resolved data attribution for looped transformers
by: Kaissis, Georgios, et al.
Published: (2026)

Cross-Modal Learning for Music-to-Music-Video Description Generation
by: Mao, Zhuoyuan, et al.
Published: (2025)

MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
by: Zhang, Yixiao, et al.
Published: (2024)

ROCM: RLHF on consistency models
by: Shekhar, Shivanshu, et al.
Published: (2025)

Federated Learning for distribution skewed data using sample weights
by: Nguyen, Hung, et al.
Published: (2024)

Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models
by: Merizzi, Fabio, et al.
Published: (2024)

MeanFlow Transformers with Representation Autoencoders
by: Hu, Zheyuan, et al.
Published: (2025)

SpecMaskFoley: Steering Pretrained Spectral Masked Generative Transformer Toward Synchronized Video-to-audio Synthesis via ControlNet
by: Zhong, Zhi, et al.
Published: (2025)

Noise Scheduling as Information-Guided Allocation in Diffusion Training
by: Raya, Gabriel, et al.
Published: (2026)