Saved in:
| Main Authors: | Liu, Yuxuan, Zhang, Peihong, Sang, Rui, Li, Zhixin, Tan, Yizhou, Cai, Yiqiang, Li, Shengchen |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01645 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025)
by: Zhang, Peihong, et al.
Published: (2025)
An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025)
by: Zhang, Peihong, et al.
Published: (2025)
TopSeg: A Multi-Scale Topological Framework for Data-Efficient Heart Sound Segmentation
by: Zhang, Peihong, et al.
Published: (2025)
by: Zhang, Peihong, et al.
Published: (2025)
Training a Perceptual Model for Evaluating Auditory Similarity in Music Adversarial Attack
by: Liu, Yuxuan, et al.
Published: (2025)
by: Liu, Yuxuan, et al.
Published: (2025)
MAIA: An Inpainting-Based Approach for Music Adversarial Attacks
by: Liu, Yuxuan, et al.
Published: (2025)
by: Liu, Yuxuan, et al.
Published: (2025)
TF-SepNet: An Efficient 1D Kernel Design in CNNs for Low-Complexity Acoustic Scene Classification
by: Cai, Yiqiang, et al.
Published: (2023)
by: Cai, Yiqiang, et al.
Published: (2023)
NMCSE: Noise-Robust Multi-Modal Coupling Signal Estimation Method via Optimal Transport for Cardiovascular Disease Detection
by: Zhang, Peihong, et al.
Published: (2025)
by: Zhang, Peihong, et al.
Published: (2025)
Improving Acoustic Scene Classification with City Features
by: Cai, Yiqiang, et al.
Published: (2025)
by: Cai, Yiqiang, et al.
Published: (2025)
Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification
by: Cai, Yiqiang, et al.
Published: (2024)
by: Cai, Yiqiang, et al.
Published: (2024)
Exploring Differences between Human Perception and Model Inference in Audio Event Recognition
by: Tan, Yizhou, et al.
Published: (2024)
by: Tan, Yizhou, et al.
Published: (2024)
Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems
by: Chen, Leduo, et al.
Published: (2026)
by: Chen, Leduo, et al.
Published: (2026)
SceneGuard: Training-Time Voice Protection with Scene-Consistent Audible Background Noise
by: Sang, Rui, et al.
Published: (2025)
by: Sang, Rui, et al.
Published: (2025)
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features
by: Teixeira, Francisco, et al.
Published: (2024)
by: Teixeira, Francisco, et al.
Published: (2024)
Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
by: Postolache, Emilian, et al.
Published: (2024)
by: Postolache, Emilian, et al.
Published: (2024)
Music Style Transfer With Diffusion Model
by: Huang, Hong, et al.
Published: (2024)
by: Huang, Hong, et al.
Published: (2024)
Membership Inference Attacks against Large Audio Language Models
by: Dong, Jia-Kai, et al.
Published: (2026)
by: Dong, Jia-Kai, et al.
Published: (2026)
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation
by: Novack, Zachary, et al.
Published: (2024)
by: Novack, Zachary, et al.
Published: (2024)
Versatile Symbolic Music-for-Music Modeling via Function Alignment
by: Jiang, Junyan, et al.
Published: (2025)
by: Jiang, Junyan, et al.
Published: (2025)
Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
by: Yang, Yi, et al.
Published: (2025)
by: Yang, Yi, et al.
Published: (2025)
DITTO: Diffusion Inference-Time T-Optimization for Music Generation
by: Novack, Zachary, et al.
Published: (2024)
by: Novack, Zachary, et al.
Published: (2024)
Diffusion-based Symbolic Music Generation with Structured State Space Models
by: Yuan, Shenghua, et al.
Published: (2025)
by: Yuan, Shenghua, et al.
Published: (2025)
Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation
by: Xu, Jinhan, et al.
Published: (2026)
by: Xu, Jinhan, et al.
Published: (2026)
De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks
by: Fan, Wei, et al.
Published: (2025)
by: Fan, Wei, et al.
Published: (2025)
MusicWeaver: Composer-Style Structural Editing and Minute-Scale Coherent Music Generation
by: Wang, Xuanchen, et al.
Published: (2025)
by: Wang, Xuanchen, et al.
Published: (2025)
Music Style Transfer with Time-Varying Inversion of Diffusion Models
by: Li, Sifei, et al.
Published: (2024)
by: Li, Sifei, et al.
Published: (2024)
Quality-aware Masked Diffusion Transformer for Enhanced Music Generation
by: Li, Chang, et al.
Published: (2024)
by: Li, Chang, et al.
Published: (2024)
Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation
by: Pina, Leonardo, et al.
Published: (2024)
by: Pina, Leonardo, et al.
Published: (2024)
Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model
by: Sun, Changchang, et al.
Published: (2025)
by: Sun, Changchang, et al.
Published: (2025)
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
by: Zhang, Yixiao, et al.
Published: (2024)
by: Zhang, Yixiao, et al.
Published: (2024)
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
by: Bai, Ye, et al.
Published: (2024)
by: Bai, Ye, et al.
Published: (2024)
Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model
by: Karchkhadze, Tornike, et al.
Published: (2024)
by: Karchkhadze, Tornike, et al.
Published: (2024)
Why Perturbing Symbolic Music is Necessary: Fitting the Distribution of Never-used Notes through a Joint Probabilistic Diffusion Model
by: Liu, Shipei, et al.
Published: (2024)
by: Liu, Shipei, et al.
Published: (2024)
Continual Audio Deepfake Detection via Universal Adversarial Perturbation
by: Li, Wangjie, et al.
Published: (2025)
by: Li, Wangjie, et al.
Published: (2025)
ProGress: Structured Music Generation via Graph Diffusion and Hierarchical Music Analysis
by: Ni-Hahn, Stephen, et al.
Published: (2025)
by: Ni-Hahn, Stephen, et al.
Published: (2025)
SAMUeL: Efficient Vocal-Conditioned Music Generation via Soft Alignment Attention and Latent Diffusion
by: Cheung, Hei Shing, et al.
Published: (2025)
by: Cheung, Hei Shing, et al.
Published: (2025)
Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators
by: Novack, Zachary, et al.
Published: (2026)
by: Novack, Zachary, et al.
Published: (2026)
Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation
by: Zhang, Jincheng, et al.
Published: (2025)
by: Zhang, Jincheng, et al.
Published: (2025)
GACA-DiT: Diffusion-based Dance-to-Music Generation with Genre-Adaptive Rhythm and Context-Aware Alignment
by: Wang, Jinting, et al.
Published: (2025)
by: Wang, Jinting, et al.
Published: (2025)
ViTex: Visual Texture Control for Multi-Track Symbolic Music Generation via Discrete Diffusion Models
by: Yi, Xiaoyu, et al.
Published: (2026)
by: Yi, Xiaoyu, et al.
Published: (2026)
GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models
by: Guinot, Julien, et al.
Published: (2025)
by: Guinot, Julien, et al.
Published: (2025)
Similar Items
-
DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025) -
An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025) -
TopSeg: A Multi-Scale Topological Framework for Data-Efficient Heart Sound Segmentation
by: Zhang, Peihong, et al.
Published: (2025) -
Training a Perceptual Model for Evaluating Auditory Similarity in Music Adversarial Attack
by: Liu, Yuxuan, et al.
Published: (2025) -
MAIA: An Inpainting-Based Approach for Music Adversarial Attacks
by: Liu, Yuxuan, et al.
Published: (2025)