:: Library Catalog

Omslagsbillede

Saved in:

Bibliografiske detaljer
Main Authors:	Lee, Sungho, Martínez-Ramírez, Marco A., Liao, Wei-Hsiang, Uhlich, Stefan, Fabbro, Giorgio, Lee, Kyogu, Mitsufuji, Yuki
Format:	Preprint
Udgivet:	2024
Fag:	Sound
Online adgang:	https://arxiv.org/abs/2406.01049
Tags:	Tilføj Tag Ingen Tags, Vær først til at tagge denne postø!

Lignende værker

Reverse Engineering of Music Mixing Graphs with Differentiable Processors and Iterative Pruning
af: Lee, Sungho, et al.
Udgivet: (2025)

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
af: Lee, Sungho, et al.
Udgivet: (2024)

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors
af: Koo, Junghyun, et al.
Udgivet: (2025)

Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer
af: Mancusi, Michele, et al.
Udgivet: (2024)

Music Foundation Model as Generic Booster for Music Downstream Tasks
af: Liao, WeiHsiang, et al.
Udgivet: (2024)

The Whole Is Greater than the Sum of Its Parts: Improving Music Source Separation by Bridging Network
af: Sawata, Ryosuke, et al.
Udgivet: (2023)

Automatic Music Mixing using a Generative Model of Effect Embeddings
af: Moliner, Eloi, et al.
Udgivet: (2025)

Rethinking Speech Representation Aggregation in Speech Enhancement: A Phonetic Mutual Information Perspective
af: Han, Seungu, et al.
Udgivet: (2026)

LLM2Fx-Tools: Tool Calling For Music Post-Production
af: Doh, Seungheon, et al.
Udgivet: (2025)

SteerMusic: Enhanced Musical Consistency for Zero-shot Text-guided and Personalized Music Editing
af: Niu, Xinlei, et al.
Udgivet: (2025)

MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
af: Zhang, Yixiao, et al.
Udgivet: (2024)

Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription
af: Cwitkowitz, Frank, et al.
Udgivet: (2023)

Wavespace: A Highly Explorable Wavetable Generator
af: Lee, Hazounne, et al.
Udgivet: (2024)

Few-step Adversarial Schrödinger Bridge for Generative Speech Enhancement
af: Han, Seungu, et al.
Udgivet: (2025)

Do Captioning Metrics Reflect Music Semantic Alignment?
af: Lee, Jinwoo, et al.
Udgivet: (2024)

Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
af: Batlle-Roca, Roser, et al.
Udgivet: (2024)

Variable Bitrate Residual Vector Quantization for Audio Coding
af: Chae, Yunkee, et al.
Udgivet: (2024)

Music Auto-Tagging with Robust Music Representation Learned via Domain Adversarial Training
af: Joung, Haesun, et al.
Udgivet: (2024)

TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument
af: Kim, Kyungsu, et al.
Udgivet: (2025)

Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
af: Zhang, Yixiao, et al.
Udgivet: (2024)

Can Large Language Models Predict Audio Effects Parameters from Natural Language?
af: Doh, Seungheon, et al.
Udgivet: (2025)

Music De-limiter Networks via Sample-wise Gain Inversion
af: Jeon, Chang-Bin, et al.
Udgivet: (2023)

MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
af: Chae, Yunkee, et al.
Udgivet: (2025)

Large-Scale Training Data Attribution for Music Generative Models via Unlearning
af: Choi, Woosung, et al.
Udgivet: (2025)

Cross-Modal Learning for Music-to-Music-Video Description Generation
af: Mao, Zhuoyuan, et al.
Udgivet: (2025)

Fx-Encoder++: Extracting Instrument-Wise Audio Effects Representations from Mixtures
af: Yeh, Yen-Tung, et al.
Udgivet: (2025)

Improving Inference-Time Optimisation for Vocal Effects Style Transfer with a Gaussian Prior
af: Yu, Chin-Yun, et al.
Udgivet: (2025)

DiffVox: A Differentiable Model for Capturing and Analysing Vocal Effects Distributions
af: Yu, Chin-Yun, et al.
Udgivet: (2025)

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
af: Fabbro, Giorgio, et al.
Udgivet: (2023)

MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage
af: Tan, Hao Hao, et al.
Udgivet: (2024)

Vo-Ve: An Explainable Voice-Vector for Speaker Identity Evaluation
af: Lee, Jaejun, et al.
Udgivet: (2025)

LipSody: Lip-to-Speech Synthesis with Enhanced Prosody Consistency
af: Lee, Jaejun, et al.
Udgivet: (2026)

EMG-to-Speech with Fewer Channels
af: Hwang, Injune, et al.
Udgivet: (2026)

Speaking Without Sound: Multi-speaker Silent Speech Voicing with Facial Inputs Only
af: Lee, Jaejun, et al.
Udgivet: (2026)

OpenMU: Your Swiss Army Knife for Music Understanding
af: Zhao, Mengjie, et al.
Udgivet: (2024)

DOSE : Drum One-Shot Extraction from Music Mixture
af: Hwang, Suntae, et al.
Udgivet: (2025)

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
af: Luo, Yin-Jyun, et al.
Udgivet: (2024)

Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ
af: Chae, Yunkee, et al.
Udgivet: (2025)

Differentiable Acoustic Radiance Transfer
af: Lee, Sungho, et al.
Udgivet: (2025)

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
af: Uhlich, Stefan, et al.
Udgivet: (2023)