| Main Authors: | Cameron, Joseph; Blackwell, Alan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.16713 |
Similar Items
A Semantic Timbre Dataset for the Electric Guitar
by: Cameron, Joseph, et al.
Published: (2026)
Pitch-Conditioned Instrument Sound Synthesis From an Interactive Timbre Latent Space
by: Limberg, Christian, et al.
Published: (2025)
Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer
by: Mancusi, Michele, et al.
Published: (2024)
A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder
by: Huang, Dengyun, et al.
Published: (2025)
Do Joint Language-Audio Embeddings Encode Perceptual Timbre Semantics?
by: Deng, Qixin, et al.
Published: (2025)
Zero-Shot Voice Conversion via Content-Aware Timbre Ensemble and Conditional Flow Matching
by: Pan, Yu, et al.
Published: (2024)
Research on Piano Timbre Transformation System Based on Diffusion Model
by: Hsu, Chun-Chieh, et al.
Published: (2026)
Tutti: Expressive Multi-Singer Synthesis via Structure-Level Timbre Control and Vocal Texture Modeling
by: Chen, Jiatao, et al.
Published: (2026)
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
by: Shetu, Shrishti Saha, et al.
Published: (2024)
CartoonSing: Unifying Human and Nonhuman Timbres in Singing Generation
by: Han, Jionghao, et al.
Published: (2025)
The First Voice Timbre Attribute Detection Challenge
by: Chen, Liping, et al.
Published: (2025)
The Voice Timbre Attribute Detection 2025 Challenge Evaluation Plan
by: Sheng, Zhengyan, et al.
Published: (2025)
Improvements of Discriminative Feature Space Training for Anomalous Sound Detection in Unlabeled Conditions
by: Fujimura, Takuya, et al.
Published: (2024)
CodecFlow: Efficient Bandwidth Extension via Conditional Flow Matching in Neural Codec Latent Space
by: Zhang, Bowen, et al.
Published: (2026)
Timbre Difference Capturing in Anomalous Sound Detection
by: Nishida, Tomoya, et al.
Published: (2024)
Assessing the Alignment of Audio Representations with Timbre Similarity Ratings
by: Tian, Haokun, et al.
Published: (2025)
Fast Timing-Conditioned Latent Audio Diffusion
by: Evans, Zach, et al.
Published: (2024)
State Space Models for Bioacoustics: A Comparative Evaluation with Transformers
by: Tang, Chengyu, et al.
Published: (2025)
Timbre Perception, Representation, and its Neuroscientific Exploration: A Comprehensive Review
by: Zhang, Hong, et al.
Published: (2024)
Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems
by: Chen, Leduo, et al.
Published: (2026)
Techniques for Quantum-Computing-Aided Algorithmic Composition: Experiments in Rhythm, Timbre, Harmony, and Space
by: Dobrian, Christopher, et al.
Published: (2025)
Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders
by: Paek, Nathan, et al.
Published: (2025)
Perceptual Musical Features for Interpretable Audio Tagging
by: Lyberatos, Vassilis, et al.
Published: (2023)
SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
by: Niu, Xinlei, et al.
Published: (2024)
Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning
by: Zhao, Shengkui, et al.
Published: (2025)
Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models
by: Jajoria, Pushkar, et al.
Published: (2024)
Audio Conditioning for Music Generation via Discrete Bottleneck Features
by: Rouard, Simon, et al.
Published: (2024)
WaveTransfer: A Flexible End-to-end Multi-instrument Timbre Transfer with Diffusion
by: Baoueb, Teysir, et al.
Published: (2024)
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE
by: Lian, Jiachen, et al.
Published: (2022)
QvTAD: Differential Relative Attribute Learning for Voice Timbre Attribute Detection
by: Wu, Zhiyu, et al.
Published: (2025)
Adversarial Multi-Task Learning for Disentangling Timbre and Pitch in Singing Voice Synthesis
by: Kim, Tae-Woo, et al.
Published: (2022)
DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification
by: Wang, Qing, et al.
Published: (2025)
Diffusion Timbre Transfer Via Mutual Information Guided Inpainting
by: Lee, Ching Ho, et al.
Published: (2026)
Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model
by: Sun, Changchang, et al.
Published: (2025)
Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription
by: Cwitkowitz, Frank, et al.
Published: (2023)
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment
by: Ragano, Alessandro, et al.
Published: (2023)
Training a Perceptual Model for Evaluating Auditory Similarity in Music Adversarial Attack
by: Liu, Yuxuan, et al.
Published: (2025)
STCTS: Generative Semantic Compression for Ultra-Low Bitrate Speech via Explicit Text-Prosody-Timbre Decomposition
by: Wang, Siyu, et al.
Published: (2025)
Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration
by: Li, Haowen, et al.
Published: (2026)
SAMUeL: Efficient Vocal-Conditioned Music Generation via Soft Alignment Attention and Latent Diffusion
by: Cheung, Hei Shing, et al.
Published: (2025)