| Main Authors: | Vohra, Arhan; Akama, Taketo |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.19109 |
Similar Items
A Computational Analysis of Lyric Similarity Perception
by: Kim, Haven, et al.
Published: (2024)
Annotation-free Automatic Music Transcription with Scalable Synthetic Data and Adversarial Domain Confusion
by: Sato, Gakusei, et al.
Published: (2023)
Self-supervised restoration of singing voice degraded by pitch shifting using shallow diffusion
by: Liu, Yunyi, et al.
Published: (2026)
Music Proofreading with RefinPaint: Where and How to Modify Compositions given Context
by: Ramoneda, Pedro, et al.
Published: (2024)
Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement
by: Take, Osamu, et al.
Published: (2024)
HyperGANStrument: Instrument Sound Synthesis and Editing with Pitch-Invariant Hypernetworks
by: Zhang, Zhe, et al.
Published: (2024)
PF-D2M: A Pose-free Diffusion Model for Universal Dance-to-Music Generation
by: Im, Jaekwon, et al.
Published: (2026)
A Preliminary Investigation on Flexible Singing Voice Synthesis Through Decomposed Framework with Inferrable Features
by: Violeta, Lester Phillip, et al.
Published: (2024)
Towards Realistic Synthetic Data for Automatic Drum Transcription
by: Melucci, Pierfrancesco, et al.
Published: (2026)
Decoding Selective Auditory Attention to Musical Elements in Ecologically Valid Music Listening
by: Akama, Taketo, et al.
Published: (2025)
Naturalistic Music Decoding from EEG Data via Latent Diffusion Models
by: Postolache, Emilian, et al.
Published: (2024)
Perceptually Aligning Representations of Music via Noise-Augmented Autoencoders
by: Bjare, Mathias Rose, et al.
Published: (2025)
Predicting Artificial Neural Network Representations to Learn Recognition Model for Music Identification from Brain Recordings
by: Akama, Taketo, et al.
Published: (2024)
Perceptual Musical Features for Interpretable Audio Tagging
by: Lyberatos, Vassilis, et al.
Published: (2023)
Training a Perceptual Model for Evaluating Auditory Similarity in Music Adversarial Attack
by: Liu, Yuxuan, et al.
Published: (2025)
Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks
by: Zang, Yongyi, et al.
Published: (2025)
Music Foundation Model as Generic Booster for Music Downstream Tasks
by: Liao, WeiHsiang, et al.
Published: (2024)
Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings
by: Affolter, Joanne, et al.
Published: (2026)
Back to Ear: Perceptually Driven High Fidelity Music Reconstruction
by: Wang, Kangdi, et al.
Published: (2025)
Expressivity-aware Music Performance Retrieval using Mid-level Perceptual Features and Emotion Word Embeddings
by: Chowdhury, Shreyan, et al.
Published: (2024)
MERIT: Learning Disentangled Music Representations for Audio Similarity
by: Roy, Abhinaba, et al.
Published: (2026)
A Similarity Network for Correlating Musical Structure to Military Strategy
by: Zhang, Yiwen, et al.
Published: (2026)
Activation Patching for Interpretable Steering in Music Generation
by: Facchiano, Simone, et al.
Published: (2025)
Musical Word Embedding for Music Tagging and Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)
SAME: A Semantically-Aligned Music Autoencoder
by: Parker, Julian D., et al.
Published: (2026)
PBSCR: The Piano Bootleg Score Composer Recognition Dataset
by: Jain, Arhan, et al.
Published: (2024)
MusicRL: Aligning Music Generation to Human Preferences
by: Cideron, Geoffrey, et al.
Published: (2024)
URGENT-PK: Perceptually-Aligned Ranking Model Designed for Speech Enhancement Competition
by: Wang, Jiahe, et al.
Published: (2025)
Controllable Embedding Transformation for Mood-Guided Music Retrieval
by: Wilkins, Julia, et al.
Published: (2025)
Multimodal Dataset Normalization and Perceptual Validation for Music-Taste Correspondences
by: Spanio, Matteo, et al.
Published: (2026)
Learning Separated Representations for Instrument-based Music Similarity
by: Hashizume, Yuka, et al.
Published: (2025)
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search
by: McCallum, Matthew C., et al.
Published: (2024)
Improving Controllability and Editability for Pretrained Text-to-Music Generation Models
by: Zhang, Yixiao
Published: (2024)
Aligning Text-to-Music Evaluation with Human Preferences
by: Huang, Yichen, et al.
Published: (2025)
Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping
by: Berger, Clémentine, et al.
Published: (2025)
Emotion-Aligned Contrastive Learning Between Images and Music
by: Stewart, Shanti, et al.
Published: (2023)
Constructing Composite Features for Interpretable Music-Tagging
by: Xue, Chenhao, et al.
Published: (2026)
Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
by: Hashizume, Yuka, et al.
Published: (2024)
Similarity-Guided Diffusion for Long-Gap Music Inpainting
by: Turland, Sean, et al.
Published: (2025)
Generating Music with Structure Using Self-Similarity as Attention
by: Hager, Sophia, et al.
Published: (2024)