:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Vohra, Arhan, Akama, Taketo
Format:	Preprint
Published:	2026
Subjects:	Sound
Online Access:	https://arxiv.org/abs/2601.19109
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Computational Analysis of Lyric Similarity Perception
by: Kim, Haven, et al.
Published: (2024)

Annotation-free Automatic Music Transcription with Scalable Synthetic Data and Adversarial Domain Confusion
by: Sato, Gakusei, et al.
Published: (2023)

Self-supervised restoration of singing voice degraded by pitch shifting using shallow diffusion
by: Liu, Yunyi, et al.
Published: (2026)

Music Proofreading with RefinPaint: Where and How to Modify Compositions given Context
by: Ramoneda, Pedro, et al.
Published: (2024)

Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement
by: Take, Osamu, et al.
Published: (2024)

HyperGANStrument: Instrument Sound Synthesis and Editing with Pitch-Invariant Hypernetworks
by: Zhang, Zhe, et al.
Published: (2024)

PF-D2M: A Pose-free Diffusion Model for Universal Dance-to-Music Generation
by: Im, Jaekwon, et al.
Published: (2026)

A Preliminary Investigation on Flexible Singing Voice Synthesis Through Decomposed Framework with Inferrable Features
by: Violeta, Lester Phillip, et al.
Published: (2024)

Towards Realistic Synthetic Data for Automatic Drum Transcription
by: Melucci, Pierfrancesco, et al.
Published: (2026)

Decoding Selective Auditory Attention to Musical Elements in Ecologically Valid Music Listening
by: Akama, Taketo, et al.
Published: (2025)

Naturalistic Music Decoding from EEG Data via Latent Diffusion Models
by: Postolache, Emilian, et al.
Published: (2024)

Perceptually Aligning Representations of Music via Noise-Augmented Autoencoders
by: Bjare, Mathias Rose, et al.
Published: (2025)

Predicting Artificial Neural Network Representations to Learn Recognition Model for Music Identification from Brain Recordings
by: Akama, Taketo, et al.
Published: (2024)

Perceptual Musical Features for Interpretable Audio Tagging
by: Lyberatos, Vassilis, et al.
Published: (2023)

Training a Perceptual Model for Evaluating Auditory Similarity in Music Adversarial Attack
by: Liu, Yuxuan, et al.
Published: (2025)

Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks
by: Zang, Yongyi, et al.
Published: (2025)

Music Foundation Model as Generic Booster for Music Downstream Tasks
by: Liao, WeiHsiang, et al.
Published: (2024)

Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings
by: Affolter, Joanne, et al.
Published: (2026)

Back to Ear: Perceptually Driven High Fidelity Music Reconstruction
by: Wang, Kangdi, et al.
Published: (2025)

Expressivity-aware Music Performance Retrieval using Mid-level Perceptual Features and Emotion Word Embeddings
by: Chowdhury, Shreyan, et al.
Published: (2024)

MERIT: Learning Disentangled Music Representations for Audio Similarity
by: Roy, Abhinaba, et al.
Published: (2026)

A Similarity Network for Correlating Musical Structure to Military Strategy
by: Zhang, Yiwen, et al.
Published: (2026)

Activation Patching for Interpretable Steering in Music Generation
by: Facchiano, Simone, et al.
Published: (2025)

Musical Word Embedding for Music Tagging and Retrieval
by: Doh, SeungHeon, et al.
Published: (2024)

SAME: A Semantically-Aligned Music Autoencoder
by: Parker, Julian D., et al.
Published: (2026)

PBSCR: The Piano Bootleg Score Composer Recognition Dataset
by: Jain, Arhan, et al.
Published: (2024)

MusicRL: Aligning Music Generation to Human Preferences
by: Cideron, Geoffrey, et al.
Published: (2024)

URGENT-PK: Perceptually-Aligned Ranking Model Designed for Speech Enhancement Competition
by: Wang, Jiahe, et al.
Published: (2025)

Controllable Embedding Transformation for Mood-Guided Music Retrieval
by: Wilkins, Julia, et al.
Published: (2025)

Multimodal Dataset Normalization and Perceptual Validation for Music-Taste Correspondences
by: Spanio, Matteo, et al.
Published: (2026)

Learning Separated Representations for Instrument-based Music Similarity
by: Hashizume, Yuka, et al.
Published: (2025)

Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search
by: McCallum, Matthew C., et al.
Published: (2024)

Improving Controllability and Editability for Pretrained Text-to-Music Generation Models
by: Zhang, Yixiao
Published: (2024)

Aligning Text-to-Music Evaluation with Human Preferences
by: Huang, Yichen, et al.
Published: (2025)

Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping
by: Berger, Clémentine, et al.
Published: (2025)

Emotion-Aligned Contrastive Learning Between Images and Music
by: Stewart, Shanti, et al.
Published: (2023)

Constructing Composite Features for Interpretable Music-Tagging
by: Xue, Chenhao, et al.
Published: (2026)

Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
by: Hashizume, Yuka, et al.
Published: (2024)

Similarity-Guided Diffusion for Long-Gap Music Inpainting
by: Turland, Sean, et al.
Published: (2025)

Generating Music with Structure Using Self-Similarity as Attention
by: Hager, Sophia, et al.
Published: (2024)