Saved in:
| Main Author: | Mackenzie, Pierre |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.22621 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Chord Embeddings: Analyzing What They Capture and Their Role for Next Chord Prediction and Artist Attribute Prediction
by: Lahnala, Allison, et al.
Published: (2021)
by: Lahnala, Allison, et al.
Published: (2021)
ChordFormer: A Conformer-Based Architecture for Large-Vocabulary Audio Chord Recognition
by: Akram, Muhammad Waseem, et al.
Published: (2025)
by: Akram, Muhammad Waseem, et al.
Published: (2025)
MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI
by: Jung, Jongmin, et al.
Published: (2024)
by: Jung, Jongmin, et al.
Published: (2024)
An End-to-End Approach for Chord-Conditioned Song Generation
by: Gao, Shuochen, et al.
Published: (2024)
by: Gao, Shuochen, et al.
Published: (2024)
Incorporating Structure and Chord Constraints in Symbolic Transformer-based Melodic Harmonization
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)
Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition
by: Kucukmanisa, Ayhan, et al.
Published: (2025)
by: Kucukmanisa, Ayhan, et al.
Published: (2025)
Guitar Chord Diagram Suggestion for Western Popular Music
by: d'Hooge, Alexandre, et al.
Published: (2024)
by: d'Hooge, Alexandre, et al.
Published: (2024)
Deep Learning for Speech Emotion Recognition: A CNN Approach Utilizing Mel Spectrograms
by: Penumajji, Niketa
Published: (2025)
by: Penumajji, Niketa
Published: (2025)
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation
by: Lan, Yun-Han, et al.
Published: (2024)
by: Lan, Yun-Han, et al.
Published: (2024)
MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT
by: Zhu, Jinlong, et al.
Published: (2024)
by: Zhu, Jinlong, et al.
Published: (2024)
AUDRON: A Deep Learning Framework with Fused Acoustic Signatures for Drone Type Recognition
by: Chatterjee, Rajdeep, et al.
Published: (2025)
by: Chatterjee, Rajdeep, et al.
Published: (2025)
Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments
by: Djeffal, Noussaiba, et al.
Published: (2025)
by: Djeffal, Noussaiba, et al.
Published: (2025)
Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model
by: Oluwademilade, Adelekun, et al.
Published: (2026)
by: Oluwademilade, Adelekun, et al.
Published: (2026)
Explaining Deep Learning Embeddings for Speech Emotion Recognition by Predicting Interpretable Acoustic Features
by: Dixit, Satvik, et al.
Published: (2024)
by: Dixit, Satvik, et al.
Published: (2024)
Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database
by: Xiao, Qing, et al.
Published: (2025)
by: Xiao, Qing, et al.
Published: (2025)
Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning
by: Quelennec, Aurian, et al.
Published: (2025)
by: Quelennec, Aurian, et al.
Published: (2025)
From Discord to Harmony: Decomposed Consonance-based Training for Improved Audio Chord Estimation
by: Poltronieri, Andrea, et al.
Published: (2025)
by: Poltronieri, Andrea, et al.
Published: (2025)
MATPAC++: Enhanced Masked Latent Prediction for Self-Supervised Audio Representation Learning
by: Quelennec, Aurian, et al.
Published: (2025)
by: Quelennec, Aurian, et al.
Published: (2025)
Environmental Sound Deepfake Detection Using Deep-Learning Framework
by: Pham, Lam, et al.
Published: (2026)
by: Pham, Lam, et al.
Published: (2026)
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
by: Yu, Xincheng, et al.
Published: (2023)
by: Yu, Xincheng, et al.
Published: (2023)
Improving Audio Event Recognition with Consistency Regularization
by: Sadhu, Shanmuka, et al.
Published: (2025)
by: Sadhu, Shanmuka, et al.
Published: (2025)
Unifying EEG and Speech for Emotion Recognition: A Two-Step Joint Learning Framework for Handling Missing EEG Data During Inference
by: Tiwari, Upasana, et al.
Published: (2025)
by: Tiwari, Upasana, et al.
Published: (2025)
Voices of the Mountains: Deep Learning-Based Vocal Error Detection System for Kurdish Maqams
by: Khairaldeen, Darvan Shvan, et al.
Published: (2026)
by: Khairaldeen, Darvan Shvan, et al.
Published: (2026)
Graph Embedding with Mel-spectrograms for Underwater Acoustic Target Recognition
by: Feng, Sheng, et al.
Published: (2025)
by: Feng, Sheng, et al.
Published: (2025)
Speech Emotion Recognition via Entropy-Aware Score Selection
by: Chua, ChenYi, et al.
Published: (2025)
by: Chua, ChenYi, et al.
Published: (2025)
RAS: a Reliability Oriented Metric for Automatic Speech Recognition
by: Huang, Wenbin, et al.
Published: (2026)
by: Huang, Wenbin, et al.
Published: (2026)
Elastic Net Regularization and Gabor Dictionary for Classification of Heart Sound Signals using Deep Learning
by: Fakhry, Mahmoud, et al.
Published: (2026)
by: Fakhry, Mahmoud, et al.
Published: (2026)
Let the Model Learn to Feel: Mode-Guided Tonality Injection for Symbolic Music Emotion Recognition
by: Xia, Haiying, et al.
Published: (2025)
by: Xia, Haiying, et al.
Published: (2025)
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
by: Kheddar, Hamza, et al.
Published: (2024)
by: Kheddar, Hamza, et al.
Published: (2024)
Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language
by: Wiafe, Isaac, et al.
Published: (2026)
by: Wiafe, Isaac, et al.
Published: (2026)
Amplifying Emotional Signals: Data-Efficient Deep Learning for Robust Speech Emotion Recognition
by: Vu, Tai
Published: (2025)
by: Vu, Tai
Published: (2025)
MERaLiON-SER: Robust Speech Emotion Recognition Model for English and SEA Languages
by: Sailor, Hardik B., et al.
Published: (2025)
by: Sailor, Hardik B., et al.
Published: (2025)
Multi-Accent Mandarin Dry-Vocal Singing Dataset: Benchmark for Singing Accent Recognition
by: Wang, Zihao, et al.
Published: (2025)
by: Wang, Zihao, et al.
Published: (2025)
DialogGraph-LLM: Graph-Informed LLMs for End-to-End Audio Dialogue Intent Recognition
by: Liu, HongYu, et al.
Published: (2025)
by: Liu, HongYu, et al.
Published: (2025)
SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
by: Yin, Han, et al.
Published: (2025)
by: Yin, Han, et al.
Published: (2025)
EMO-TTA: Improving Test-Time Adaptation of Audio-Language Models for Speech Emotion Recognition
by: Shi, Jiacheng, et al.
Published: (2025)
by: Shi, Jiacheng, et al.
Published: (2025)
Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty
by: Yang, Zhou, et al.
Published: (2026)
by: Yang, Zhou, et al.
Published: (2026)
PTS-SNN: A Prompt-Tuned Temporal Shift Spiking Neural Networks for Efficient Speech Emotion Recognition
by: Su, Xun, et al.
Published: (2026)
by: Su, Xun, et al.
Published: (2026)
Semi-Supervised Self-Learning Enhanced Music Emotion Recognition
by: Sun, Yifu, et al.
Published: (2024)
by: Sun, Yifu, et al.
Published: (2024)
Disentangled Dual-Branch Graph Learning for Conversational Emotion Recognition
by: Guo, Chengling, et al.
Published: (2026)
by: Guo, Chengling, et al.
Published: (2026)
Similar Items
-
Chord Embeddings: Analyzing What They Capture and Their Role for Next Chord Prediction and Artist Attribute Prediction
by: Lahnala, Allison, et al.
Published: (2021) -
ChordFormer: A Conformer-Based Architecture for Large-Vocabulary Audio Chord Recognition
by: Akram, Muhammad Waseem, et al.
Published: (2025) -
MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI
by: Jung, Jongmin, et al.
Published: (2024) -
An End-to-End Approach for Chord-Conditioned Song Generation
by: Gao, Shuochen, et al.
Published: (2024) -
Incorporating Structure and Chord Constraints in Symbolic Transformer-based Melodic Harmonization
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)