:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kucukmanisa, Ayhan, Gelmez, Derya, Calik, Sukru Selim, Kilimci, Zeynep Hilal
Format:	Preprint
Published:	2025
Subjects:	Sound Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.17477
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Explainable-AI powered stock price prediction using time series transformers: A Case Study on BIST100
by: Calik, Sukru Selim, et al.
Published: (2025)

A Critical Review of the Need for Knowledge-Centric Evaluation of Quranic Recitation
by: Al-Kharusi, Mohammed Hilal, et al.
Published: (2025)

ACP-ESM: A novel framework for classification of anticancer peptides using protein-oriented transformer approach
by: Kilimci, Zeynep Hilal, et al.
Published: (2024)

Quranic Audio Dataset: Crowdsourced and Labeled Recitation from Non-Arabic Speakers
by: Salameh, Raghad, et al.
Published: (2024)

Self-Supervised Models for Phoneme Recognition: Applications in Children's Speech for Reading Learning
by: Medin, Lucas Block, et al.
Published: (2025)

Chord Recognition with Deep Learning
by: Mackenzie, Pierre
Published: (2025)

Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment
by: Azad, Asif, et al.
Published: (2026)

Deep Learning for Speech Emotion Recognition: A CNN Approach Utilizing Mel Spectrograms
by: Penumajji, Niketa
Published: (2025)

MEBM-Phoneme: Multi-scale Enhanced BrainMagic for End-to-End MEG Phoneme Classification
by: Jinghua, Liang, et al.
Published: (2026)

Tadabur: A Large-Scale Quran Audio Dataset
by: Alherran, Faisal
Published: (2026)

Arabic Music Classification and Generation using Deep Learning
by: Elshaarawy, Mohamed, et al.
Published: (2024)

Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis
by: Geng, Yizhong, et al.
Published: (2025)

Automatic Pronunciation Error Detection and Correction of the Holy Quran's Learners Using Deep Learning
by: Abdelfattah, Abdullah, et al.
Published: (2025)

Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition
by: Gheffari, Youcef Soufiane, et al.
Published: (2026)

ProKWS: Personalized Keyword Spotting via Collaborative Learning of Phonemes and Prosody
by: Pan, Jianan, et al.
Published: (2026)

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
by: Kheddar, Hamza, et al.
Published: (2024)

TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition
by: Anh, Tran Nguyen, et al.
Published: (2025)

CIPHER: Conformer-based Inference of Phonemes from High-density EEG
by: Madishetty, Varshith
Published: (2026)

Arabic Little STT: Arabic Children Speech Recognition Dataset
by: Alkadri, Mouhand, et al.
Published: (2025)

AUDRON: A Deep Learning Framework with Fused Acoustic Signatures for Drone Type Recognition
by: Chatterjee, Rajdeep, et al.
Published: (2025)

iPhoneme: Brain-to-Text Communication for ALS Using ConformerXL Decoding
by: Cha, Yoonmin, et al.
Published: (2026)

Leveraging Label Potential for Enhanced Multimodal Emotion Recognition
by: Shao, Xuechun, et al.
Published: (2025)

Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments
by: Djeffal, Noussaiba, et al.
Published: (2025)

A Novel Speech Analysis and Correction Tool for Arabic-Speaking Children
by: Berriche, Lamia, et al.
Published: (2024)

Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes
by: Zhang, Kuiyuan, et al.
Published: (2024)

Controllable Singing Voice Synthesis using Phoneme-Level Energy Sequence
by: Ryu, Yerin, et al.
Published: (2025)

Semi-Supervised Self-Learning Enhanced Music Emotion Recognition
by: Sun, Yifu, et al.
Published: (2024)

A Hierarchical Deep Learning Approach for Minority Instrument Detection
by: Sechet, Dylan, et al.
Published: (2025)

Leveraging Contrastive Learning and Self-Training for Multimodal Emotion Recognition with Limited Labeled Samples
by: Fan, Qi, et al.
Published: (2024)

Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement
by: Monir, Nasser-Eddine, et al.
Published: (2025)

Music Genre Classification: A Comparative Analysis of Classical Machine Learning and Deep Learning Approaches
by: Prajuli, Sachin, et al.
Published: (2026)

A Machine Learning Approach for MIDI to Guitar Tablature Conversion
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)

A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)
by: Pham, Lam, et al.
Published: (2024)

PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding
by: He, Jiajun, et al.
Published: (2025)

Omni-AutoThink: Adaptive Multimodal Reasoning via Reinforcement Learning
by: Yang, Dongchao, et al.
Published: (2025)

Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model
by: Oluwademilade, Adelekun, et al.
Published: (2026)

Explaining Deep Learning Embeddings for Speech Emotion Recognition by Predicting Interpretable Acoustic Features
by: Dixit, Satvik, et al.
Published: (2024)

Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database
by: Xiao, Qing, et al.
Published: (2025)

ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
by: Yu, Xincheng, et al.
Published: (2023)

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
by: Yin, Han, et al.
Published: (2025)