Saved in:
| Main Authors: | Kucukmanisa, Ayhan, Gelmez, Derya, Calik, Sukru Selim, Kilimci, Zeynep Hilal |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.17477 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Explainable-AI powered stock price prediction using time series transformers: A Case Study on BIST100
by: Calik, Sukru Selim, et al.
Published: (2025)
by: Calik, Sukru Selim, et al.
Published: (2025)
A Critical Review of the Need for Knowledge-Centric Evaluation of Quranic Recitation
by: Al-Kharusi, Mohammed Hilal, et al.
Published: (2025)
by: Al-Kharusi, Mohammed Hilal, et al.
Published: (2025)
ACP-ESM: A novel framework for classification of anticancer peptides using protein-oriented transformer approach
by: Kilimci, Zeynep Hilal, et al.
Published: (2024)
by: Kilimci, Zeynep Hilal, et al.
Published: (2024)
Quranic Audio Dataset: Crowdsourced and Labeled Recitation from Non-Arabic Speakers
by: Salameh, Raghad, et al.
Published: (2024)
by: Salameh, Raghad, et al.
Published: (2024)
Self-Supervised Models for Phoneme Recognition: Applications in Children's Speech for Reading Learning
by: Medin, Lucas Block, et al.
Published: (2025)
by: Medin, Lucas Block, et al.
Published: (2025)
Chord Recognition with Deep Learning
by: Mackenzie, Pierre
Published: (2025)
by: Mackenzie, Pierre
Published: (2025)
Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment
by: Azad, Asif, et al.
Published: (2026)
by: Azad, Asif, et al.
Published: (2026)
Deep Learning for Speech Emotion Recognition: A CNN Approach Utilizing Mel Spectrograms
by: Penumajji, Niketa
Published: (2025)
by: Penumajji, Niketa
Published: (2025)
MEBM-Phoneme: Multi-scale Enhanced BrainMagic for End-to-End MEG Phoneme Classification
by: Jinghua, Liang, et al.
Published: (2026)
by: Jinghua, Liang, et al.
Published: (2026)
Tadabur: A Large-Scale Quran Audio Dataset
by: Alherran, Faisal
Published: (2026)
by: Alherran, Faisal
Published: (2026)
Arabic Music Classification and Generation using Deep Learning
by: Elshaarawy, Mohamed, et al.
Published: (2024)
by: Elshaarawy, Mohamed, et al.
Published: (2024)
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis
by: Geng, Yizhong, et al.
Published: (2025)
by: Geng, Yizhong, et al.
Published: (2025)
Automatic Pronunciation Error Detection and Correction of the Holy Quran's Learners Using Deep Learning
by: Abdelfattah, Abdullah, et al.
Published: (2025)
by: Abdelfattah, Abdullah, et al.
Published: (2025)
Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition
by: Gheffari, Youcef Soufiane, et al.
Published: (2026)
by: Gheffari, Youcef Soufiane, et al.
Published: (2026)
ProKWS: Personalized Keyword Spotting via Collaborative Learning of Phonemes and Prosody
by: Pan, Jianan, et al.
Published: (2026)
by: Pan, Jianan, et al.
Published: (2026)
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
by: Kheddar, Hamza, et al.
Published: (2024)
by: Kheddar, Hamza, et al.
Published: (2024)
TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition
by: Anh, Tran Nguyen, et al.
Published: (2025)
by: Anh, Tran Nguyen, et al.
Published: (2025)
CIPHER: Conformer-based Inference of Phonemes from High-density EEG
by: Madishetty, Varshith
Published: (2026)
by: Madishetty, Varshith
Published: (2026)
Arabic Little STT: Arabic Children Speech Recognition Dataset
by: Alkadri, Mouhand, et al.
Published: (2025)
by: Alkadri, Mouhand, et al.
Published: (2025)
AUDRON: A Deep Learning Framework with Fused Acoustic Signatures for Drone Type Recognition
by: Chatterjee, Rajdeep, et al.
Published: (2025)
by: Chatterjee, Rajdeep, et al.
Published: (2025)
iPhoneme: Brain-to-Text Communication for ALS Using ConformerXL Decoding
by: Cha, Yoonmin, et al.
Published: (2026)
by: Cha, Yoonmin, et al.
Published: (2026)
Leveraging Label Potential for Enhanced Multimodal Emotion Recognition
by: Shao, Xuechun, et al.
Published: (2025)
by: Shao, Xuechun, et al.
Published: (2025)
Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments
by: Djeffal, Noussaiba, et al.
Published: (2025)
by: Djeffal, Noussaiba, et al.
Published: (2025)
A Novel Speech Analysis and Correction Tool for Arabic-Speaking Children
by: Berriche, Lamia, et al.
Published: (2024)
by: Berriche, Lamia, et al.
Published: (2024)
Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes
by: Zhang, Kuiyuan, et al.
Published: (2024)
by: Zhang, Kuiyuan, et al.
Published: (2024)
Controllable Singing Voice Synthesis using Phoneme-Level Energy Sequence
by: Ryu, Yerin, et al.
Published: (2025)
by: Ryu, Yerin, et al.
Published: (2025)
Semi-Supervised Self-Learning Enhanced Music Emotion Recognition
by: Sun, Yifu, et al.
Published: (2024)
by: Sun, Yifu, et al.
Published: (2024)
A Hierarchical Deep Learning Approach for Minority Instrument Detection
by: Sechet, Dylan, et al.
Published: (2025)
by: Sechet, Dylan, et al.
Published: (2025)
Leveraging Contrastive Learning and Self-Training for Multimodal Emotion Recognition with Limited Labeled Samples
by: Fan, Qi, et al.
Published: (2024)
by: Fan, Qi, et al.
Published: (2024)
Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement
by: Monir, Nasser-Eddine, et al.
Published: (2025)
by: Monir, Nasser-Eddine, et al.
Published: (2025)
Music Genre Classification: A Comparative Analysis of Classical Machine Learning and Deep Learning Approaches
by: Prajuli, Sachin, et al.
Published: (2026)
by: Prajuli, Sachin, et al.
Published: (2026)
A Machine Learning Approach for MIDI to Guitar Tablature Conversion
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)
A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)
by: Pham, Lam, et al.
Published: (2024)
by: Pham, Lam, et al.
Published: (2024)
PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding
by: He, Jiajun, et al.
Published: (2025)
by: He, Jiajun, et al.
Published: (2025)
Omni-AutoThink: Adaptive Multimodal Reasoning via Reinforcement Learning
by: Yang, Dongchao, et al.
Published: (2025)
by: Yang, Dongchao, et al.
Published: (2025)
Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model
by: Oluwademilade, Adelekun, et al.
Published: (2026)
by: Oluwademilade, Adelekun, et al.
Published: (2026)
Explaining Deep Learning Embeddings for Speech Emotion Recognition by Predicting Interpretable Acoustic Features
by: Dixit, Satvik, et al.
Published: (2024)
by: Dixit, Satvik, et al.
Published: (2024)
Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database
by: Xiao, Qing, et al.
Published: (2025)
by: Xiao, Qing, et al.
Published: (2025)
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
by: Yu, Xincheng, et al.
Published: (2023)
by: Yu, Xincheng, et al.
Published: (2023)
SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
by: Yin, Han, et al.
Published: (2025)
by: Yin, Han, et al.
Published: (2025)
Similar Items
-
Explainable-AI powered stock price prediction using time series transformers: A Case Study on BIST100
by: Calik, Sukru Selim, et al.
Published: (2025) -
A Critical Review of the Need for Knowledge-Centric Evaluation of Quranic Recitation
by: Al-Kharusi, Mohammed Hilal, et al.
Published: (2025) -
ACP-ESM: A novel framework for classification of anticancer peptides using protein-oriented transformer approach
by: Kilimci, Zeynep Hilal, et al.
Published: (2024) -
Quranic Audio Dataset: Crowdsourced and Labeled Recitation from Non-Arabic Speakers
by: Salameh, Raghad, et al.
Published: (2024) -
Self-Supervised Models for Phoneme Recognition: Applications in Children's Speech for Reading Learning
by: Medin, Lucas Block, et al.
Published: (2025)