Saved in:
| Main Authors: | Li, Jiun-Ting, Yan, Bi-Cheng, Wang, Yi-Cheng, Chen, Berlin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.16876 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence
by: Li, Jiun-Ting, et al.
Published: (2024)
by: Li, Jiun-Ting, et al.
Published: (2024)
Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment
by: Chao, Fu-An, et al.
Published: (2025)
by: Chao, Fu-An, et al.
Published: (2025)
ConPCO: Preserving Phoneme Characteristics for Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization
by: Yan, Bi-Cheng, et al.
Published: (2024)
by: Yan, Bi-Cheng, et al.
Published: (2024)
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
by: Wang, Yi-Cheng, et al.
Published: (2024)
by: Wang, Yi-Cheng, et al.
Published: (2024)
MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios
by: Chen, Yu-Wen, et al.
Published: (2023)
by: Chen, Yu-Wen, et al.
Published: (2023)
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
by: Wang, Yi-Cheng, et al.
Published: (2024)
by: Wang, Yi-Cheng, et al.
Published: (2024)
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss
by: Chao, Fu-An, et al.
Published: (2025)
by: Chao, Fu-An, et al.
Published: (2025)
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection
by: Yang, Tzu-Ting, et al.
Published: (2024)
by: Yang, Tzu-Ting, et al.
Published: (2024)
Mitigating Data Imbalance in Automated Speaking Assessment
by: Tsai, Fong-Chun, et al.
Published: (2025)
by: Tsai, Fong-Chun, et al.
Published: (2025)
MuFFIN: Multifaceted Pronunciation Feedback Model with Interactive Hierarchical Neural Modeling
by: Yan, Bi-Cheng, et al.
Published: (2025)
by: Yan, Bi-Cheng, et al.
Published: (2025)
Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing
by: Cheng, Gaofeng, et al.
Published: (2025)
by: Cheng, Gaofeng, et al.
Published: (2025)
Pronunciation Assessment with Multi-modal Large Language Models
by: Fu, Kaiqi, et al.
Published: (2024)
by: Fu, Kaiqi, et al.
Published: (2024)
Multi-granularity Interactive Attention Framework for Residual Hierarchical Pronunciation Assessment
by: Han, Hong, et al.
Published: (2026)
by: Han, Hong, et al.
Published: (2026)
Fine-Tuning Large Multimodal Models for Automatic Pronunciation Assessment
by: Wang, Ke, et al.
Published: (2025)
by: Wang, Ke, et al.
Published: (2025)
Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning
by: Sun, Siqi, et al.
Published: (2024)
by: Sun, Siqi, et al.
Published: (2024)
HiPPO: Exploring A Novel Hierarchical Pronunciation Assessment Approach for Spoken Languages
by: Yan, Bi-Cheng, et al.
Published: (2025)
by: Yan, Bi-Cheng, et al.
Published: (2025)
Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment
by: Wang, Ke, et al.
Published: (2025)
by: Wang, Ke, et al.
Published: (2025)
Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment
by: Do, Heejin, et al.
Published: (2024)
by: Do, Heejin, et al.
Published: (2024)
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement
by: Yang, Tzu-Ting, et al.
Published: (2024)
by: Yang, Tzu-Ting, et al.
Published: (2024)
Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning
by: Fang, Yu-Hsuan, et al.
Published: (2025)
by: Fang, Yu-Hsuan, et al.
Published: (2025)
Second Language Pronunciation Assessment
Published: (2026)
Published: (2026)
Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies
by: Wu, Chung-Wen, et al.
Published: (2024)
by: Wu, Chung-Wen, et al.
Published: (2024)
Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning
by: Lin, Hong-Yun, et al.
Published: (2025)
by: Lin, Hong-Yun, et al.
Published: (2025)
Compositional Phoneme Approximation for L1-Grounded L2 Pronunciation Training
by: Park, Jisang, et al.
Published: (2024)
by: Park, Jisang, et al.
Published: (2024)
MMMOS: Multi-domain Multi-axis Audio Quality Assessment
by: Lin, Yi-Cheng, et al.
Published: (2025)
by: Lin, Yi-Cheng, et al.
Published: (2025)
Lost in Pronunciation: Detecting Chinese Offensive Language Disguised by Phonetic Cloaking Replacement
by: Guo, Haotan, et al.
Published: (2025)
by: Guo, Haotan, et al.
Published: (2025)
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration
by: Yu, Han-Cheng, et al.
Published: (2024)
by: Yu, Han-Cheng, et al.
Published: (2024)
Enhancing Cross-task Transfer of Large Language Models via Activation Steering
by: Tang, Xinyu, et al.
Published: (2025)
by: Tang, Xinyu, et al.
Published: (2025)
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM
by: Shen, Zhewen, et al.
Published: (2024)
by: Shen, Zhewen, et al.
Published: (2024)
Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation Assessment
by: Choi, Kwanghee, et al.
Published: (2025)
by: Choi, Kwanghee, et al.
Published: (2025)
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception
by: Ni, Shiyu, et al.
Published: (2025)
by: Ni, Shiyu, et al.
Published: (2025)
MAGI: Multi-Agent Guided Interview for Psychiatric Assessment
by: Bi, Guanqun, et al.
Published: (2025)
by: Bi, Guanqun, et al.
Published: (2025)
How Knowledge Popularity Influences and Enhances LLM Knowledge Boundary Perception
by: Ni, Shiyu, et al.
Published: (2025)
by: Ni, Shiyu, et al.
Published: (2025)
Segmentation-free Goodness of Pronunciation
by: Cao, Xinwei, et al.
Published: (2025)
by: Cao, Xinwei, et al.
Published: (2025)
MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning
by: Cui, Wanqing, et al.
Published: (2024)
by: Cui, Wanqing, et al.
Published: (2024)
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
by: Li, Zhaowei, et al.
Published: (2024)
by: Li, Zhaowei, et al.
Published: (2024)
Deep Prompt Multi-task Network for Abuse Language Detection
by: Zhu, Jian, et al.
Published: (2024)
by: Zhu, Jian, et al.
Published: (2024)
Enhancing Entity Aware Machine Translation with Multi-task Learning
by: Trieu, An, et al.
Published: (2025)
by: Trieu, An, et al.
Published: (2025)
Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts
by: Bi, Baolong, et al.
Published: (2024)
by: Bi, Baolong, et al.
Published: (2024)
$M^3EL$: A Multi-task Multi-topic Dataset for Multi-modal Entity Linking
by: Wang, Fang, et al.
Published: (2024)
by: Wang, Fang, et al.
Published: (2024)
Similar Items
-
Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence
by: Li, Jiun-Ting, et al.
Published: (2024) -
Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment
by: Chao, Fu-An, et al.
Published: (2025) -
ConPCO: Preserving Phoneme Characteristics for Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization
by: Yan, Bi-Cheng, et al.
Published: (2024) -
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
by: Wang, Yi-Cheng, et al.
Published: (2024) -
MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios
by: Chen, Yu-Wen, et al.
Published: (2023)