Saved in:
| Main Authors: | Li, Jiun-Ting, Yan, Bi-Cheng, Lo, Tien-Hong, Wang, Yi-Cheng, Hsu, Yung-Chang, Chen, Berlin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.07064 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mitigating Data Imbalance in Automated Speaking Assessment
by: Tsai, Fong-Chun, et al.
Published: (2025)
by: Tsai, Fong-Chun, et al.
Published: (2025)
HiPPO: Exploring A Novel Hierarchical Pronunciation Assessment Approach for Spoken Languages
by: Yan, Bi-Cheng, et al.
Published: (2025)
by: Yan, Bi-Cheng, et al.
Published: (2025)
Multi-task Pretraining for Enhancing Interpretable L2 Pronunciation Assessment
by: Li, Jiun-Ting, et al.
Published: (2025)
by: Li, Jiun-Ting, et al.
Published: (2025)
Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning
by: Fang, Yu-Hsuan, et al.
Published: (2025)
by: Fang, Yu-Hsuan, et al.
Published: (2025)
An Effective Strategy for Modeling Score Ordinality and Non-uniform Intervals in Automated Speaking Assessment
by: Lo, Tien-Hong, et al.
Published: (2025)
by: Lo, Tien-Hong, et al.
Published: (2025)
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution
by: Lo, Tien-Hong, et al.
Published: (2024)
by: Lo, Tien-Hong, et al.
Published: (2024)
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations
by: Lin, Guan-Ting, et al.
Published: (2024)
by: Lin, Guan-Ting, et al.
Published: (2024)
Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment
by: Chao, Fu-An, et al.
Published: (2025)
by: Chao, Fu-An, et al.
Published: (2025)
Advancing Automated Speaking Assessment Leveraging Multifaceted Relevance and Grammar Information
by: Lu, Hao-Chien, et al.
Published: (2025)
by: Lu, Hao-Chien, et al.
Published: (2025)
CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition
by: Sung, Hung-Yang, et al.
Published: (2025)
by: Sung, Hung-Yang, et al.
Published: (2025)
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions
by: Wang, Chung-Chun, et al.
Published: (2025)
by: Wang, Chung-Chun, et al.
Published: (2025)
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
by: Wang, Yi-Cheng, et al.
Published: (2024)
by: Wang, Yi-Cheng, et al.
Published: (2024)
Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning
by: Lin, Hong-Yun, et al.
Published: (2025)
by: Lin, Hong-Yun, et al.
Published: (2025)
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
by: Wang, Yi-Cheng, et al.
Published: (2024)
by: Wang, Yi-Cheng, et al.
Published: (2024)
ConPCO: Preserving Phoneme Characteristics for Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization
by: Yan, Bi-Cheng, et al.
Published: (2024)
by: Yan, Bi-Cheng, et al.
Published: (2024)
Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment
by: Lo, Tien-Hong, et al.
Published: (2024)
by: Lo, Tien-Hong, et al.
Published: (2024)
Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models
by: Lin, Yu-Xiang, et al.
Published: (2025)
by: Lin, Yu-Xiang, et al.
Published: (2025)
On the Fallacy of Global Token Perplexity in Spoken Language Model Evaluation
by: Hsu, Chan-Jan, et al.
Published: (2026)
by: Hsu, Chan-Jan, et al.
Published: (2026)
Contextual Biasing for Streaming ASR via CTC-based Word Spotting
by: Tsai, Kai-Chen, et al.
Published: (2026)
by: Tsai, Kai-Chen, et al.
Published: (2026)
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection
by: Yang, Tzu-Ting, et al.
Published: (2024)
by: Yang, Tzu-Ting, et al.
Published: (2024)
MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA
by: Song, Yutong, et al.
Published: (2026)
by: Song, Yutong, et al.
Published: (2026)
Efficient Dialect-Aware Modeling and Conditioning for Low-Resource Taiwanese Hakka Speech Processing
by: Peng, An-Ci, et al.
Published: (2026)
by: Peng, An-Ci, et al.
Published: (2026)
Reflecting Twice before Speaking with Empathy: Self-Reflective Alternating Inference for Empathy-Aware End-to-End Spoken Dialogue
by: Jia, Yuhang, et al.
Published: (2026)
by: Jia, Yuhang, et al.
Published: (2026)
Enhancing Finite State Machine Design Automation with Large Language Models and Prompt Engineering Techniques
by: Lin, Qun-Kai, et al.
Published: (2025)
by: Lin, Qun-Kai, et al.
Published: (2025)
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
by: Wang, Yile, et al.
Published: (2024)
by: Wang, Yile, et al.
Published: (2024)
A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
by: Yin, Yuan-Jhe, et al.
Published: (2024)
by: Yin, Yuan-Jhe, et al.
Published: (2024)
The NTNU System at the S&I Challenge 2025 SLA Open Track
by: Lin, Hong-Yun, et al.
Published: (2025)
by: Lin, Hong-Yun, et al.
Published: (2025)
CARE: Causality Reasoning for Empathetic Responses by Conditional Graph Generation
by: Wang, Jiashuo, et al.
Published: (2022)
by: Wang, Jiashuo, et al.
Published: (2022)
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
by: Lin, Chyi-Jiunn, et al.
Published: (2024)
by: Lin, Chyi-Jiunn, et al.
Published: (2024)
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement
by: Yang, Tzu-Ting, et al.
Published: (2024)
by: Yang, Tzu-Ting, et al.
Published: (2024)
FreezeEmpath: Efficient Training for Empathetic Spoken Chatbots with Frozen LLMs
by: Hong, Yun, et al.
Published: (2026)
by: Hong, Yun, et al.
Published: (2026)
Spoken Grammar Assessment Using LLM
by: Kopparapu, Sunil Kumar, et al.
Published: (2024)
by: Kopparapu, Sunil Kumar, et al.
Published: (2024)
Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models
by: Lin, Yi-Cheng, et al.
Published: (2024)
by: Lin, Yi-Cheng, et al.
Published: (2024)
Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
by: Lu, Yen-Ju, et al.
Published: (2025)
by: Lu, Yen-Ju, et al.
Published: (2025)
StyleBench: Evaluating Speech Language Models on Conversational Speaking Style Control
by: Zhao, Haishu, et al.
Published: (2026)
by: Zhao, Haishu, et al.
Published: (2026)
Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models
by: Wu, Donghang, et al.
Published: (2025)
by: Wu, Donghang, et al.
Published: (2025)
EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems
by: Liu, Jingwen, et al.
Published: (2025)
by: Liu, Jingwen, et al.
Published: (2025)
MOSS-TTSD: Text to Spoken Dialogue Generation
by: Zhang, Yuqian, et al.
Published: (2026)
by: Zhang, Yuqian, et al.
Published: (2026)
GSQA: An End-to-End Model for Generative Spoken Question Answering
by: Shih, Min-Han, et al.
Published: (2023)
by: Shih, Min-Han, et al.
Published: (2023)
From PARIS to LE-PARIS: Toward Patent Response Automation with Recommender Systems and Collaborative Large Language Models
by: Chu, Jung-Mei, et al.
Published: (2024)
by: Chu, Jung-Mei, et al.
Published: (2024)
Similar Items
-
Mitigating Data Imbalance in Automated Speaking Assessment
by: Tsai, Fong-Chun, et al.
Published: (2025) -
HiPPO: Exploring A Novel Hierarchical Pronunciation Assessment Approach for Spoken Languages
by: Yan, Bi-Cheng, et al.
Published: (2025) -
Multi-task Pretraining for Enhancing Interpretable L2 Pronunciation Assessment
by: Li, Jiun-Ting, et al.
Published: (2025) -
Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning
by: Fang, Yu-Hsuan, et al.
Published: (2025) -
An Effective Strategy for Modeling Score Ordinality and Non-uniform Intervals in Automated Speaking Assessment
by: Lo, Tien-Hong, et al.
Published: (2025)