Saved in:
Bibliographic Details
Main Authors: Pritzen, Julia, Gref, Michael, Zühlke, Dietlind, Schmidt, Christoph
Format: Preprint
Published: 2021
Subjects:
Online Access:https://arxiv.org/abs/2105.12708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914508056821760
author Pritzen, Julia
Gref, Michael
Zühlke, Dietlind
Schmidt, Christoph
author_facet Pritzen, Julia
Gref, Michael
Zühlke, Dietlind
Schmidt, Christoph
contents Anglicisms are a challenge in German speech recognition. Due to their irregular pronunciation compared to native German words, automatically generated pronunciation dictionaries often include faulty phoneme sequences for Anglicisms. In this work, we propose a multitask sequence-to-sequence approach for grapheme-to-phoneme conversion to improve the phonetization of Anglicisms. We extended a grapheme-to-phoneme model with a classifier to distinguish Anglicisms from native German words. With this approach, the model learns to generate pronunciations differently depending on the classification result. We used our model to create supplementary Anglicism pronunciation dictionaries that are added to an existing German speech recognition model. Tested on a dedicated Anglicism evaluation set, we improved the recognition of Anglicisms compared to a baseline model, reducing the word error rate by 1 % and the Anglicism error rate by 3 %. We show that multitask learning can help solving the challenge of Anglicisms in German speech recognition.
format Preprint
id arxiv_https___arxiv_org_abs_2105_12708
institution arXiv
publishDate 2021
record_format arxiv
spellingShingle Multitask Learning for Grapheme-to-Phoneme Conversion of Anglicisms in German Speech Recognition
Pritzen, Julia
Gref, Michael
Zühlke, Dietlind
Schmidt, Christoph
Computation and Language
Sound
Audio and Speech Processing
Anglicisms are a challenge in German speech recognition. Due to their irregular pronunciation compared to native German words, automatically generated pronunciation dictionaries often include faulty phoneme sequences for Anglicisms. In this work, we propose a multitask sequence-to-sequence approach for grapheme-to-phoneme conversion to improve the phonetization of Anglicisms. We extended a grapheme-to-phoneme model with a classifier to distinguish Anglicisms from native German words. With this approach, the model learns to generate pronunciations differently depending on the classification result. We used our model to create supplementary Anglicism pronunciation dictionaries that are added to an existing German speech recognition model. Tested on a dedicated Anglicism evaluation set, we improved the recognition of Anglicisms compared to a baseline model, reducing the word error rate by 1 % and the Anglicism error rate by 3 %. We show that multitask learning can help solving the challenge of Anglicisms in German speech recognition.
title Multitask Learning for Grapheme-to-Phoneme Conversion of Anglicisms in German Speech Recognition
topic Computation and Language
Sound
Audio and Speech Processing
url https://arxiv.org/abs/2105.12708