Saved in:
Bibliographic Details
Main Authors: Yang, Jingpu, Han, Zehua, Xiang, Mengyu, Wang, Helin, Huang, Yuxiao, Fang, Miao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.14849
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914689593638912
author Yang, Jingpu
Han, Zehua
Xiang, Mengyu
Wang, Helin
Huang, Yuxiao
Fang, Miao
author_facet Yang, Jingpu
Han, Zehua
Xiang, Mengyu
Wang, Helin
Huang, Yuxiao
Fang, Miao
contents With the rapid advancement of Neural Machine Translation (NMT), enhancing translation efficiency and quality has become a focal point of research. Despite the commendable performance of general models such as the Transformer in various aspects, they still fall short in processing long sentences and fully leveraging bidirectional contextual information. This paper introduces an improved model based on the Transformer, implementing an asynchronous and segmented bidirectional decoding strategy aimed at elevating translation efficiency and accuracy. Compared to traditional unidirectional translations from left-to-right or right-to-left, our method demonstrates heightened efficiency and improved translation quality, particularly in handling long sentences. Experimental results on the IWSLT2017 dataset confirm the effectiveness of our approach in accelerating translation and increasing accuracy, especially surpassing traditional unidirectional strategies in long sentence translation. Furthermore, this study analyzes the impact of sentence length on decoding outcomes and explores the model's performance in various scenarios. The findings of this research not only provide an effective encoding strategy for the NMT field but also pave new avenues and directions for future studies.
format Preprint
id arxiv_https___arxiv_org_abs_2402_14849
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Asynchronous and Segmented Bidirectional Encoding for NMT
Yang, Jingpu
Han, Zehua
Xiang, Mengyu
Wang, Helin
Huang, Yuxiao
Fang, Miao
Computation and Language
Artificial Intelligence
Machine Learning
With the rapid advancement of Neural Machine Translation (NMT), enhancing translation efficiency and quality has become a focal point of research. Despite the commendable performance of general models such as the Transformer in various aspects, they still fall short in processing long sentences and fully leveraging bidirectional contextual information. This paper introduces an improved model based on the Transformer, implementing an asynchronous and segmented bidirectional decoding strategy aimed at elevating translation efficiency and accuracy. Compared to traditional unidirectional translations from left-to-right or right-to-left, our method demonstrates heightened efficiency and improved translation quality, particularly in handling long sentences. Experimental results on the IWSLT2017 dataset confirm the effectiveness of our approach in accelerating translation and increasing accuracy, especially surpassing traditional unidirectional strategies in long sentence translation. Furthermore, this study analyzes the impact of sentence length on decoding outcomes and explores the model's performance in various scenarios. The findings of this research not only provide an effective encoding strategy for the NMT field but also pave new avenues and directions for future studies.
title Asynchronous and Segmented Bidirectional Encoding for NMT
topic Computation and Language
Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2402.14849