Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Qu, Zhi, Ding, Chenchen, Watanabe, Taro
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.08092
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913782096199680
author	Qu, Zhi Ding, Chenchen Watanabe, Taro
author_facet	Qu, Zhi Ding, Chenchen Watanabe, Taro
contents	Understanding representation transfer in multilingual neural machine translation (MNMT) can reveal the reason for the zero-shot translation deficiency. In this work, we systematically analyze the representational issue of MNMT models. We first introduce the identity pair, translating a sentence to itself, to address the lack of the base measure in multilingual investigations, as the identity pair can reflect the representation of a language within the model. Then, we demonstrate that the encoder transfers the source language to the representational subspace of the target language instead of the language-agnostic state. Thus, the zero-shot translation deficiency arises because the representation of a translation is entangled with other languages and not transferred to the target language effectively. Based on our findings, we propose two methods: 1) low-rank language-specific embedding at the encoder, and 2) language-specific contrastive learning of the representation at the decoder. The experimental results on Europarl-15, TED-19, and OPUS-100 datasets show that our methods substantially enhance the performance of zero-shot translations without sacrifices in supervised directions by improving language transfer capacity, thereby providing practical evidence to support our conclusions. Codes are available at https://github.com/zhiqu22/ZeroTrans.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_08092
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Languages Transferred Within the Encoder: On Representation Transfer in Zero-Shot Multilingual Translation Qu, Zhi Ding, Chenchen Watanabe, Taro Computation and Language Understanding representation transfer in multilingual neural machine translation (MNMT) can reveal the reason for the zero-shot translation deficiency. In this work, we systematically analyze the representational issue of MNMT models. We first introduce the identity pair, translating a sentence to itself, to address the lack of the base measure in multilingual investigations, as the identity pair can reflect the representation of a language within the model. Then, we demonstrate that the encoder transfers the source language to the representational subspace of the target language instead of the language-agnostic state. Thus, the zero-shot translation deficiency arises because the representation of a translation is entangled with other languages and not transferred to the target language effectively. Based on our findings, we propose two methods: 1) low-rank language-specific embedding at the encoder, and 2) language-specific contrastive learning of the representation at the decoder. The experimental results on Europarl-15, TED-19, and OPUS-100 datasets show that our methods substantially enhance the performance of zero-shot translations without sacrifices in supervised directions by improving language transfer capacity, thereby providing practical evidence to support our conclusions. Codes are available at https://github.com/zhiqu22/ZeroTrans.
title	Languages Transferred Within the Encoder: On Representation Transfer in Zero-Shot Multilingual Translation
topic	Computation and Language
url	https://arxiv.org/abs/2406.08092

Similar Items