Saved in:
Bibliographic Details
Main Authors: Xi, Xiangming, Gao, Feng, Xu, Jun, Guo, Fangtai, Jin, Tianlei
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.00885
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910393399508992
author Xi, Xiangming
Gao, Feng
Xu, Jun
Guo, Fangtai
Jin, Tianlei
author_facet Xi, Xiangming
Gao, Feng
Xu, Jun
Guo, Fangtai
Jin, Tianlei
contents Multi-task learning (MTL) is a paradigm that simultaneously learns multiple tasks by sharing information at different levels, enhancing the performance of each individual task. While previous research has primarily focused on feature-level or parameter-level task relatedness, and proposed various model architectures and learning algorithms to improve learning performance, we aim to explore output-level task relatedness. This approach introduces a posteriori information into the model, considering that different tasks may produce correlated outputs with mutual influences. We achieve this by incorporating a feedback mechanism into MTL models, where the output of one task serves as a hidden feature for another task, thereby transforming a static MTL model into a dynamic one. To ensure the training process converges, we introduce a convergence loss that measures the trend of a task's outputs during each iteration. Additionally, we propose a Gumbel gating mechanism to determine the optimal projection of feedback signals. We validate the effectiveness of our method and evaluate its performance through experiments conducted on several baseline models in spoken language understanding.
format Preprint
id arxiv_https___arxiv_org_abs_2404_00885
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism
Xi, Xiangming
Gao, Feng
Xu, Jun
Guo, Fangtai
Jin, Tianlei
Machine Learning
Multi-task learning (MTL) is a paradigm that simultaneously learns multiple tasks by sharing information at different levels, enhancing the performance of each individual task. While previous research has primarily focused on feature-level or parameter-level task relatedness, and proposed various model architectures and learning algorithms to improve learning performance, we aim to explore output-level task relatedness. This approach introduces a posteriori information into the model, considering that different tasks may produce correlated outputs with mutual influences. We achieve this by incorporating a feedback mechanism into MTL models, where the output of one task serves as a hidden feature for another task, thereby transforming a static MTL model into a dynamic one. To ensure the training process converges, we introduce a convergence loss that measures the trend of a task's outputs during each iteration. Additionally, we propose a Gumbel gating mechanism to determine the optimal projection of feedback signals. We validate the effectiveness of our method and evaluate its performance through experiments conducted on several baseline models in spoken language understanding.
title Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism
topic Machine Learning
url https://arxiv.org/abs/2404.00885