Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Han, Mingyue, Wang, Yinglin
Format:	Preprint
Published:	2021
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2107.01791
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911020655575040
author	Han, Mingyue Wang, Yinglin
author_facet	Han, Mingyue Wang, Yinglin
contents	Pretrained language models (PLM) achieve surprising performance on the Choice of Plausible Alternatives (COPA) task. However, whether PLMs have truly acquired the ability of causal reasoning remains a question. In this paper, we investigate the problem of semantic similarity bias and reveal the vulnerability of current COPA models by certain attacks. Previous solutions that tackle the superficial cues of unbalanced token distribution still encounter the same problem of semantic bias, even more seriously due to the utilization of more training data. We mitigate this problem by simply adding a regularization loss and experimental results show that this solution not only improves the model's generalization ability, but also assists the models to perform more robustly on a challenging dataset, BCOPA-CE, which has unbiased token distribution and is more difficult for models to distinguish cause and effect.
format	Preprint
id	arxiv_https___arxiv_org_abs_2107_01791
institution	arXiv
publishDate	2021
record_format	arxiv
spellingShingle	Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models Han, Mingyue Wang, Yinglin Computation and Language Pretrained language models (PLM) achieve surprising performance on the Choice of Plausible Alternatives (COPA) task. However, whether PLMs have truly acquired the ability of causal reasoning remains a question. In this paper, we investigate the problem of semantic similarity bias and reveal the vulnerability of current COPA models by certain attacks. Previous solutions that tackle the superficial cues of unbalanced token distribution still encounter the same problem of semantic bias, even more seriously due to the utilization of more training data. We mitigate this problem by simply adding a regularization loss and experimental results show that this solution not only improves the model's generalization ability, but also assists the models to perform more robustly on a challenging dataset, BCOPA-CE, which has unbiased token distribution and is more difficult for models to distinguish cause and effect.
title	Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models
topic	Computation and Language
url	https://arxiv.org/abs/2107.01791

Similar Items