Saved in:
Bibliographic Details
Main Authors: Han, Mingyue, Wang, Yinglin
Format: Preprint
Published: 2021
Subjects:
Online Access:https://arxiv.org/abs/2107.01791
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911020655575040
author Han, Mingyue
Wang, Yinglin
author_facet Han, Mingyue
Wang, Yinglin
contents Pretrained language models (PLM) achieve surprising performance on the Choice of Plausible Alternatives (COPA) task. However, whether PLMs have truly acquired the ability of causal reasoning remains a question. In this paper, we investigate the problem of semantic similarity bias and reveal the vulnerability of current COPA models by certain attacks. Previous solutions that tackle the superficial cues of unbalanced token distribution still encounter the same problem of semantic bias, even more seriously due to the utilization of more training data. We mitigate this problem by simply adding a regularization loss and experimental results show that this solution not only improves the model's generalization ability, but also assists the models to perform more robustly on a challenging dataset, BCOPA-CE, which has unbiased token distribution and is more difficult for models to distinguish cause and effect.
format Preprint
id arxiv_https___arxiv_org_abs_2107_01791
institution arXiv
publishDate 2021
record_format arxiv
spellingShingle Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models
Han, Mingyue
Wang, Yinglin
Computation and Language
Pretrained language models (PLM) achieve surprising performance on the Choice of Plausible Alternatives (COPA) task. However, whether PLMs have truly acquired the ability of causal reasoning remains a question. In this paper, we investigate the problem of semantic similarity bias and reveal the vulnerability of current COPA models by certain attacks. Previous solutions that tackle the superficial cues of unbalanced token distribution still encounter the same problem of semantic bias, even more seriously due to the utilization of more training data. We mitigate this problem by simply adding a regularization loss and experimental results show that this solution not only improves the model's generalization ability, but also assists the models to perform more robustly on a challenging dataset, BCOPA-CE, which has unbiased token distribution and is more difficult for models to distinguish cause and effect.
title Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models
topic Computation and Language
url https://arxiv.org/abs/2107.01791