Αποθηκεύτηκε σε:
Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριοι συγγραφείς: Zhou, Yuqing, Tang, Ruixiang, Yao, Ziyu, Zhu, Ziwei
Μορφή: Preprint
Έκδοση: 2024
Θέματα:
Διαθέσιμο Online:https://arxiv.org/abs/2409.17455
Ετικέτες: Προσθήκη ετικέτας
Δεν υπάρχουν, Καταχωρήστε ετικέτα πρώτοι!
_version_ 1866909385301688320
author Zhou, Yuqing
Tang, Ruixiang
Yao, Ziyu
Zhu, Ziwei
author_facet Zhou, Yuqing
Tang, Ruixiang
Yao, Ziyu
Zhu, Ziwei
contents Language models (LMs), despite their advances, often depend on spurious correlations, undermining their accuracy and generalizability. This study addresses the overlooked impact of subtler, more complex shortcuts that compromise model reliability beyond oversimplified shortcuts. We introduce a comprehensive benchmark that categorizes shortcuts into occurrence, style, and concept, aiming to explore the nuanced ways in which these shortcuts influence the performance of LMs. Through extensive experiments across traditional LMs, large language models, and state-of-the-art robust models, our research systematically investigates models' resilience and susceptibilities to sophisticated shortcuts. Our benchmark and code can be found at: https://github.com/yuqing-zhou/shortcut-learning-in-text-classification.
format Preprint
id arxiv_https___arxiv_org_abs_2409_17455
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models
Zhou, Yuqing
Tang, Ruixiang
Yao, Ziyu
Zhu, Ziwei
Computation and Language
Machine Learning
Language models (LMs), despite their advances, often depend on spurious correlations, undermining their accuracy and generalizability. This study addresses the overlooked impact of subtler, more complex shortcuts that compromise model reliability beyond oversimplified shortcuts. We introduce a comprehensive benchmark that categorizes shortcuts into occurrence, style, and concept, aiming to explore the nuanced ways in which these shortcuts influence the performance of LMs. Through extensive experiments across traditional LMs, large language models, and state-of-the-art robust models, our research systematically investigates models' resilience and susceptibilities to sophisticated shortcuts. Our benchmark and code can be found at: https://github.com/yuqing-zhou/shortcut-learning-in-text-classification.
title Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2409.17455