Αποθηκεύτηκε σε:
| Κύριοι συγγραφείς: | , , , |
|---|---|
| Μορφή: | Preprint |
| Έκδοση: |
2024
|
| Θέματα: | |
| Διαθέσιμο Online: | https://arxiv.org/abs/2409.17455 |
| Ετικέτες: |
Προσθήκη ετικέτας
Δεν υπάρχουν, Καταχωρήστε ετικέτα πρώτοι!
|
| _version_ | 1866909385301688320 |
|---|---|
| author | Zhou, Yuqing Tang, Ruixiang Yao, Ziyu Zhu, Ziwei |
| author_facet | Zhou, Yuqing Tang, Ruixiang Yao, Ziyu Zhu, Ziwei |
| contents | Language models (LMs), despite their advances, often depend on spurious correlations, undermining their accuracy and generalizability. This study addresses the overlooked impact of subtler, more complex shortcuts that compromise model reliability beyond oversimplified shortcuts. We introduce a comprehensive benchmark that categorizes shortcuts into occurrence, style, and concept, aiming to explore the nuanced ways in which these shortcuts influence the performance of LMs. Through extensive experiments across traditional LMs, large language models, and state-of-the-art robust models, our research systematically investigates models' resilience and susceptibilities to sophisticated shortcuts. Our benchmark and code can be found at: https://github.com/yuqing-zhou/shortcut-learning-in-text-classification. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2409_17455 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models Zhou, Yuqing Tang, Ruixiang Yao, Ziyu Zhu, Ziwei Computation and Language Machine Learning Language models (LMs), despite their advances, often depend on spurious correlations, undermining their accuracy and generalizability. This study addresses the overlooked impact of subtler, more complex shortcuts that compromise model reliability beyond oversimplified shortcuts. We introduce a comprehensive benchmark that categorizes shortcuts into occurrence, style, and concept, aiming to explore the nuanced ways in which these shortcuts influence the performance of LMs. Through extensive experiments across traditional LMs, large language models, and state-of-the-art robust models, our research systematically investigates models' resilience and susceptibilities to sophisticated shortcuts. Our benchmark and code can be found at: https://github.com/yuqing-zhou/shortcut-learning-in-text-classification. |
| title | Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models |
| topic | Computation and Language Machine Learning |
| url | https://arxiv.org/abs/2409.17455 |