Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wu, Hongqiu, Liu, Linfeng, Zhao, Hai, Zhang, Min
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2310.05450
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916077710082048
author	Wu, Hongqiu Liu, Linfeng Zhao, Hai Zhang, Min
author_facet	Wu, Hongqiu Liu, Linfeng Zhao, Hai Zhang, Min
contents	Beyond the great cognitive powers showcased by language models, it is crucial to scrutinize whether their reasoning capabilities stem from strong generalization or merely exposure to relevant data. As opposed to constructing increasingly complex logic, this paper probes into the boolean logic, the root capability of a logical reasoner. We find that any pre-trained language models even including large language models only behave like a random selector in the face of multi-nested boolean logic, a task that humans can handle with ease. To empower language models with this fundamental capability, this paper proposes a new self-supervised learning method \textit{Curriculum Logical Reasoning} (\textsc{Clr}), where we augment the training data with nested boolean logic chain step-by-step, and program the training from simpler logical patterns gradually to harder ones. This new training paradigm allows language models to effectively generalize to much harder and longer-hop logic, which can hardly be learned through naive training. Furthermore, we show that boolean logic is a great foundation for improving the subsequent general logical tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2310_05450
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Empower Nested Boolean Logic via Self-Supervised Curriculum Learning Wu, Hongqiu Liu, Linfeng Zhao, Hai Zhang, Min Computation and Language Beyond the great cognitive powers showcased by language models, it is crucial to scrutinize whether their reasoning capabilities stem from strong generalization or merely exposure to relevant data. As opposed to constructing increasingly complex logic, this paper probes into the boolean logic, the root capability of a logical reasoner. We find that any pre-trained language models even including large language models only behave like a random selector in the face of multi-nested boolean logic, a task that humans can handle with ease. To empower language models with this fundamental capability, this paper proposes a new self-supervised learning method \textit{Curriculum Logical Reasoning} (\textsc{Clr}), where we augment the training data with nested boolean logic chain step-by-step, and program the training from simpler logical patterns gradually to harder ones. This new training paradigm allows language models to effectively generalize to much harder and longer-hop logic, which can hardly be learned through naive training. Furthermore, we show that boolean logic is a great foundation for improving the subsequent general logical tasks.
title	Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
topic	Computation and Language
url	https://arxiv.org/abs/2310.05450

Similar Items