Saved in:
Bibliographic Details
Main Authors: Zhang, Yi-Fan, Zhang, Hanlin, Li, Li Erran, Xing, Eric
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2212.08686
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916180712751104
author Zhang, Yi-Fan
Zhang, Hanlin
Li, Li Erran
Xing, Eric
author_facet Zhang, Yi-Fan
Zhang, Hanlin
Li, Li Erran
Xing, Eric
contents Pre-trained language models (LMs) have shown remarkable reasoning performance using explanations or chain-of-thoughts (CoT)) for in-context learning. On the other hand, these reasoning tasks are usually presumed to be more approachable for symbolic programming. To understand the mechanism of reasoning of LMs, we curate synthetic datasets containing equivalent (natural, symbolic) data pairs, where symbolic examples contain first-order logic rules and predicates from non-parametric knowledge bases (KBs), supporting automated verification of intermediate reasoning results. Then we revisit neuro-symbolic approaches and propose to learn from demonstrations containing logic rules and corresponding examples to iteratively reason over KBs, recovering Prolog's backward chaining algorithm and supporting automated verification of LMs' outputs. Comprehensive experiments are included to systematically compare LMLP with CoT in deductive reasoning settings, showing that LMLP enjoys more than $25\%$ higher accuracy than CoT on length generalization benchmarks even with smaller model sizes.
format Preprint
id arxiv_https___arxiv_org_abs_2212_08686
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Evaluating Step-by-Step Reasoning through Symbolic Verification
Zhang, Yi-Fan
Zhang, Hanlin
Li, Li Erran
Xing, Eric
Computation and Language
Pre-trained language models (LMs) have shown remarkable reasoning performance using explanations or chain-of-thoughts (CoT)) for in-context learning. On the other hand, these reasoning tasks are usually presumed to be more approachable for symbolic programming. To understand the mechanism of reasoning of LMs, we curate synthetic datasets containing equivalent (natural, symbolic) data pairs, where symbolic examples contain first-order logic rules and predicates from non-parametric knowledge bases (KBs), supporting automated verification of intermediate reasoning results. Then we revisit neuro-symbolic approaches and propose to learn from demonstrations containing logic rules and corresponding examples to iteratively reason over KBs, recovering Prolog's backward chaining algorithm and supporting automated verification of LMs' outputs. Comprehensive experiments are included to systematically compare LMLP with CoT in deductive reasoning settings, showing that LMLP enjoys more than $25\%$ higher accuracy than CoT on length generalization benchmarks even with smaller model sizes.
title Evaluating Step-by-Step Reasoning through Symbolic Verification
topic Computation and Language
url https://arxiv.org/abs/2212.08686