Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Chunyang, Wang, Weiqi, Zheng, Tianshi, Song, Yangqiu
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.16169
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912399843393536
author	Li, Chunyang Wang, Weiqi Zheng, Tianshi Song, Yangqiu
author_facet	Li, Chunyang Wang, Weiqi Zheng, Tianshi Song, Yangqiu
contents	Inductive reasoning, a cornerstone of human cognition, enables generalization from limited data but hasn't yet been fully achieved by large language models (LLMs). While modern LLMs excel at reasoning tasks, their ability to maintain stable and consistent rule abstraction under imperfect observations remains underexplored. To fill this gap, in this work, we introduce Robust Rule Induction, a task that evaluates LLMs' capability in inferring rules from data that are fused with noisy examples. To address this task, we further propose Sample-steered Rule Refinement (SRR), a method enhancing reasoning stability via observation diversification and execution-guided feedback. Experiments across arithmetic, cryptography, and list functions reveal: (1) SRR outperforms other methods with minimal performance degradation under noise; (2) Despite slight accuracy variation, LLMs exhibit instability under noise (e.g., 0% accuracy change with only 70% consistent score); (3) Counterfactual task gaps highlight LLMs' reliance on memorized patterns over genuine abstraction. Our findings challenge LLMs' reasoning robustness, revealing susceptibility to hypothesis drift and pattern overfitting, while providing empirical evidence critical for developing human-like inductive systems. Code and data are available at https://github.com/HKUST-KnowComp/Robust-Rule-Induction.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_16169
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations Li, Chunyang Wang, Weiqi Zheng, Tianshi Song, Yangqiu Artificial Intelligence Inductive reasoning, a cornerstone of human cognition, enables generalization from limited data but hasn't yet been fully achieved by large language models (LLMs). While modern LLMs excel at reasoning tasks, their ability to maintain stable and consistent rule abstraction under imperfect observations remains underexplored. To fill this gap, in this work, we introduce Robust Rule Induction, a task that evaluates LLMs' capability in inferring rules from data that are fused with noisy examples. To address this task, we further propose Sample-steered Rule Refinement (SRR), a method enhancing reasoning stability via observation diversification and execution-guided feedback. Experiments across arithmetic, cryptography, and list functions reveal: (1) SRR outperforms other methods with minimal performance degradation under noise; (2) Despite slight accuracy variation, LLMs exhibit instability under noise (e.g., 0% accuracy change with only 70% consistent score); (3) Counterfactual task gaps highlight LLMs' reliance on memorized patterns over genuine abstraction. Our findings challenge LLMs' reasoning robustness, revealing susceptibility to hypothesis drift and pattern overfitting, while providing empirical evidence critical for developing human-like inductive systems. Code and data are available at https://github.com/HKUST-KnowComp/Robust-Rule-Induction.
title	Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations
topic	Artificial Intelligence
url	https://arxiv.org/abs/2502.16169

Similar Items