Saved in:
Bibliographic Details
Main Authors: Li, Chunyang, Wang, Weiqi, Zheng, Tianshi, Song, Yangqiu
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.16169
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912399843393536
author Li, Chunyang
Wang, Weiqi
Zheng, Tianshi
Song, Yangqiu
author_facet Li, Chunyang
Wang, Weiqi
Zheng, Tianshi
Song, Yangqiu
contents Inductive reasoning, a cornerstone of human cognition, enables generalization from limited data but hasn't yet been fully achieved by large language models (LLMs). While modern LLMs excel at reasoning tasks, their ability to maintain stable and consistent rule abstraction under imperfect observations remains underexplored. To fill this gap, in this work, we introduce Robust Rule Induction, a task that evaluates LLMs' capability in inferring rules from data that are fused with noisy examples. To address this task, we further propose Sample-steered Rule Refinement (SRR), a method enhancing reasoning stability via observation diversification and execution-guided feedback. Experiments across arithmetic, cryptography, and list functions reveal: (1) SRR outperforms other methods with minimal performance degradation under noise; (2) Despite slight accuracy variation, LLMs exhibit instability under noise (e.g., 0% accuracy change with only 70% consistent score); (3) Counterfactual task gaps highlight LLMs' reliance on memorized patterns over genuine abstraction. Our findings challenge LLMs' reasoning robustness, revealing susceptibility to hypothesis drift and pattern overfitting, while providing empirical evidence critical for developing human-like inductive systems. Code and data are available at https://github.com/HKUST-KnowComp/Robust-Rule-Induction.
format Preprint
id arxiv_https___arxiv_org_abs_2502_16169
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations
Li, Chunyang
Wang, Weiqi
Zheng, Tianshi
Song, Yangqiu
Artificial Intelligence
Inductive reasoning, a cornerstone of human cognition, enables generalization from limited data but hasn't yet been fully achieved by large language models (LLMs). While modern LLMs excel at reasoning tasks, their ability to maintain stable and consistent rule abstraction under imperfect observations remains underexplored. To fill this gap, in this work, we introduce Robust Rule Induction, a task that evaluates LLMs' capability in inferring rules from data that are fused with noisy examples. To address this task, we further propose Sample-steered Rule Refinement (SRR), a method enhancing reasoning stability via observation diversification and execution-guided feedback. Experiments across arithmetic, cryptography, and list functions reveal: (1) SRR outperforms other methods with minimal performance degradation under noise; (2) Despite slight accuracy variation, LLMs exhibit instability under noise (e.g., 0% accuracy change with only 70% consistent score); (3) Counterfactual task gaps highlight LLMs' reliance on memorized patterns over genuine abstraction. Our findings challenge LLMs' reasoning robustness, revealing susceptibility to hypothesis drift and pattern overfitting, while providing empirical evidence critical for developing human-like inductive systems. Code and data are available at https://github.com/HKUST-KnowComp/Robust-Rule-Induction.
title Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations
topic Artificial Intelligence
url https://arxiv.org/abs/2502.16169