Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yang, Yuancheng, Yang, Lin, Wang, Xu, Tong, Chao, Yang, Haihua
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.21228
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910148272848896
author	Yang, Yuancheng Yang, Lin Wang, Xu Tong, Chao Yang, Haihua
author_facet	Yang, Yuancheng Yang, Lin Wang, Xu Tong, Chao Yang, Haihua
contents	As applications of large language models (LLMs) become increasingly complex, the demand for robust complex instruction following capabilities is growing accordingly. We argue that a thorough understanding of the instruction itself, especially the latent reasoning structure embedded between the lines, is crucial for improving instruction following. Therefore we target complex instructions that involve implicit reasoning, intricate logical relations, and multi-constraint dependencies. We propose ImpRIF, a method to enhance LLMs' understanding of implicit reasoning instructions, thereby improving its ability to follow complex instructions. We formalize such instructions as verifiable reasoning graphs, enabling programmatic verification and graph-driven chain-of-thought reasoning. Based on this formulation, we synthesize large-scale single- and multi-turn data, propose fine-tuning with graph reasoning, and apply reinforcement learning to explicitly train models to reason along the graph. On five complex instruction following benchmarks, our models substantially outperform their base models. These results demonstrate that enhancing implicit reasoning capabilities can significantly improve complex instruction following.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_21228
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	ImpRIF: Stronger Implicit Reasoning Leads to Better Complex Instruction Following Yang, Yuancheng Yang, Lin Wang, Xu Tong, Chao Yang, Haihua Computation and Language Artificial Intelligence As applications of large language models (LLMs) become increasingly complex, the demand for robust complex instruction following capabilities is growing accordingly. We argue that a thorough understanding of the instruction itself, especially the latent reasoning structure embedded between the lines, is crucial for improving instruction following. Therefore we target complex instructions that involve implicit reasoning, intricate logical relations, and multi-constraint dependencies. We propose ImpRIF, a method to enhance LLMs' understanding of implicit reasoning instructions, thereby improving its ability to follow complex instructions. We formalize such instructions as verifiable reasoning graphs, enabling programmatic verification and graph-driven chain-of-thought reasoning. Based on this formulation, we synthesize large-scale single- and multi-turn data, propose fine-tuning with graph reasoning, and apply reinforcement learning to explicitly train models to reason along the graph. On five complex instruction following benchmarks, our models substantially outperform their base models. These results demonstrate that enhancing implicit reasoning capabilities can significantly improve complex instruction following.
title	ImpRIF: Stronger Implicit Reasoning Leads to Better Complex Instruction Following
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2602.21228

Similar Items