Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zeng, Qiuhai, Rajkumar, Sarvesh, Wang, Di, Gyanchandani, Narendra, Yan, Wenbo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2509.18607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909802114842624
author	Zeng, Qiuhai Rajkumar, Sarvesh Wang, Di Gyanchandani, Narendra Yan, Wenbo
author_facet	Zeng, Qiuhai Rajkumar, Sarvesh Wang, Di Gyanchandani, Narendra Yan, Wenbo
contents	Large Language Models (LLMs) have demonstrated remarkable capabilities in interactive decision-making tasks, but existing methods often struggle with error accumulation and lack robust self-correction mechanisms. We introduce "Reflect before Act" (REBACT), a novel approach that enhances LLM-based decision-making by introducing a critical reflect step prior to taking the next action. This approach allows for immediate error correction, ensuring smooth action path and adaptibity to environment feedback. We evaluate REBACT on three diverse interactive environments: ALFWorld, WebShop, and TextCraft. Our results demonstrate that REBACT significantly outperforms strong baselines, improving success rates by up to 24% on WebShop (achieving 61%), 6.72% on ALFWorld (achieving 98.51%), and 0.5% on TextCraft (achieving 99.5%) using Claude3.5-sonnet as the underlying LLM. Further analysis reveals that REBACT's performance improvements are achieved with only a few modification steps, demonstrating its computational efficiency.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_18607
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Reflect before Act: Proactive Error Correction in Language Models Zeng, Qiuhai Rajkumar, Sarvesh Wang, Di Gyanchandani, Narendra Yan, Wenbo Machine Learning Large Language Models (LLMs) have demonstrated remarkable capabilities in interactive decision-making tasks, but existing methods often struggle with error accumulation and lack robust self-correction mechanisms. We introduce "Reflect before Act" (REBACT), a novel approach that enhances LLM-based decision-making by introducing a critical reflect step prior to taking the next action. This approach allows for immediate error correction, ensuring smooth action path and adaptibity to environment feedback. We evaluate REBACT on three diverse interactive environments: ALFWorld, WebShop, and TextCraft. Our results demonstrate that REBACT significantly outperforms strong baselines, improving success rates by up to 24% on WebShop (achieving 61%), 6.72% on ALFWorld (achieving 98.51%), and 0.5% on TextCraft (achieving 99.5%) using Claude3.5-sonnet as the underlying LLM. Further analysis reveals that REBACT's performance improvements are achieved with only a few modification steps, demonstrating its computational efficiency.
title	Reflect before Act: Proactive Error Correction in Language Models
topic	Machine Learning
url	https://arxiv.org/abs/2509.18607

Similar Items