Saved in:
Bibliographic Details
Main Authors: Zeng, Qiuhai, Rajkumar, Sarvesh, Wang, Di, Gyanchandani, Narendra, Yan, Wenbo
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.18607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909802114842624
author Zeng, Qiuhai
Rajkumar, Sarvesh
Wang, Di
Gyanchandani, Narendra
Yan, Wenbo
author_facet Zeng, Qiuhai
Rajkumar, Sarvesh
Wang, Di
Gyanchandani, Narendra
Yan, Wenbo
contents Large Language Models (LLMs) have demonstrated remarkable capabilities in interactive decision-making tasks, but existing methods often struggle with error accumulation and lack robust self-correction mechanisms. We introduce "Reflect before Act" (REBACT), a novel approach that enhances LLM-based decision-making by introducing a critical reflect step prior to taking the next action. This approach allows for immediate error correction, ensuring smooth action path and adaptibity to environment feedback. We evaluate REBACT on three diverse interactive environments: ALFWorld, WebShop, and TextCraft. Our results demonstrate that REBACT significantly outperforms strong baselines, improving success rates by up to 24% on WebShop (achieving 61%), 6.72% on ALFWorld (achieving 98.51%), and 0.5% on TextCraft (achieving 99.5%) using Claude3.5-sonnet as the underlying LLM. Further analysis reveals that REBACT's performance improvements are achieved with only a few modification steps, demonstrating its computational efficiency.
format Preprint
id arxiv_https___arxiv_org_abs_2509_18607
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Reflect before Act: Proactive Error Correction in Language Models
Zeng, Qiuhai
Rajkumar, Sarvesh
Wang, Di
Gyanchandani, Narendra
Yan, Wenbo
Machine Learning
Large Language Models (LLMs) have demonstrated remarkable capabilities in interactive decision-making tasks, but existing methods often struggle with error accumulation and lack robust self-correction mechanisms. We introduce "Reflect before Act" (REBACT), a novel approach that enhances LLM-based decision-making by introducing a critical reflect step prior to taking the next action. This approach allows for immediate error correction, ensuring smooth action path and adaptibity to environment feedback. We evaluate REBACT on three diverse interactive environments: ALFWorld, WebShop, and TextCraft. Our results demonstrate that REBACT significantly outperforms strong baselines, improving success rates by up to 24% on WebShop (achieving 61%), 6.72% on ALFWorld (achieving 98.51%), and 0.5% on TextCraft (achieving 99.5%) using Claude3.5-sonnet as the underlying LLM. Further analysis reveals that REBACT's performance improvements are achieved with only a few modification steps, demonstrating its computational efficiency.
title Reflect before Act: Proactive Error Correction in Language Models
topic Machine Learning
url https://arxiv.org/abs/2509.18607