Saved in:
Bibliographic Details
Main Authors: Orimo, Yuki, Kurata, Iori, Mori, Hodaka, Okuno, Ryuhei, Sawada, Ryohto, Okanohara, Daisuke
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.03549
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912745972039680
author Orimo, Yuki
Kurata, Iori
Mori, Hodaka
Okuno, Ryuhei
Sawada, Ryohto
Okanohara, Daisuke
author_facet Orimo, Yuki
Kurata, Iori
Mori, Hodaka
Okuno, Ryuhei
Sawada, Ryohto
Okanohara, Daisuke
contents We introduce PARC, a coding agent for the autonomous and robust execution of long-horizon computational tasks. PARC is built on a hierarchical multi-agent architecture incorporating task planning, execution, and a mechanism that evaluates its own actions and their outcomes from an independent context and provides feedback, namely self-assessment and self-feedback. This design enables PARC to detect and correct high-level strategic errors and sustain progress without human intervention. We evaluate PARC across computational science and data science tasks. In materials science, it autonomously reproduces key results from studies on lithium-ion conduction and alloy segregation. In particular, it coordinates dozens of parallel simulation tasks, each requiring roughly 43 hours of computation, managing orchestration, monitoring, and error correction end-to-end. In Kaggle-based experiments, starting from minimal natural-language instructions, PARC conducts data analysis and implements search strategies, producing solutions competitive with human-engineered baselines. These results highlight the potential of integrating a hierarchical multi-agent system with self-assessment and self-feedback to enable AI systems capable of independent, large-scale scientific and analytical work.
format Preprint
id arxiv_https___arxiv_org_abs_2512_03549
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
Orimo, Yuki
Kurata, Iori
Mori, Hodaka
Okuno, Ryuhei
Sawada, Ryohto
Okanohara, Daisuke
Artificial Intelligence
We introduce PARC, a coding agent for the autonomous and robust execution of long-horizon computational tasks. PARC is built on a hierarchical multi-agent architecture incorporating task planning, execution, and a mechanism that evaluates its own actions and their outcomes from an independent context and provides feedback, namely self-assessment and self-feedback. This design enables PARC to detect and correct high-level strategic errors and sustain progress without human intervention. We evaluate PARC across computational science and data science tasks. In materials science, it autonomously reproduces key results from studies on lithium-ion conduction and alloy segregation. In particular, it coordinates dozens of parallel simulation tasks, each requiring roughly 43 hours of computation, managing orchestration, monitoring, and error correction end-to-end. In Kaggle-based experiments, starting from minimal natural-language instructions, PARC conducts data analysis and implements search strategies, producing solutions competitive with human-engineered baselines. These results highlight the potential of integrating a hierarchical multi-agent system with self-assessment and self-feedback to enable AI systems capable of independent, large-scale scientific and analytical work.
title PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
topic Artificial Intelligence
url https://arxiv.org/abs/2512.03549