Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jayasinghe, Nethmi, Gontero, Diana, Brown, Spencer T., Sangwan, Vinod K., Hersam, Mark C., Trivedi, Amit Ranjan
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Robotics
Online Access:	https://arxiv.org/abs/2602.07227
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

Robotic policies deployed in real-world environments often encounter post-training faults, where retraining, exploration, or system identification are impractical. We introduce an inference-time, cerebellar-inspired residual control framework that augments a frozen reinforcement learning policy with online corrective actions, enabling fault recovery without modifying base policy parameters. The framework instantiates core cerebellar principles, including high-dimensional pattern separation via fixed feature expansion, parallel microzone-style residual pathways, and local error-driven plasticity with excitatory and inhibitory eligibility traces operating at distinct time scales. These mechanisms enable fast, localized correction under post-training disturbances while avoiding destabilizing global policy updates. A conservative, performance-driven meta-adaptation regulates residual authority and plasticity, preserving nominal behavior and suppressing unnecessary intervention. Experiments on MuJoCo benchmarks under actuator, dynamic, and environmental perturbations show improvements of up to $+66\%$ on \texttt{HalfCheetah-v5} and $+53\%$ on \texttt{Humanoid-v5} under moderate faults, with graceful degradation under severe shifts and complementary robustness from consolidating persistent residual corrections into policy parameters.

Similar Items