Saved in:
Bibliographic Details
Main Authors: Nagiredla, Kishan R., Semage, Buddhika L., A. V, Arun Kumar, Karimpanal, Thommen G., Rana, Santu
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2309.04085
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Co-designing autonomous robotic agents involves simultaneously optimizing the controller and physical design of the agent. Its inherent bi-level optimization formulation necessitates an outer loop design optimization driven by an inner loop control optimization. This can be challenging when the design space is large and each design evaluation involves a data-intensive reinforcement learning process for control optimization. To improve the sample efficiency of co-design, we propose a multi-fidelity-based exploration strategy in which we tie the controllers learned across the design spaces through a universal policy learner for warm-starting subsequent controller learning problems. Experiments performed on a wide range of agent design problems demonstrate the superiority of our method compared to baselines. Additionally, analysis of the optimized designs shows interesting design alterations, including design simplifications and non-intuitive alterations.