Saved in:
Bibliographic Details
Main Authors: Gao, Sicong, Qian, Chen, Xian, Laurence, Wu, Liao, Pagnucco, Maurice, Song, Yang
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.02798
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914234964639744
author Gao, Sicong
Qian, Chen
Xian, Laurence
Wu, Liao
Pagnucco, Maurice
Song, Yang
author_facet Gao, Sicong
Qian, Chen
Xian, Laurence
Wu, Liao
Pagnucco, Maurice
Song, Yang
contents Autonomous navigation is crucial for both medical and industrial endoscopic robots, enabling safe and efficient exploration of narrow tubular environments without continuous human intervention, where avoiding contact with the inner walls has been a longstanding challenge for prior approaches. We present a follow-the-leader endoscopic robot based on a flexible continuum structure designed to minimize contact between the endoscope body and intestinal walls, thereby reducing patient discomfort. To achieve this objective, we propose a vision-based deep reinforcement learning framework guided by monocular depth estimation. A realistic intestinal simulation environment was constructed in \textit{NVIDIA Omniverse} to train and evaluate autonomous navigation strategies. Furthermore, thousands of synthetic intraluminal images were generated using NVIDIA Replicator to fine-tune the Depth Anything model, enabling dense three-dimensional perception of the intestinal environment with a single monocular camera. Subsequently, we introduce a geometry-aware reward and penalty mechanism to enable accurate lumen tracking. Compared with the original Depth Anything model, our method improves $δ_{1}$ depth accuracy by 39.2% and reduces the navigation J-index by 0.67 relative to the second-best method, demonstrating the robustness and effectiveness of the proposed approach.
format Preprint
id arxiv_https___arxiv_org_abs_2601_02798
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Reinforcement Learning for Follow-the-Leader Robotic Endoscopic Navigation via Synthetic Data
Gao, Sicong
Qian, Chen
Xian, Laurence
Wu, Liao
Pagnucco, Maurice
Song, Yang
Robotics
Autonomous navigation is crucial for both medical and industrial endoscopic robots, enabling safe and efficient exploration of narrow tubular environments without continuous human intervention, where avoiding contact with the inner walls has been a longstanding challenge for prior approaches. We present a follow-the-leader endoscopic robot based on a flexible continuum structure designed to minimize contact between the endoscope body and intestinal walls, thereby reducing patient discomfort. To achieve this objective, we propose a vision-based deep reinforcement learning framework guided by monocular depth estimation. A realistic intestinal simulation environment was constructed in \textit{NVIDIA Omniverse} to train and evaluate autonomous navigation strategies. Furthermore, thousands of synthetic intraluminal images were generated using NVIDIA Replicator to fine-tune the Depth Anything model, enabling dense three-dimensional perception of the intestinal environment with a single monocular camera. Subsequently, we introduce a geometry-aware reward and penalty mechanism to enable accurate lumen tracking. Compared with the original Depth Anything model, our method improves $δ_{1}$ depth accuracy by 39.2% and reduces the navigation J-index by 0.67 relative to the second-best method, demonstrating the robustness and effectiveness of the proposed approach.
title Reinforcement Learning for Follow-the-Leader Robotic Endoscopic Navigation via Synthetic Data
topic Robotics
url https://arxiv.org/abs/2601.02798