Enregistré dans:
Détails bibliographiques
Auteurs principaux: Cheng, Qilong, Mackay, Matthew, Bereyhi, Ali
Format: Preprint
Publié: 2026
Sujets:
Accès en ligne:https://arxiv.org/abs/2602.17537
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866912913649827840
author Cheng, Qilong
Mackay, Matthew
Bereyhi, Ali
author_facet Cheng, Qilong
Mackay, Matthew
Bereyhi, Ali
contents Robotic camera systems enable dynamic, repeatable motion beyond human capabilities, yet their adoption remains limited by the high cost and operational complexity of industrial-grade platforms. We present the Intelligent Robotic Imaging System (IRIS), a task-specific 6-DOF manipulator designed for autonomous, learning-driven cinematic motion control. IRIS integrates a lightweight, fully 3D-printed hardware design with a goal-conditioned visuomotor imitation learning framework based on Action Chunking with Transformers (ACT). The system learns object-aware and perceptually smooth camera trajectories directly from human demonstrations, eliminating the need for explicit geometric programming. The complete platform costs under $1,000 USD, supports a 1.5 kg payload, and achieves approximately 1 mm repeatability. Real-world experiments demonstrate accurate trajectory tracking, reliable autonomous execution, and generalization across diverse cinematic motions.
format Preprint
id arxiv_https___arxiv_org_abs_2602_17537
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control
Cheng, Qilong
Mackay, Matthew
Bereyhi, Ali
Robotics
Machine Learning
Robotic camera systems enable dynamic, repeatable motion beyond human capabilities, yet their adoption remains limited by the high cost and operational complexity of industrial-grade platforms. We present the Intelligent Robotic Imaging System (IRIS), a task-specific 6-DOF manipulator designed for autonomous, learning-driven cinematic motion control. IRIS integrates a lightweight, fully 3D-printed hardware design with a goal-conditioned visuomotor imitation learning framework based on Action Chunking with Transformers (ACT). The system learns object-aware and perceptually smooth camera trajectories directly from human demonstrations, eliminating the need for explicit geometric programming. The complete platform costs under $1,000 USD, supports a 1.5 kg payload, and achieves approximately 1 mm repeatability. Real-world experiments demonstrate accurate trajectory tracking, reliable autonomous execution, and generalization across diverse cinematic motions.
title IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control
topic Robotics
Machine Learning
url https://arxiv.org/abs/2602.17537