Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Cheng, Qilong, Mackay, Matthew, Bereyhi, Ali
Format:	Preprint
Publié:	2026
Sujets:	Robotics Machine Learning
Accès en ligne:	https://arxiv.org/abs/2602.17537
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866912913649827840
author	Cheng, Qilong Mackay, Matthew Bereyhi, Ali
author_facet	Cheng, Qilong Mackay, Matthew Bereyhi, Ali
contents	Robotic camera systems enable dynamic, repeatable motion beyond human capabilities, yet their adoption remains limited by the high cost and operational complexity of industrial-grade platforms. We present the Intelligent Robotic Imaging System (IRIS), a task-specific 6-DOF manipulator designed for autonomous, learning-driven cinematic motion control. IRIS integrates a lightweight, fully 3D-printed hardware design with a goal-conditioned visuomotor imitation learning framework based on Action Chunking with Transformers (ACT). The system learns object-aware and perceptually smooth camera trajectories directly from human demonstrations, eliminating the need for explicit geometric programming. The complete platform costs under $1,000 USD, supports a 1.5 kg payload, and achieves approximately 1 mm repeatability. Real-world experiments demonstrate accurate trajectory tracking, reliable autonomous execution, and generalization across diverse cinematic motions.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_17537
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control Cheng, Qilong Mackay, Matthew Bereyhi, Ali Robotics Machine Learning Robotic camera systems enable dynamic, repeatable motion beyond human capabilities, yet their adoption remains limited by the high cost and operational complexity of industrial-grade platforms. We present the Intelligent Robotic Imaging System (IRIS), a task-specific 6-DOF manipulator designed for autonomous, learning-driven cinematic motion control. IRIS integrates a lightweight, fully 3D-printed hardware design with a goal-conditioned visuomotor imitation learning framework based on Action Chunking with Transformers (ACT). The system learns object-aware and perceptually smooth camera trajectories directly from human demonstrations, eliminating the need for explicit geometric programming. The complete platform costs under $1,000 USD, supports a 1.5 kg payload, and achieves approximately 1 mm repeatability. Real-world experiments demonstrate accurate trajectory tracking, reliable autonomous execution, and generalization across diverse cinematic motions.
title	IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control
topic	Robotics Machine Learning
url	https://arxiv.org/abs/2602.17537

Documents similaires