Saved in:
Bibliographic Details
Main Authors: Kienle, Claudius, Alt, Benjamin, Katic, Darko, Jäkel, Rainer, Peters, Jan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.08704
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913875881885696
author Kienle, Claudius
Alt, Benjamin
Katic, Darko
Jäkel, Rainer
Peters, Jan
author_facet Kienle, Claudius
Alt, Benjamin
Katic, Darko
Jäkel, Rainer
Peters, Jan
contents CAD models are widely used in industry and are essential for robotic automation processes. However, these models are rarely considered in novel AI-based approaches, such as the automatic synthesis of robot programs, as there are no readily available methods that would allow CAD models to be incorporated for the analysis, interpretation, or extraction of information. To address these limitations, we propose QueryCAD, the first system designed for CAD question answering, enabling the extraction of precise information from CAD models using natural language queries. QueryCAD incorporates SegCAD, an open-vocabulary instance segmentation model we developed to identify and select specific parts of the CAD model based on part descriptions. We further propose a CAD question answering benchmark to evaluate QueryCAD and establish a foundation for future research. Lastly, we integrate QueryCAD within an automatic robot program synthesis framework, validating its ability to enhance deep-learning solutions for robotics by enabling them to process CAD models (https://claudius-kienle.github.com/querycad).
format Preprint
id arxiv_https___arxiv_org_abs_2409_08704
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle QueryCAD: Grounded Question Answering for CAD Models
Kienle, Claudius
Alt, Benjamin
Katic, Darko
Jäkel, Rainer
Peters, Jan
Robotics
CAD models are widely used in industry and are essential for robotic automation processes. However, these models are rarely considered in novel AI-based approaches, such as the automatic synthesis of robot programs, as there are no readily available methods that would allow CAD models to be incorporated for the analysis, interpretation, or extraction of information. To address these limitations, we propose QueryCAD, the first system designed for CAD question answering, enabling the extraction of precise information from CAD models using natural language queries. QueryCAD incorporates SegCAD, an open-vocabulary instance segmentation model we developed to identify and select specific parts of the CAD model based on part descriptions. We further propose a CAD question answering benchmark to evaluate QueryCAD and establish a foundation for future research. Lastly, we integrate QueryCAD within an automatic robot program synthesis framework, validating its ability to enhance deep-learning solutions for robotics by enabling them to process CAD models (https://claudius-kienle.github.com/querycad).
title QueryCAD: Grounded Question Answering for CAD Models
topic Robotics
url https://arxiv.org/abs/2409.08704