Bibliographic Details
Main Authors: Kuster, Boris, Simonič, Mihael, Mavsar, Matija, Nemec, Bojan
Format: Digital resource
Language:
Published: Zenodo 2025
Online Access: https://doi.org/10.5281/zenodo.14929983
_version_ 1866902092450365440
author Kuster, Boris
Simonič, Mihael
Mavsar, Matija
Nemec, Bojan
author_facet Kuster, Boris
Simonič, Mihael
Mavsar, Matija
Nemec, Bojan
contents <p>Module for predicting and executing robotic skills using vision-language models (VLMs).</p> <p>Initial textual instructions (e.g. task board completion steps), along with an optional auxiliary image (e.g. one depicting task board components), are processed into a robot-executable task list. The module relies on a skill library consisting of motion primitives for executing tasks (e.g. the steps of a task board benchmark).</p> <p>It can also be queried to determine whether an action succeeded (e.g. whether the door has been opened).</p> <p>Internally, it uses LangChain, so the module can connect to different VLMs (local models or the OpenAI API).</p>
format Digital resource
id zenodo_https___doi_org_10_5281_zenodo_14929983
institution Zenodo
language
publishDate 2025
publisher Zenodo
record_format zenodo
title VLM Action Parser Library
url https://doi.org/10.5281/zenodo.14929983