Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Recurso digital |
| Language: | |
| Published: |
Zenodo
2025
|
| Online Access: | https://doi.org/10.5281/zenodo.14929983 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- <p>Module for prediction and execution of robotic skills using vision-language models (VLMs).</p> <p>Initial textual instructions (e.g. task board completion steps) along with an optional auxiliary image (e.g. depicting taskboard components) are processed into a robot-executable task list. This module relies on a skill library (consisting of motion primitives for executing tasks , e.g. steps in taskboard benchmark).</p> <p>It can also be queried to determine action success (e.g. whether or not the door has been opened). </p> <p>Internally, it uses langchain, so the module can connect to different VLMs (local models or OpenAI API). </p>