Enregistré dans:
Détails bibliographiques
Auteurs principaux: Duricic, Tomislav, Müllner, Peter, Weidinger, Nicole, ElSayed, Neven, Kowald, Dominik, Veas, Eduardo
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2407.09147
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866913428549926912
author Duricic, Tomislav
Müllner, Peter
Weidinger, Nicole
ElSayed, Neven
Kowald, Dominik
Veas, Eduardo
author_facet Duricic, Tomislav
Müllner, Peter
Weidinger, Nicole
ElSayed, Neven
Kowald, Dominik
Veas, Eduardo
contents Many industrial sectors rely on well-trained employees that are able to operate complex machinery. In this work, we demonstrate an AI-powered immersive assistance system that supports users in performing complex tasks in industrial environments. Specifically, our system leverages a VR environment that resembles a juice mixer setup. This digital twin of a physical setup simulates complex industrial machinery used to mix preparations or liquids (e.g., similar to the pharmaceutical industry) and includes various containers, sensors, pumps, and flow controllers. This setup demonstrates our system's capabilities in a controlled environment while acting as a proof-of-concept for broader industrial applications. The core components of our multimodal AI assistant are a large language model and a speech-to-text model that process a video and audio recording of an expert performing the task in a VR environment. The video and speech input extracted from the expert's video enables it to provide step-by-step guidance to support users in executing complex tasks. This demonstration showcases the potential of our AI-powered assistant to reduce cognitive load, increase productivity, and enhance safety in industrial environments.
format Preprint
id arxiv_https___arxiv_org_abs_2407_09147
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments
Duricic, Tomislav
Müllner, Peter
Weidinger, Nicole
ElSayed, Neven
Kowald, Dominik
Veas, Eduardo
Human-Computer Interaction
Information Retrieval
Many industrial sectors rely on well-trained employees that are able to operate complex machinery. In this work, we demonstrate an AI-powered immersive assistance system that supports users in performing complex tasks in industrial environments. Specifically, our system leverages a VR environment that resembles a juice mixer setup. This digital twin of a physical setup simulates complex industrial machinery used to mix preparations or liquids (e.g., similar to the pharmaceutical industry) and includes various containers, sensors, pumps, and flow controllers. This setup demonstrates our system's capabilities in a controlled environment while acting as a proof-of-concept for broader industrial applications. The core components of our multimodal AI assistant are a large language model and a speech-to-text model that process a video and audio recording of an expert performing the task in a VR environment. The video and speech input extracted from the expert's video enables it to provide step-by-step guidance to support users in executing complex tasks. This demonstration showcases the potential of our AI-powered assistant to reduce cognitive load, increase productivity, and enhance safety in industrial environments.
title AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments
topic Human-Computer Interaction
Information Retrieval
url https://arxiv.org/abs/2407.09147