Enregistré dans:
Détails bibliographiques
Auteurs principaux: Tejwani, Ravi, Velazquez, Karl, Payne, John, Bonato, Paolo, Asada, Harry
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2502.02772
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866910919204798464
author Tejwani, Ravi
Velazquez, Karl
Payne, John
Bonato, Paolo
Asada, Harry
author_facet Tejwani, Ravi
Velazquez, Karl
Payne, John
Bonato, Paolo
Asada, Harry
contents A method for cross-modality embedding of force profile and words is presented for synergistic coordination of verbal and haptic communication. When two people carry a large, heavy object together, they coordinate through verbal communication about the intended movements and physical forces applied to the object. This natural integration of verbal and physical cues enables effective coordination. Similarly, human-robot interaction could achieve this level of coordination by integrating verbal and haptic communication modalities. This paper presents a framework for embedding words and force profiles in a unified manner, so that the two communication modalities can be integrated and coordinated in a way that is effective and synergistic. Here, it will be shown that, although language and physical force profiles are deemed completely different, the two can be embedded in a unified latent space and proximity between the two can be quantified. In this latent space, a force profile and words can a) supplement each other, b) integrate the individual effects, and c) substitute in an exchangeable manner. First, the need for cross-modality embedding is addressed, and the basic architecture and key building block technologies are presented. Methods for data collection and implementation challenges will be addressed, followed by experimental results and discussions.
format Preprint
id arxiv_https___arxiv_org_abs_2502_02772
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Cross-modality Force and Language Embeddings for Natural Human-Robot Communication
Tejwani, Ravi
Velazquez, Karl
Payne, John
Bonato, Paolo
Asada, Harry
Robotics
Artificial Intelligence
Human-Computer Interaction
A method for cross-modality embedding of force profile and words is presented for synergistic coordination of verbal and haptic communication. When two people carry a large, heavy object together, they coordinate through verbal communication about the intended movements and physical forces applied to the object. This natural integration of verbal and physical cues enables effective coordination. Similarly, human-robot interaction could achieve this level of coordination by integrating verbal and haptic communication modalities. This paper presents a framework for embedding words and force profiles in a unified manner, so that the two communication modalities can be integrated and coordinated in a way that is effective and synergistic. Here, it will be shown that, although language and physical force profiles are deemed completely different, the two can be embedded in a unified latent space and proximity between the two can be quantified. In this latent space, a force profile and words can a) supplement each other, b) integrate the individual effects, and c) substitute in an exchangeable manner. First, the need for cross-modality embedding is addressed, and the basic architecture and key building block technologies are presented. Methods for data collection and implementation challenges will be addressed, followed by experimental results and discussions.
title Cross-modality Force and Language Embeddings for Natural Human-Robot Communication
topic Robotics
Artificial Intelligence
Human-Computer Interaction
url https://arxiv.org/abs/2502.02772