Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Bien, Seongjin, Kneissl, Carlo, Jülg, Tobias, Fundel, Frank, Ressler-Antal, Thomas, Walter, Florian, Ommer, Björn, Kutyniok, Gitta, Burgard, Wolfram
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Robotics Machine Learning
Online-Zugang:	https://arxiv.org/abs/2603.08255
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866911498975051776
author	Bien, Seongjin Kneissl, Carlo Jülg, Tobias Fundel, Frank Ressler-Antal, Thomas Walter, Florian Ommer, Björn Kutyniok, Gitta Burgard, Wolfram
author_facet	Bien, Seongjin Kneissl, Carlo Jülg, Tobias Fundel, Frank Ressler-Antal, Thomas Walter, Florian Ommer, Björn Kutyniok, Gitta Burgard, Wolfram
contents	Tactile sensation is essential for contact-rich manipulation tasks. It provides direct feedback on object geometry, surface properties, and interaction forces, enhancing perception and enabling fine-grained control. An inherent limitation of tactile sensors is that readings are available only when an object is touched. This precludes their use during planning and the initial execution phase of a task. Predicting tactile information from visual information can bridge this gap. A common approach is to learn a direct mapping from camera images to the output of vision-based tactile sensors. However, the resulting model will depend strongly on the specific setup and on how well the camera can capture the area where an object is touched. In this work, we introduce FlowTouch, a novel model for view-invariant visuo-tactile prediction. Our key idea is to use an object's local 3D mesh to encode rich information for predicting tactile patterns while abstracting away from scene-dependent details. FlowTouch integrates scene reconstruction and Flow Matching-based models for image generation. Our results show that FlowTouch is able to bridge the sim-to-real gap and generalize to new sensor instances. We further show that the resulting tactile images can be used for downstream grasp stability prediction. Our code, datasets and videos are available at https://flowtouch.github.io/
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_08255
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	FlowTouch: View-Invariant Visuo-Tactile Prediction Bien, Seongjin Kneissl, Carlo Jülg, Tobias Fundel, Frank Ressler-Antal, Thomas Walter, Florian Ommer, Björn Kutyniok, Gitta Burgard, Wolfram Robotics Machine Learning Tactile sensation is essential for contact-rich manipulation tasks. It provides direct feedback on object geometry, surface properties, and interaction forces, enhancing perception and enabling fine-grained control. An inherent limitation of tactile sensors is that readings are available only when an object is touched. This precludes their use during planning and the initial execution phase of a task. Predicting tactile information from visual information can bridge this gap. A common approach is to learn a direct mapping from camera images to the output of vision-based tactile sensors. However, the resulting model will depend strongly on the specific setup and on how well the camera can capture the area where an object is touched. In this work, we introduce FlowTouch, a novel model for view-invariant visuo-tactile prediction. Our key idea is to use an object's local 3D mesh to encode rich information for predicting tactile patterns while abstracting away from scene-dependent details. FlowTouch integrates scene reconstruction and Flow Matching-based models for image generation. Our results show that FlowTouch is able to bridge the sim-to-real gap and generalize to new sensor instances. We further show that the resulting tactile images can be used for downstream grasp stability prediction. Our code, datasets and videos are available at https://flowtouch.github.io/
title	FlowTouch: View-Invariant Visuo-Tactile Prediction
topic	Robotics Machine Learning
url	https://arxiv.org/abs/2603.08255

Ähnliche Einträge