Saved in:
Bibliographic Details
Main Authors: Etaat, Daniel, Kalaria, Dvij, Rahmanian, Nima, Sastry, Shankar
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.20936
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916926684397568
author Etaat, Daniel
Kalaria, Dvij
Rahmanian, Nima
Sastry, Shankar
author_facet Etaat, Daniel
Kalaria, Dvij
Rahmanian, Nima
Sastry, Shankar
contents Physical agility is a necessary skill in competitive table tennis, but by no means sufficient. Champions excel in this fast-paced and highly dynamic environment by anticipating their opponent's intent - buying themselves the necessary time to react. In this work, we take one step towards designing such an anticipatory agent. Previous works have developed systems capable of real-time table tennis gameplay, though they often do not leverage anticipation. Among the works that forecast opponent actions, their approaches are limited by dataset size and variety. Our paper contributes (1) a scalable system for reconstructing monocular video of table tennis matches in 3D and (2) an uncertainty-aware controller that anticipates opponent actions. We demonstrate in simulation that our policy improves the ball return rate against high-speed hits from 49.9% to 59.0% as compared to a baseline non-anticipatory policy.
format Preprint
id arxiv_https___arxiv_org_abs_2503_20936
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos
Etaat, Daniel
Kalaria, Dvij
Rahmanian, Nima
Sastry, Shankar
Computer Vision and Pattern Recognition
Artificial Intelligence
Physical agility is a necessary skill in competitive table tennis, but by no means sufficient. Champions excel in this fast-paced and highly dynamic environment by anticipating their opponent's intent - buying themselves the necessary time to react. In this work, we take one step towards designing such an anticipatory agent. Previous works have developed systems capable of real-time table tennis gameplay, though they often do not leverage anticipation. Among the works that forecast opponent actions, their approaches are limited by dataset size and variety. Our paper contributes (1) a scalable system for reconstructing monocular video of table tennis matches in 3D and (2) an uncertainty-aware controller that anticipates opponent actions. We demonstrate in simulation that our policy improves the ball return rate against high-speed hits from 49.9% to 59.0% as compared to a baseline non-anticipatory policy.
title LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2503.20936