Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Etaat, Daniel, Kalaria, Dvij, Rahmanian, Nima, Sastry, Shankar
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2503.20936
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916926684397568
author	Etaat, Daniel Kalaria, Dvij Rahmanian, Nima Sastry, Shankar
author_facet	Etaat, Daniel Kalaria, Dvij Rahmanian, Nima Sastry, Shankar
contents	Physical agility is a necessary skill in competitive table tennis, but by no means sufficient. Champions excel in this fast-paced and highly dynamic environment by anticipating their opponent's intent - buying themselves the necessary time to react. In this work, we take one step towards designing such an anticipatory agent. Previous works have developed systems capable of real-time table tennis gameplay, though they often do not leverage anticipation. Among the works that forecast opponent actions, their approaches are limited by dataset size and variety. Our paper contributes (1) a scalable system for reconstructing monocular video of table tennis matches in 3D and (2) an uncertainty-aware controller that anticipates opponent actions. We demonstrate in simulation that our policy improves the ball return rate against high-speed hits from 49.9% to 59.0% as compared to a baseline non-anticipatory policy.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_20936
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos Etaat, Daniel Kalaria, Dvij Rahmanian, Nima Sastry, Shankar Computer Vision and Pattern Recognition Artificial Intelligence Physical agility is a necessary skill in competitive table tennis, but by no means sufficient. Champions excel in this fast-paced and highly dynamic environment by anticipating their opponent's intent - buying themselves the necessary time to react. In this work, we take one step towards designing such an anticipatory agent. Previous works have developed systems capable of real-time table tennis gameplay, though they often do not leverage anticipation. Among the works that forecast opponent actions, their approaches are limited by dataset size and variety. Our paper contributes (1) a scalable system for reconstructing monocular video of table tennis matches in 3D and (2) an uncertainty-aware controller that anticipates opponent actions. We demonstrate in simulation that our policy improves the ball return rate against high-speed hits from 49.9% to 59.0% as compared to a baseline non-anticipatory policy.
title	LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos
topic	Computer Vision and Pattern Recognition Artificial Intelligence
url	https://arxiv.org/abs/2503.20936

Similar Items