Saved in:
Bibliographic Details
Main Authors: Zhao, Weiguang, Xu, Haoran, Miao, Xingyu, Zhao, Qin, Zhang, Rui, Huang, Kaizhu, Gao, Ning, Cao, Peizhou, Sun, Mingze, Yu, Mulin, Lu, Tao, Xu, Linning, Dong, Junting, Pang, Jiangmiao
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.04441
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911421749526528
author Zhao, Weiguang
Xu, Haoran
Miao, Xingyu
Zhao, Qin
Zhang, Rui
Huang, Kaizhu
Gao, Ning
Cao, Peizhou
Sun, Mingze
Yu, Mulin
Lu, Tao
Xu, Linning
Dong, Junting
Pang, Jiangmiao
author_facet Zhao, Weiguang
Xu, Haoran
Miao, Xingyu
Zhao, Qin
Zhang, Rui
Huang, Kaizhu
Gao, Ning
Cao, Peizhou
Sun, Mingze
Yu, Mulin
Lu, Tao
Xu, Linning
Dong, Junting
Pang, Jiangmiao
contents Point tracking aims to follow visual points through complex motion, occlusion, and viewpoint changes, and has advanced rapidly with modern foundation models. Yet progress toward general point tracking remains constrained by limited high-quality data, as existing datasets often provide insufficient diversity and imperfect trajectory annotations. To this end, we introduce SynthVerse, a large-scale, diverse synthetic dataset specifically designed for point tracking. SynthVerse includes several new domains and object types missing from existing synthetic datasets, such as animated-film-style content, embodied manipulation, scene navigation, and articulated objects. SynthVerse substantially expands dataset diversity by covering a broader range of object categories and providing high-quality dynamic motions and interactions, enabling more robust training and evaluation for general point tracking. In addition, we establish a highly diverse point tracking benchmark to systematically evaluate state-of-the-art methods under broader domain shifts. Extensive experiments and analyses demonstrate that training with SynthVerse yields consistent improvements in generalization and reveal limitations of existing trackers under diverse settings.
format Preprint
id arxiv_https___arxiv_org_abs_2602_04441
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking
Zhao, Weiguang
Xu, Haoran
Miao, Xingyu
Zhao, Qin
Zhang, Rui
Huang, Kaizhu
Gao, Ning
Cao, Peizhou
Sun, Mingze
Yu, Mulin
Lu, Tao
Xu, Linning
Dong, Junting
Pang, Jiangmiao
Computer Vision and Pattern Recognition
Point tracking aims to follow visual points through complex motion, occlusion, and viewpoint changes, and has advanced rapidly with modern foundation models. Yet progress toward general point tracking remains constrained by limited high-quality data, as existing datasets often provide insufficient diversity and imperfect trajectory annotations. To this end, we introduce SynthVerse, a large-scale, diverse synthetic dataset specifically designed for point tracking. SynthVerse includes several new domains and object types missing from existing synthetic datasets, such as animated-film-style content, embodied manipulation, scene navigation, and articulated objects. SynthVerse substantially expands dataset diversity by covering a broader range of object categories and providing high-quality dynamic motions and interactions, enabling more robust training and evaluation for general point tracking. In addition, we establish a highly diverse point tracking benchmark to systematically evaluate state-of-the-art methods under broader domain shifts. Extensive experiments and analyses demonstrate that training with SynthVerse yields consistent improvements in generalization and reveal limitations of existing trackers under diverse settings.
title SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.04441