Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Huang, Zhuoxu, Fan, Zhenkun, Han, Jungong, Kittler, Josef
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2606.01604
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911739236319232
author	Huang, Zhuoxu Fan, Zhenkun Han, Jungong Kittler, Josef
author_facet	Huang, Zhuoxu Fan, Zhenkun Han, Jungong Kittler, Josef
contents	Investigating spatial-temporal correlations, specifically how spatial points vary over time, is crucial for understanding point cloud videos. Traditional methods, particularly flow-based techniques, struggle with these correlations due to the unordered spatial arrangement of sequential point cloud data. To address this challenge, we propose a novel approach that regularizes spatial-temporal correlation learning by formulating the problem as a solvable Partial Differential Equation (PDE). While PDEs have long been effective in the physical domain, their application to novel sequential data like point cloud video remains underexplored. Inspired by fluid analysis, we construct a simplified PDE, and the process of solving PDE is guided and refined by a contrastive learning structure between the temporal embeddings and the spatial embeddings. With this extra supervision, our method, named MotionPDE, serves as an effective, plug-and-play enhancement module for existing backbone models, adding minimal computational overhead and parameters. Capitalizing on the contrastive learning process, we delve deeper into the self-supervised capabilities of MotionPDE, yielding promising results that underscore its utility and adaptability in point cloud video data interpretation. The code repo with trained checkpoints will be available at https://github.com/zhh6425/motionpde.git for facilitating future research.
format	Preprint
id	arxiv_https___arxiv_org_abs_2606_01604
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Paving the Way for Point Cloud Video Representation Learning Using A PDE Model Huang, Zhuoxu Fan, Zhenkun Han, Jungong Kittler, Josef Computer Vision and Pattern Recognition Investigating spatial-temporal correlations, specifically how spatial points vary over time, is crucial for understanding point cloud videos. Traditional methods, particularly flow-based techniques, struggle with these correlations due to the unordered spatial arrangement of sequential point cloud data. To address this challenge, we propose a novel approach that regularizes spatial-temporal correlation learning by formulating the problem as a solvable Partial Differential Equation (PDE). While PDEs have long been effective in the physical domain, their application to novel sequential data like point cloud video remains underexplored. Inspired by fluid analysis, we construct a simplified PDE, and the process of solving PDE is guided and refined by a contrastive learning structure between the temporal embeddings and the spatial embeddings. With this extra supervision, our method, named MotionPDE, serves as an effective, plug-and-play enhancement module for existing backbone models, adding minimal computational overhead and parameters. Capitalizing on the contrastive learning process, we delve deeper into the self-supervised capabilities of MotionPDE, yielding promising results that underscore its utility and adaptability in point cloud video data interpretation. The code repo with trained checkpoints will be available at https://github.com/zhh6425/motionpde.git for facilitating future research.
title	Paving the Way for Point Cloud Video Representation Learning Using A PDE Model
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2606.01604

Similar Items