Saved in:
Bibliographic Details
Main Authors: Shi, Dingyuan, Zhang, Lulu, Tong, Yongxin, Xu, Ke
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.00725
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916594742984704
author Shi, Dingyuan
Zhang, Lulu
Tong, Yongxin
Xu, Ke
author_facet Shi, Dingyuan
Zhang, Lulu
Tong, Yongxin
Xu, Ke
contents Advancements in mobility services, navigation systems, and smart transportation technologies have made it possible to collect large amounts of path data. Modeling the distribution of this path data, known as the Path Generation (PG) problem, is crucial for understanding urban mobility patterns and developing intelligent transportation systems. Recent studies have explored using diffusion models to address the PG problem due to their ability to capture multimodal distributions and support conditional generation. A recent work devises a diffusion process explicitly in graph space and achieves state-of-the-art performance. However, this method suffers a high computation cost in terms of both time and memory, which prohibits its application. In this paper, we analyze this method both theoretically and experimentally and find that the main culprit of its high computation cost is its explicit design of the diffusion process in graph space. To improve efficiency, we devise a Latent-space Path Diffusion (LPD) model, which operates in latent space instead of graph space. Our LPD significantly reduces both time and memory costs by up to 82.8% and 83.1%, respectively. Despite these reductions, our approach does not suffer from performance degradation. It outperforms the state-of-the-art method in most scenarios by 24.5%~34.0%.
format Preprint
id arxiv_https___arxiv_org_abs_2502_00725
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Understanding and Mitigating the High Computational Cost in Path Data Diffusion
Shi, Dingyuan
Zhang, Lulu
Tong, Yongxin
Xu, Ke
Machine Learning
Advancements in mobility services, navigation systems, and smart transportation technologies have made it possible to collect large amounts of path data. Modeling the distribution of this path data, known as the Path Generation (PG) problem, is crucial for understanding urban mobility patterns and developing intelligent transportation systems. Recent studies have explored using diffusion models to address the PG problem due to their ability to capture multimodal distributions and support conditional generation. A recent work devises a diffusion process explicitly in graph space and achieves state-of-the-art performance. However, this method suffers a high computation cost in terms of both time and memory, which prohibits its application. In this paper, we analyze this method both theoretically and experimentally and find that the main culprit of its high computation cost is its explicit design of the diffusion process in graph space. To improve efficiency, we devise a Latent-space Path Diffusion (LPD) model, which operates in latent space instead of graph space. Our LPD significantly reduces both time and memory costs by up to 82.8% and 83.1%, respectively. Despite these reductions, our approach does not suffer from performance degradation. It outperforms the state-of-the-art method in most scenarios by 24.5%~34.0%.
title Understanding and Mitigating the High Computational Cost in Path Data Diffusion
topic Machine Learning
url https://arxiv.org/abs/2502.00725