Saved in:
Bibliographic Details
Main Authors: Yu, Seungjun, Kim, Kisung, Kim, Daejung, Han, Haewook, Lee, Jinhan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.12178
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909499220033536
author Yu, Seungjun
Kim, Kisung
Kim, Daejung
Han, Haewook
Lee, Jinhan
author_facet Yu, Seungjun
Kim, Kisung
Kim, Daejung
Han, Haewook
Lee, Jinhan
contents Diffusion-based models are recognized for their effectiveness in using real-world driving data to generate realistic and diverse traffic scenarios. These models employ guided sampling to incorporate specific traffic preferences and enhance scenario realism. However, guiding the sampling process to conform to traffic rules and preferences can result in deviations from real-world traffic priors and potentially leading to unrealistic behaviors. To address this challenge, we introduce a multi-guided diffusion model that utilizes a novel training strategy to closely adhere to traffic priors, even when employing various combinations of guides. This model adopts a multi-task learning framework, enabling a single diffusion model to process various guide inputs. For increased guided sampling precision, our model is fine-tuned using the Direct Preference Optimization (DPO) algorithm. This algorithm optimizes preferences based on guide scores, effectively navigating the complexities and challenges associated with the expensive and often non-differentiable gradient calculations during the guided sampling fine-tuning process. Evaluated using the nuScenes dataset our model provides a strong baseline for balancing realism, diversity and controllability in the traffic scenario generation.
format Preprint
id arxiv_https___arxiv_org_abs_2502_12178
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation
Yu, Seungjun
Kim, Kisung
Kim, Daejung
Han, Haewook
Lee, Jinhan
Machine Learning
Multiagent Systems
Diffusion-based models are recognized for their effectiveness in using real-world driving data to generate realistic and diverse traffic scenarios. These models employ guided sampling to incorporate specific traffic preferences and enhance scenario realism. However, guiding the sampling process to conform to traffic rules and preferences can result in deviations from real-world traffic priors and potentially leading to unrealistic behaviors. To address this challenge, we introduce a multi-guided diffusion model that utilizes a novel training strategy to closely adhere to traffic priors, even when employing various combinations of guides. This model adopts a multi-task learning framework, enabling a single diffusion model to process various guide inputs. For increased guided sampling precision, our model is fine-tuned using the Direct Preference Optimization (DPO) algorithm. This algorithm optimizes preferences based on guide scores, effectively navigating the complexities and challenges associated with the expensive and often non-differentiable gradient calculations during the guided sampling fine-tuning process. Evaluated using the nuScenes dataset our model provides a strong baseline for balancing realism, diversity and controllability in the traffic scenario generation.
title Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation
topic Machine Learning
Multiagent Systems
url https://arxiv.org/abs/2502.12178