Bibliographic Details
Main Authors: Wang, Haichao, Okupnik, Alexander, Han, Yuxing, Wen, Gene, Schneider, Johannes, Flouris, Kyriakos
Format: Preprint
Published: 2026
Online Access: https://arxiv.org/abs/2604.03310
Table of Contents:
  • Long-range human movement generation remains a central challenge in computer vision and graphics. Generating coherent transitions across semantically distinct motion domains remains largely unexplored. This capability is particularly important for applications such as dance choreography, where movements must fluidly transition across diverse stylistic and semantic motifs. We propose a simple and effective inference-time optimization framework inspired by diffusion-based stochastic optimal control. Specifically, we introduce a control-energy objective that explicitly regularizes the transition trajectories of a pretrained diffusion model. We show that optimizing this objective at inference time yields transitions with fidelity and temporal coherence. This is the first work to provide a general framework for controlled long-range human motion generation with explicit transition modeling.