Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Luo, Xinglong, Luo, Ao, Wang, Zhengning, Yang, Yueqi, Feng, Chaoyu, Lei, Lei, Zeng, Bing, Liu, Shuaicheng
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.23022
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917362029035520
author	Luo, Xinglong Luo, Ao Wang, Zhengning Yang, Yueqi Feng, Chaoyu Lei, Lei Zeng, Bing Liu, Shuaicheng
author_facet	Luo, Xinglong Luo, Ao Wang, Zhengning Yang, Yueqi Feng, Chaoyu Lei, Lei Zeng, Bing Liu, Shuaicheng
contents	Image alignment is a fundamental task in computer vision with broad applications. Existing methods predominantly employ optical flow-based image warping. However, this technique is susceptible to common challenges such as occlusions and illumination variations, leading to degraded alignment visual quality and compromised accuracy in downstream tasks. In this paper, we present DMAligner, a diffusion-based framework for image alignment through alignment-oriented view synthesis. DMAligner is crafted to tackle the challenges in image alignment from a new perspective, employing a generation-based solution that showcases strong capabilities and avoids the problems associated with flow-based image warping. Specifically, we propose a Dynamics-aware Diffusion Training approach for learning conditional image generation, synthesizing a novel view for image alignment. This incorporates a Dynamics-aware Mask Producing (DMP) module to adaptively distinguish dynamic foreground regions from static backgrounds, enabling the diffusion model to more effectively handle challenges that classical methods struggle to solve. Furthermore, we develop the Dynamic Scene Image Alignment (DSIA) dataset using Blender, which includes 1,033 indoor and outdoor scenes with over 30K image pairs tailored for image alignment. Extensive experimental results demonstrate the superiority of the proposed approach on DSIA benchmarks, as well as on a series of widely-used video datasets for qualitative comparisons. Our code is available at https://github.com/boomluo02/DMAligner.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_23022
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis Luo, Xinglong Luo, Ao Wang, Zhengning Yang, Yueqi Feng, Chaoyu Lei, Lei Zeng, Bing Liu, Shuaicheng Computer Vision and Pattern Recognition Image alignment is a fundamental task in computer vision with broad applications. Existing methods predominantly employ optical flow-based image warping. However, this technique is susceptible to common challenges such as occlusions and illumination variations, leading to degraded alignment visual quality and compromised accuracy in downstream tasks. In this paper, we present DMAligner, a diffusion-based framework for image alignment through alignment-oriented view synthesis. DMAligner is crafted to tackle the challenges in image alignment from a new perspective, employing a generation-based solution that showcases strong capabilities and avoids the problems associated with flow-based image warping. Specifically, we propose a Dynamics-aware Diffusion Training approach for learning conditional image generation, synthesizing a novel view for image alignment. This incorporates a Dynamics-aware Mask Producing (DMP) module to adaptively distinguish dynamic foreground regions from static backgrounds, enabling the diffusion model to more effectively handle challenges that classical methods struggle to solve. Furthermore, we develop the Dynamic Scene Image Alignment (DSIA) dataset using Blender, which includes 1,033 indoor and outdoor scenes with over 30K image pairs tailored for image alignment. Extensive experimental results demonstrate the superiority of the proposed approach on DSIA benchmarks, as well as on a series of widely-used video datasets for qualitative comparisons. Our code is available at https://github.com/boomluo02/DMAligner.
title	DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2602.23022

Similar Items