Staff View: RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution :: Library Catalog

Bibliographic Details
Main Authors:	Jin, Youngwan; Park, Incheol; Nalcakan, Yagiz; Ju, Hyeongjin; Yeo, Sanghyeop; Kim, Shiho
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.15490

_version_	1866911451971584000
author	Jin, Youngwan; Park, Incheol; Nalcakan, Yagiz; Ju, Hyeongjin; Yeo, Sanghyeop; Kim, Shiho
author_facet	Jin, Youngwan; Park, Incheol; Nalcakan, Yagiz; Ju, Hyeongjin; Yeo, Sanghyeop; Kim, Shiho
contents	General-purpose super-resolution models, particularly Vision Transformers, have achieved remarkable success but exhibit fundamental inefficiencies in common infrared imaging scenarios like surveillance and autonomous driving, which operate from fixed or nearly-static viewpoints. These models fail to exploit the strong, persistent spatial priors inherent in such scenes, leading to redundant learning and suboptimal performance. To address this, we propose the Regional Prior attention Transformer for infrared image Super-Resolution (RPT-SR), a novel architecture that explicitly encodes scene layout information into the attention mechanism. Our core contribution is a dual-token framework that fuses (1) learnable regional prior tokens, which act as a persistent memory for the scene's global structure, with (2) local tokens that capture the frame-specific content of the current input. By integrating both token types into the attention mechanism, our model allows the priors to dynamically modulate the local reconstruction process. Extensive experiments validate our approach. While most prior works focus on a single infrared band, we demonstrate the broad applicability and versatility of RPT-SR by establishing new state-of-the-art performance across diverse datasets covering both Long-Wave Infrared (LWIR) and Short-Wave Infrared (SWIR) spectra.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_15490
institution	arXiv
publishDate	2026
record_format	arxiv
title	RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution
topic	Computer Vision and Pattern Recognition; Artificial Intelligence
url	https://arxiv.org/abs/2602.15490
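
The dual-token idea summarized in the contents field can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say how the two token types are fused, so the sketch assumes the simplest reading, in which local tokens act as queries over the concatenation of regional prior tokens and local tokens. The function name dual_token_attention, the token counts, and the omission of learned query/key/value projections are all assumptions made for illustration.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def dual_token_attention(local_tokens, prior_tokens):
    """Hypothetical fusion: each local token attends over [prior + local] tokens.

    Assumption: no learned projections; keys and values are the raw tokens.
    Returns the attended outputs and the attention weights per local token.
    """
    dim = len(local_tokens[0])
    memory = prior_tokens + local_tokens  # prior tokens come first
    scale = 1.0 / math.sqrt(dim)
    outputs, weights = [], []
    for q in local_tokens:
        # Scaled dot-product score of this query against every memory token.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in memory]
        w = softmax(scores)
        # Weighted sum of memory tokens, one output channel at a time.
        out = [sum(w[j] * memory[j][c] for j in range(len(memory)))
               for c in range(dim)]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


random.seed(0)
dim, num_local, num_prior = 4, 6, 2  # illustrative sizes only
# Local tokens stand in for features of the current frame.
local_tokens = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(num_local)]
# Prior tokens stand in for learnable parameters shared across frames.
prior_tokens = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(num_prior)]

outputs, weights = dual_token_attention(local_tokens, prior_tokens)
```

In this sketch the first num_prior entries of each weight row show how strongly the scene-level priors contribute to a given local token. In a trained model the prior tokens would be optimized parameters rather than random values.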
