Saved in:
Bibliographic Details
Main Authors: Zou, Yang, Zhu, Xingyue, Han, Kaiqi, Ma, Jun, Li, Xingyuan, Jiang, Zhiying, Liu, Jinyuan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.04682
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914512187162624
author Zou, Yang
Zhu, Xingyue
Han, Kaiqi
Ma, Jun
Li, Xingyuan
Jiang, Zhiying
Liu, Jinyuan
author_facet Zou, Yang
Zhu, Xingyue
Han, Kaiqi
Ma, Jun
Li, Xingyuan
Jiang, Zhiying
Liu, Jinyuan
contents Infrared video has been of great interest in visual tasks under challenging environments, but often suffers from severe atmospheric turbulence and compression degradation. Existing video super-resolution (VSR) methods either neglect the inherent modality gap between infrared and visible images or fail to restore turbulence-induced distortions. Directly cascading turbulence mitigation (TM) algorithms with VSR methods leads to error propagation and accumulation due to the decoupled modeling of degradation between turbulence and resolution. We introduce HATIR, a Heat-Aware Diffusion for Turbulent InfraRed Video Super-Resolution, which injects heat-aware deformation priors into the diffusion sampling path to jointly model the inverse process of turbulent degradation and structural detail loss. Specifically, HATIR constructs a Phasor-Guided Flow Estimator, rooted in the physical principle that thermally active regions exhibit consistent phasor responses over time, enabling reliable turbulence-aware flow to guide the reverse diffusion process. To ensure the fidelity of structural recovery under nonuniform distortions, a Turbulence-Aware Decoder is proposed to selectively suppress unstable temporal cues and enhance edge-aware feature aggregation via turbulence gating and structure-aware attention. We built FLIR-IVSR, the first dataset for turbulent infrared VSR, comprising paired LR-HR sequences from a FLIR T1050sc camera (1024 X 768) spanning 640 diverse scenes with varying camera and object motion conditions. This encourages future research in infrared VSR. Project page: https://github.com/JZ0606/HATIR
format Preprint
id arxiv_https___arxiv_org_abs_2601_04682
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-Resolution
Zou, Yang
Zhu, Xingyue
Han, Kaiqi
Ma, Jun
Li, Xingyuan
Jiang, Zhiying
Liu, Jinyuan
Computer Vision and Pattern Recognition
Infrared video has been of great interest in visual tasks under challenging environments, but often suffers from severe atmospheric turbulence and compression degradation. Existing video super-resolution (VSR) methods either neglect the inherent modality gap between infrared and visible images or fail to restore turbulence-induced distortions. Directly cascading turbulence mitigation (TM) algorithms with VSR methods leads to error propagation and accumulation due to the decoupled modeling of degradation between turbulence and resolution. We introduce HATIR, a Heat-Aware Diffusion for Turbulent InfraRed Video Super-Resolution, which injects heat-aware deformation priors into the diffusion sampling path to jointly model the inverse process of turbulent degradation and structural detail loss. Specifically, HATIR constructs a Phasor-Guided Flow Estimator, rooted in the physical principle that thermally active regions exhibit consistent phasor responses over time, enabling reliable turbulence-aware flow to guide the reverse diffusion process. To ensure the fidelity of structural recovery under nonuniform distortions, a Turbulence-Aware Decoder is proposed to selectively suppress unstable temporal cues and enhance edge-aware feature aggregation via turbulence gating and structure-aware attention. We built FLIR-IVSR, the first dataset for turbulent infrared VSR, comprising paired LR-HR sequences from a FLIR T1050sc camera (1024 X 768) spanning 640 diverse scenes with varying camera and object motion conditions. This encourages future research in infrared VSR. Project page: https://github.com/JZ0606/HATIR
title HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-Resolution
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2601.04682