Staff View: Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images :: Library Catalog

Bibliographic Details
Main Authors:	Chen, Jiuchen; Yan, Xinyu; Xu, Qizhi; Li, Kaiqi
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.09621

_version_	1866910910698749952
author	Chen, Jiuchen; Yan, Xinyu; Xu, Qizhi; Li, Kaiqi
author_facet	Chen, Jiuchen; Yan, Xinyu; Xu, Qizhi; Li, Kaiqi
contents	Global contextual information and local detail features are essential for haze removal tasks. Deep learning models perform well on small, low-resolution images, but they encounter difficulties with large, high-resolution ones due to GPU memory limitations. As a compromise, they often resort to image slicing or downsampling. The former diminishes global information, while the latter discards high-frequency details. To address these challenges, we propose DehazeXL, a haze removal method that effectively balances global context and local feature extraction, enabling end-to-end modeling of large images on mainstream GPU hardware. Additionally, to evaluate the efficiency of global context utilization in haze removal performance, we design a visual attribution method tailored to the characteristics of haze removal tasks. Finally, recognizing the lack of benchmark datasets for haze removal in large images, we have developed an ultra-high-resolution haze removal dataset (8KDehaze) to support model training and testing. It includes 10000 pairs of clear and hazy remote sensing images, each sized at 8192 $\times$ 8192 pixels. Extensive experiments demonstrate that DehazeXL can infer images up to 10240 $\times$ 10240 pixels with only 21 GB of memory, achieving state-of-the-art results among all evaluated methods. The source code and experimental dataset are available at https://github.com/CastleChen339/DehazeXL.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_09621
institution	arXiv
publishDate	2025
record_format	arxiv
title	Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2504.09621

Similar Items