Salvato in:
Dettagli Bibliografici
Autori principali: Kapoor, Aditya, Khadilkar, Harshad, Gubbi, Jayvardhana
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2407.16302
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866917873050451968
author Kapoor, Aditya
Khadilkar, Harshad
Gubbi, Jayvardhana
author_facet Kapoor, Aditya
Khadilkar, Harshad
Gubbi, Jayvardhana
contents Distortion identification and rectification in images and videos is vital for achieving good performance in downstream vision applications. Instead of relying on fixed trial-and-error based image processing pipelines, we propose a two-level sequential planning approach for automated image distortion classification and rectification. At the higher level it detects the class of corruptions present in the input image, if any. The lower level selects a specific algorithm to be applied, from a set of externally provided candidate algorithms. The entire two-level setup runs in the form of a single forward pass during inference and it is to be queried iteratively until the retrieval of the original image. We demonstrate improvements compared to three baselines on the object detection task on COCO image dataset with rich set of distortions. The advantage of our approach is its dynamic reconfiguration, conditioned on the input image and generalisability to unseen candidate algorithms at inference time, since it relies only on the comparison of their output of the image embeddings.
format Preprint
id arxiv_https___arxiv_org_abs_2407_16302
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions
Kapoor, Aditya
Khadilkar, Harshad
Gubbi, Jayvardhana
Computer Vision and Pattern Recognition
Image and Video Processing
Distortion identification and rectification in images and videos is vital for achieving good performance in downstream vision applications. Instead of relying on fixed trial-and-error based image processing pipelines, we propose a two-level sequential planning approach for automated image distortion classification and rectification. At the higher level it detects the class of corruptions present in the input image, if any. The lower level selects a specific algorithm to be applied, from a set of externally provided candidate algorithms. The entire two-level setup runs in the form of a single forward pass during inference and it is to be queried iteratively until the retrieval of the original image. We demonstrate improvements compared to three baselines on the object detection task on COCO image dataset with rich set of distortions. The advantage of our approach is its dynamic reconfiguration, conditioned on the input image and generalisability to unseen candidate algorithms at inference time, since it relies only on the comparison of their output of the image embeddings.
title DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions
topic Computer Vision and Pattern Recognition
Image and Video Processing
url https://arxiv.org/abs/2407.16302