Saved in:
Bibliographic Details
Main Authors: Potje, Guilherme, Cadar, Felipe, Araujo, Andre, Martins, Renato, Nascimento, Erickson R.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.19174
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914777301778432
author Potje, Guilherme
Cadar, Felipe
Araujo, Andre
Martins, Renato
Nascimento, Erickson R.
author_facet Potje, Guilherme
Cadar, Felipe
Araujo, Andre
Martins, Renato
Nascimento, Erickson R.
contents We introduce a lightweight and accurate architecture for resource-efficient visual correspondence. Our method, dubbed XFeat (Accelerated Features), revisits fundamental design choices in convolutional neural networks for detecting, extracting, and matching local features. Our new model satisfies a critical need for fast and robust algorithms suitable to resource-limited devices. In particular, accurate image matching requires sufficiently large image resolutions - for this reason, we keep the resolution as large as possible while limiting the number of channels in the network. Besides, our model is designed to offer the choice of matching at the sparse or semi-dense levels, each of which may be more suitable for different downstream applications, such as visual navigation and augmented reality. Our model is the first to offer semi-dense matching efficiently, leveraging a novel match refinement module that relies on coarse local descriptors. XFeat is versatile and hardware-independent, surpassing current deep learning-based local features in speed (up to 5x faster) with comparable or better accuracy, proven in pose estimation and visual localization. We showcase it running in real-time on an inexpensive laptop CPU without specialized hardware optimizations. Code and weights are available at www.verlab.dcc.ufmg.br/descriptors/xfeat_cvpr24.
format Preprint
id arxiv_https___arxiv_org_abs_2404_19174
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle XFeat: Accelerated Features for Lightweight Image Matching
Potje, Guilherme
Cadar, Felipe
Araujo, Andre
Martins, Renato
Nascimento, Erickson R.
Computer Vision and Pattern Recognition
We introduce a lightweight and accurate architecture for resource-efficient visual correspondence. Our method, dubbed XFeat (Accelerated Features), revisits fundamental design choices in convolutional neural networks for detecting, extracting, and matching local features. Our new model satisfies a critical need for fast and robust algorithms suitable to resource-limited devices. In particular, accurate image matching requires sufficiently large image resolutions - for this reason, we keep the resolution as large as possible while limiting the number of channels in the network. Besides, our model is designed to offer the choice of matching at the sparse or semi-dense levels, each of which may be more suitable for different downstream applications, such as visual navigation and augmented reality. Our model is the first to offer semi-dense matching efficiently, leveraging a novel match refinement module that relies on coarse local descriptors. XFeat is versatile and hardware-independent, surpassing current deep learning-based local features in speed (up to 5x faster) with comparable or better accuracy, proven in pose estimation and visual localization. We showcase it running in real-time on an inexpensive laptop CPU without specialized hardware optimizations. Code and weights are available at www.verlab.dcc.ufmg.br/descriptors/xfeat_cvpr24.
title XFeat: Accelerated Features for Lightweight Image Matching
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.19174