Saved in:
Bibliographic Details
Main Authors: Sap, Duygu, Lotz, Martin, Mattinson, Connor
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.16514
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909856151109632
author Sap, Duygu
Lotz, Martin
Mattinson, Connor
author_facet Sap, Duygu
Lotz, Martin
Mattinson, Connor
contents We propose a method for image categorization and retrieval that leverages graphs and a graph attention network (GAT)-based autoencoder. Our approach is representative-centric, that is, we execute the categorization and retrieval process via the representative models we construct for the images and image categories. We utilize a graph where nodes represent images (or their representatives) and edges capture similarity relationships. GAT highlights important features and relationships between images, enabling the autoencoder to construct context-aware latent representations that capture the key features of each image relative to its neighbors. We obtain category representatives from these embeddings and categorize a query image by comparing its representative to the category representatives. We then retrieve the most similar image to the query image within its identified category. We demonstrate the effectiveness of our representative-centric approach through experiments with both the GAT autoencoders and standard feature-based techniques.
format Preprint
id arxiv_https___arxiv_org_abs_2510_16514
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Image Categorization and Search via a GAT Autoencoder and Representative Models
Sap, Duygu
Lotz, Martin
Mattinson, Connor
Computer Vision and Pattern Recognition
Artificial Intelligence
We propose a method for image categorization and retrieval that leverages graphs and a graph attention network (GAT)-based autoencoder. Our approach is representative-centric, that is, we execute the categorization and retrieval process via the representative models we construct for the images and image categories. We utilize a graph where nodes represent images (or their representatives) and edges capture similarity relationships. GAT highlights important features and relationships between images, enabling the autoencoder to construct context-aware latent representations that capture the key features of each image relative to its neighbors. We obtain category representatives from these embeddings and categorize a query image by comparing its representative to the category representatives. We then retrieve the most similar image to the query image within its identified category. We demonstrate the effectiveness of our representative-centric approach through experiments with both the GAT autoencoders and standard feature-based techniques.
title Image Categorization and Search via a GAT Autoencoder and Representative Models
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2510.16514