Saved in:
Bibliographic Details
Main Authors: Das, Aryan, Rachamalla, Tanishq, Singh, Pravendra, Biswas, Koushik, Verma, Vinay Kumar, Garcia, Salvador, Plaza, Antonio, Roy, Swalpa Kumar
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.12217
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • We introduce HyperCap, the first large-scale hyperspectral captioning dataset designed to enhance model performance and effectiveness in remote sensing applications. Unlike traditional hyperspectral imaging (HSI) benchmarks, HyperCap integrates spectral data with pixel-wise textual annotations, enabling deeper semantic understanding. This dataset enhances model performance in tasks like classification and feature extraction, providing a valuable resource for advanced remote sensing applications. HyperCap is constructed from four benchmark datasets and annotated through a hybrid approach combining automated and manual methods to ensure accuracy and consistency. Empirical evaluations using state-of-the-art encoders and diverse fusion techniques demonstrate significant improvements in classification performance. These results underscore the potential of vision-language learning in HSI and position HyperCap as a foundational dataset for future research in the field. The code and dataset are available at https://github.com/arya-domain/HyperCap.