Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Snyder, Thomas, Yang, H. Lexie, Schnake, Stefan, Schotthöfer, Steffen
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.08882
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909989809946624
author	Snyder, Thomas Yang, H. Lexie Schnake, Stefan Schotthöfer, Steffen
author_facet	Snyder, Thomas Yang, H. Lexie Schnake, Stefan Schotthöfer, Steffen
contents	Deploying geospatial foundation models on resource-constrained edge devices demands compact architectures that maintain high downstream performance. However, their large parameter counts and the accuracy loss often induced by compression limit practical adoption. In this work, we leverage manifold-constrained optimization framework DLRT to compress large vision transformer-based geospatial foundation models during transfer learning. By enforcing structured low-dimensional parameterizations aligned with downstream objectives, this approach achieves strong compression while preserving task-specific accuracy. We show that the method outperforms of-the-shelf low-rank methods as LoRA. Experiments on diverse geospatial benchmarks confirm substantial parameter reduction with minimal accuracy loss, enabling high-performing, on-device geospatial models.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_08882
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Compressing Vision Transformers in Geospatial Transfer Learning with Manifold-Constrained Optimization Snyder, Thomas Yang, H. Lexie Schnake, Stefan Schotthöfer, Steffen Computer Vision and Pattern Recognition Artificial Intelligence Deploying geospatial foundation models on resource-constrained edge devices demands compact architectures that maintain high downstream performance. However, their large parameter counts and the accuracy loss often induced by compression limit practical adoption. In this work, we leverage manifold-constrained optimization framework DLRT to compress large vision transformer-based geospatial foundation models during transfer learning. By enforcing structured low-dimensional parameterizations aligned with downstream objectives, this approach achieves strong compression while preserving task-specific accuracy. We show that the method outperforms of-the-shelf low-rank methods as LoRA. Experiments on diverse geospatial benchmarks confirm substantial parameter reduction with minimal accuracy loss, enabling high-performing, on-device geospatial models.
title	Compressing Vision Transformers in Geospatial Transfer Learning with Manifold-Constrained Optimization
topic	Computer Vision and Pattern Recognition Artificial Intelligence
url	https://arxiv.org/abs/2601.08882

Similar Items