Saved in:
Bibliographic Details
Main Authors: Krey, Elias B., Neukirch, Nils, Strodthoff, Nils
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.11203
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918494962974720
author Krey, Elias B.
Neukirch, Nils
Strodthoff, Nils
author_facet Krey, Elias B.
Neukirch, Nils
Strodthoff, Nils
contents Intermediate feature representations represent the backbone for the expressivity and adaptability of deep neural networks. However, their geometric structure remains poorly understood. In this submission, we provide indirect insights into this matter by applying a broad selection of manipulations in input space, ranging from geometric and photometric transformations to local masking and semantic manipulations using generative image editing models, and assess the feasibility of learning a mapping in the feature space, mapping from the original to the manipulated feature map. To this end, we devise different types of mappings, from linear to non-linear and local to global mappings and assess both the reconstruction quality of the mapping as well as the semantic content of the mapped representations. We demonstrate the feasibility of learning such mappings for all considered transformations. While global (transformer) models that operate on the full feature map often achieve best results, we show that the same can be achieved with a shared linear model operating on a single feature vector typically with very little degradation in reconstruction quality, even for highly non-trivial semantic manipulations. We analyze the corresponding mappings across different feature layers and characterize them according to dominance of weight vs. bias and the effective rank of the linear transformations. These results provide hints for the hypothesis that the feature space is to a first degree of approximation organized in linear structures. From a broader perspective, the study demonstrates that generative image editing models might open the door to a deeper understanding of the feature space through input manipulation.
format Preprint
id arxiv_https___arxiv_org_abs_2605_11203
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle FeatMap: Understanding image manipulation in the feature space and its implications for feature space geometry
Krey, Elias B.
Neukirch, Nils
Strodthoff, Nils
Machine Learning
Computer Vision and Pattern Recognition
Intermediate feature representations represent the backbone for the expressivity and adaptability of deep neural networks. However, their geometric structure remains poorly understood. In this submission, we provide indirect insights into this matter by applying a broad selection of manipulations in input space, ranging from geometric and photometric transformations to local masking and semantic manipulations using generative image editing models, and assess the feasibility of learning a mapping in the feature space, mapping from the original to the manipulated feature map. To this end, we devise different types of mappings, from linear to non-linear and local to global mappings and assess both the reconstruction quality of the mapping as well as the semantic content of the mapped representations. We demonstrate the feasibility of learning such mappings for all considered transformations. While global (transformer) models that operate on the full feature map often achieve best results, we show that the same can be achieved with a shared linear model operating on a single feature vector typically with very little degradation in reconstruction quality, even for highly non-trivial semantic manipulations. We analyze the corresponding mappings across different feature layers and characterize them according to dominance of weight vs. bias and the effective rank of the linear transformations. These results provide hints for the hypothesis that the feature space is to a first degree of approximation organized in linear structures. From a broader perspective, the study demonstrates that generative image editing models might open the door to a deeper understanding of the feature space through input manipulation.
title FeatMap: Understanding image manipulation in the feature space and its implications for feature space geometry
topic Machine Learning
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2605.11203