Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Boxiang, Yang, Baijian
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.05243
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911668611579904
author	Zhang, Boxiang Yang, Baijian
author_facet	Zhang, Boxiang Yang, Baijian
contents	Transformers achieve strong accuracy but incur high compute and memory cost. Structured pruning reduces inference cost, but most methods rely on retraining or multi-stage optimization, which limits post-training deployment. We propose CORP, a closed-form one-shot structured pruning method that removes MLP dimensions and attention substructures using only unlabeled calibration data without gradients or fine-tuning. CORP formulates structured pruning as a representation recovery problem. It models removed components as affine functions of retained components and derives closed-form ridge regression solutions that fold compensation into model weights. This minimizes a layer-local affine/logit reconstruction objective under the calibration distribution. Experiments on ImageNet with DeiT reveal strong redundancy in both MLP and attention representations. With CORP, models retain high accuracy under aggressive sparsity. On DeiT-Huge, CORP achieves 83.27% Top-1 accuracy after pruning 50\% of both MLP and attention structures.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_05243
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Transformers Zhang, Boxiang Yang, Baijian Machine Learning Computer Vision and Pattern Recognition Transformers achieve strong accuracy but incur high compute and memory cost. Structured pruning reduces inference cost, but most methods rely on retraining or multi-stage optimization, which limits post-training deployment. We propose CORP, a closed-form one-shot structured pruning method that removes MLP dimensions and attention substructures using only unlabeled calibration data without gradients or fine-tuning. CORP formulates structured pruning as a representation recovery problem. It models removed components as affine functions of retained components and derives closed-form ridge regression solutions that fold compensation into model weights. This minimizes a layer-local affine/logit reconstruction objective under the calibration distribution. Experiments on ImageNet with DeiT reveal strong redundancy in both MLP and attention representations. With CORP, models retain high accuracy under aggressive sparsity. On DeiT-Huge, CORP achieves 83.27% Top-1 accuracy after pruning 50\% of both MLP and attention structures.
title	CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Transformers
topic	Machine Learning Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2602.05243

Similar Items