Saved in:
Bibliographic Details
Main Authors: Zhang, Boxiang, Yang, Baijian
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.05243
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911668611579904
author Zhang, Boxiang
Yang, Baijian
author_facet Zhang, Boxiang
Yang, Baijian
contents Transformers achieve strong accuracy but incur high compute and memory cost. Structured pruning reduces inference cost, but most methods rely on retraining or multi-stage optimization, which limits post-training deployment. We propose CORP, a closed-form one-shot structured pruning method that removes MLP dimensions and attention substructures using only unlabeled calibration data without gradients or fine-tuning. CORP formulates structured pruning as a representation recovery problem. It models removed components as affine functions of retained components and derives closed-form ridge regression solutions that fold compensation into model weights. This minimizes a layer-local affine/logit reconstruction objective under the calibration distribution. Experiments on ImageNet with DeiT reveal strong redundancy in both MLP and attention representations. With CORP, models retain high accuracy under aggressive sparsity. On DeiT-Huge, CORP achieves 83.27% Top-1 accuracy after pruning 50\% of both MLP and attention structures.
format Preprint
id arxiv_https___arxiv_org_abs_2602_05243
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Transformers
Zhang, Boxiang
Yang, Baijian
Machine Learning
Computer Vision and Pattern Recognition
Transformers achieve strong accuracy but incur high compute and memory cost. Structured pruning reduces inference cost, but most methods rely on retraining or multi-stage optimization, which limits post-training deployment. We propose CORP, a closed-form one-shot structured pruning method that removes MLP dimensions and attention substructures using only unlabeled calibration data without gradients or fine-tuning. CORP formulates structured pruning as a representation recovery problem. It models removed components as affine functions of retained components and derives closed-form ridge regression solutions that fold compensation into model weights. This minimizes a layer-local affine/logit reconstruction objective under the calibration distribution. Experiments on ImageNet with DeiT reveal strong redundancy in both MLP and attention representations. With CORP, models retain high accuracy under aggressive sparsity. On DeiT-Huge, CORP achieves 83.27% Top-1 accuracy after pruning 50\% of both MLP and attention structures.
title CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Transformers
topic Machine Learning
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.05243