Saved in:
| Main Authors: | Li, Shengrui, Chen, Junzhe, Han, Xueting, Bai, Jing |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.09773 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
by: Deng, Hexuan, et al.
Published: (2024)
by: Deng, Hexuan, et al.
Published: (2024)
DLP: Dynamic Layerwise Pruning in Large Language Models
by: Chen, Yuli, et al.
Published: (2025)
by: Chen, Yuli, et al.
Published: (2025)
Efficient Shapley Value-based Non-Uniform Pruning of Large Language Models
by: Sun, Chuan, et al.
Published: (2025)
by: Sun, Chuan, et al.
Published: (2025)
IG-Pruning: Input-Guided Block Pruning for Large Language Models
by: Qiao, Kangyu, et al.
Published: (2025)
by: Qiao, Kangyu, et al.
Published: (2025)
Sparsity Induction for Accurate Post-Training Pruning of Large Language Models
by: Jiang, Minhao, et al.
Published: (2026)
by: Jiang, Minhao, et al.
Published: (2026)
Instruction-Following Pruning for Large Language Models
by: Hou, Bairu, et al.
Published: (2025)
by: Hou, Bairu, et al.
Published: (2025)
Large Language Model Pruning
by: Huang, Hanjuan, et al.
Published: (2024)
by: Huang, Hanjuan, et al.
Published: (2024)
E$^3$-Pruner: Towards Efficient, Economical, and Effective Layer Pruning for Large Language Models
by: Yuan, Tao, et al.
Published: (2025)
by: Yuan, Tao, et al.
Published: (2025)
BlockPruner: Fine-grained Pruning for Large Language Models
by: Zhong, Longguang, et al.
Published: (2024)
by: Zhong, Longguang, et al.
Published: (2024)
SparseLLM: Towards Global Pruning for Pre-trained Language Models
by: Bai, Guangji, et al.
Published: (2024)
by: Bai, Guangji, et al.
Published: (2024)
Entropy-Based Block Pruning for Efficient Large Language Models
by: Yang, Liangwei, et al.
Published: (2025)
by: Yang, Liangwei, et al.
Published: (2025)
MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models
by: Cheng, Hongrong, et al.
Published: (2024)
by: Cheng, Hongrong, et al.
Published: (2024)
KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models
by: Lv, Bo, et al.
Published: (2024)
by: Lv, Bo, et al.
Published: (2024)
ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models
by: Chen, Hao, et al.
Published: (2025)
by: Chen, Hao, et al.
Published: (2025)
High-Fidelity Pruning for Large Language Models
by: Zhu, Yijun, et al.
Published: (2026)
by: Zhu, Yijun, et al.
Published: (2026)
A Simple Linear Patch Revives Layer-Pruned Large Language Models
by: Chen, Xinrui, et al.
Published: (2025)
by: Chen, Xinrui, et al.
Published: (2025)
PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery
by: He, Bowei, et al.
Published: (2025)
by: He, Bowei, et al.
Published: (2025)
PAT: Pruning-Aware Tuning for Large Language Models
by: Liu, Yijiang, et al.
Published: (2024)
by: Liu, Yijiang, et al.
Published: (2024)
Deterministic Differentiable Structured Pruning for Large Language Models
by: Huang, Weiyu, et al.
Published: (2026)
by: Huang, Weiyu, et al.
Published: (2026)
Efficient Post-Training Pruning of Large Language Models with Statistical Correction
by: Yu, Peiqi, et al.
Published: (2026)
by: Yu, Peiqi, et al.
Published: (2026)
Olica: Efficient Structured Pruning of Large Language Models without Retraining
by: He, Jiujun, et al.
Published: (2025)
by: He, Jiujun, et al.
Published: (2025)
Pruning Multilingual Large Language Models for Multilingual Inference
by: Kim, Hwichan, et al.
Published: (2024)
by: Kim, Hwichan, et al.
Published: (2024)
Pruning General Large Language Models into Customized Expert Models
by: Zhao, Yirao, et al.
Published: (2025)
by: Zhao, Yirao, et al.
Published: (2025)
Adaptive Pruning for Large Language Models with Structural Importance Awareness
by: Zheng, Haotian, et al.
Published: (2024)
by: Zheng, Haotian, et al.
Published: (2024)
Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
by: Xiang, Yang, et al.
Published: (2025)
by: Xiang, Yang, et al.
Published: (2025)
Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models
by: Li, Jianwei, et al.
Published: (2023)
by: Li, Jianwei, et al.
Published: (2023)
Structural Pruning of Large Vision Language Models: A Comprehensive Study on Pruning Dynamics, Recovery, and Data Efficiency
by: Huang, Yiran, et al.
Published: (2026)
by: Huang, Yiran, et al.
Published: (2026)
Beware of Calibration Data for Pruning Large Language Models
by: Ji, Yixin, et al.
Published: (2024)
by: Ji, Yixin, et al.
Published: (2024)
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation
by: Xu, Peng, et al.
Published: (2024)
by: Xu, Peng, et al.
Published: (2024)
SDMPrune: Self-Distillation MLP Pruning for Efficient Large Language Models
by: Zhu, Hourun, et al.
Published: (2025)
by: Zhu, Hourun, et al.
Published: (2025)
POP: Prefill-Only Pruning for Efficient Large Model Inference
by: He, Junhui, et al.
Published: (2026)
by: He, Junhui, et al.
Published: (2026)
GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching
by: Su, Guinan, et al.
Published: (2025)
by: Su, Guinan, et al.
Published: (2025)
Pruning Large Language Models by Identifying and Preserving Functional Networks
by: Liu, Yiheng, et al.
Published: (2025)
by: Liu, Yiheng, et al.
Published: (2025)
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models
by: Ye, Weihao, et al.
Published: (2024)
by: Ye, Weihao, et al.
Published: (2024)
Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models
by: Guo, Hongcheng, et al.
Published: (2025)
by: Guo, Hongcheng, et al.
Published: (2025)
Prompt-based Depth Pruning of Large Language Models
by: Wee, Juyun, et al.
Published: (2025)
by: Wee, Juyun, et al.
Published: (2025)
Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation
by: Chen, Xinrui, et al.
Published: (2025)
by: Chen, Xinrui, et al.
Published: (2025)
Pruning Weights but Not Truth: Safeguarding Truthfulness While Pruning LLMs
by: Fu, Yao, et al.
Published: (2025)
by: Fu, Yao, et al.
Published: (2025)
Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models
by: Li, Sijie, et al.
Published: (2026)
by: Li, Sijie, et al.
Published: (2026)
EfficientXpert: Efficient Domain Adaptation for Large Language Models via Propagation-Aware Pruning
by: Zhao, Songlin, et al.
Published: (2025)
by: Zhao, Songlin, et al.
Published: (2025)
Similar Items
-
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
by: Deng, Hexuan, et al.
Published: (2024) -
DLP: Dynamic Layerwise Pruning in Large Language Models
by: Chen, Yuli, et al.
Published: (2025) -
Efficient Shapley Value-based Non-Uniform Pruning of Large Language Models
by: Sun, Chuan, et al.
Published: (2025) -
IG-Pruning: Input-Guided Block Pruning for Large Language Models
by: Qiao, Kangyu, et al.
Published: (2025) -
Sparsity Induction for Accurate Post-Training Pruning of Large Language Models
by: Jiang, Minhao, et al.
Published: (2026)