Saved in:
| Main Authors: | Zhu, Yijun, Wang, Jianxin, Shen, Chengchao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.08083 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SDMPrune: Self-Distillation MLP Pruning for Efficient Large Language Models
by: Zhu, Hourun, et al.
Published: (2025)
by: Zhu, Hourun, et al.
Published: (2025)
Learning Compact Vision Tokens for Efficient Large Multimodal Models
by: Tang, Hao, et al.
Published: (2025)
by: Tang, Hao, et al.
Published: (2025)
Adaptive MLP Pruning for Large Vision Transformers
by: Shen, Chengchao
Published: (2026)
by: Shen, Chengchao
Published: (2026)
Who Reasons in the Large Language Models?
by: Shao, Jie, et al.
Published: (2025)
by: Shao, Jie, et al.
Published: (2025)
Instruction-Following Pruning for Large Language Models
by: Hou, Bairu, et al.
Published: (2025)
by: Hou, Bairu, et al.
Published: (2025)
Saliency-driven Dynamic Token Pruning for Large Language Models
by: Tao, Yao, et al.
Published: (2025)
by: Tao, Yao, et al.
Published: (2025)
Counterfactual Probing for Hallucination Detection and Mitigation in Large Language Models
by: Feng, Yijun
Published: (2025)
by: Feng, Yijun
Published: (2025)
DPPA: Pruning Method for Large Language Model to Model Merging
by: Zhu, Yaochen, et al.
Published: (2024)
by: Zhu, Yaochen, et al.
Published: (2024)
GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching
by: Su, Guinan, et al.
Published: (2025)
by: Su, Guinan, et al.
Published: (2025)
Global-Recent Semantic Reasoning on Dynamic Text-Attributed Graphs with Large Language Models
by: Wang, Yunan, et al.
Published: (2025)
by: Wang, Yunan, et al.
Published: (2025)
DarwinLM: Evolutionary Structured Pruning of Large Language Models
by: Tang, Shengkun, et al.
Published: (2025)
by: Tang, Shengkun, et al.
Published: (2025)
ScaleBox: Enabling High-Fidelity and Scalable Code Verification for Large Language Models
by: Zheng, Jiasheng, et al.
Published: (2026)
by: Zheng, Jiasheng, et al.
Published: (2026)
IG-Pruning: Input-Guided Block Pruning for Large Language Models
by: Qiao, Kangyu, et al.
Published: (2025)
by: Qiao, Kangyu, et al.
Published: (2025)
Deterministic Differentiable Structured Pruning for Large Language Models
by: Huang, Weiyu, et al.
Published: (2026)
by: Huang, Weiyu, et al.
Published: (2026)
Diversity-Guided MLP Reduction for Efficient Large Vision Transformers
by: Shen, Chengchao, et al.
Published: (2025)
by: Shen, Chengchao, et al.
Published: (2025)
TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking
by: Wang, Danqing, et al.
Published: (2024)
by: Wang, Danqing, et al.
Published: (2024)
Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning
by: Shen, Chengchao, et al.
Published: (2024)
by: Shen, Chengchao, et al.
Published: (2024)
PIP: Perturbation-based Iterative Pruning for Large Language Models
by: Cao, Yi, et al.
Published: (2025)
by: Cao, Yi, et al.
Published: (2025)
NutePrune: Efficient Progressive Pruning with Numerous Teachers for Large Language Models
by: Li, Shengrui, et al.
Published: (2024)
by: Li, Shengrui, et al.
Published: (2024)
RUIE: Retrieval-based Unified Information Extraction using Large Language Model
by: Liao, Xincheng, et al.
Published: (2024)
by: Liao, Xincheng, et al.
Published: (2024)
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
by: Shen, Bowen, et al.
Published: (2024)
by: Shen, Bowen, et al.
Published: (2024)
Large Language Model Pruning
by: Huang, Hanjuan, et al.
Published: (2024)
by: Huang, Hanjuan, et al.
Published: (2024)
Multiple Object Stitching for Unsupervised Representation Learning
by: Shen, Chengchao, et al.
Published: (2025)
by: Shen, Chengchao, et al.
Published: (2025)
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models
by: Gao, Shangqian, et al.
Published: (2024)
by: Gao, Shangqian, et al.
Published: (2024)
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
by: Huang, Weiyu, et al.
Published: (2024)
by: Huang, Weiyu, et al.
Published: (2024)
Pruning Multilingual Large Language Models for Multilingual Inference
by: Kim, Hwichan, et al.
Published: (2024)
by: Kim, Hwichan, et al.
Published: (2024)
KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models
by: Lv, Bo, et al.
Published: (2024)
by: Lv, Bo, et al.
Published: (2024)
Calibrating the Confidence of Large Language Models by Eliciting Fidelity
by: Zhang, Mozhi, et al.
Published: (2024)
by: Zhang, Mozhi, et al.
Published: (2024)
HieraVid: Hierarchical Token Pruning for Fast Video Large Language Models
by: Guo, Yansong, et al.
Published: (2026)
by: Guo, Yansong, et al.
Published: (2026)
Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models
by: Guo, Hongcheng, et al.
Published: (2025)
by: Guo, Hongcheng, et al.
Published: (2025)
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models
by: Das, Rocktim Jyoti, et al.
Published: (2023)
by: Das, Rocktim Jyoti, et al.
Published: (2023)
BlockPruner: Fine-grained Pruning for Large Language Models
by: Zhong, Longguang, et al.
Published: (2024)
by: Zhong, Longguang, et al.
Published: (2024)
RCPU: Rotation-Constrained Error Compensation for Structured Pruning of Large Language Models
by: Haruta, Shuichiro, et al.
Published: (2025)
by: Haruta, Shuichiro, et al.
Published: (2025)
Entropy-Based Block Pruning for Efficient Large Language Models
by: Yang, Liangwei, et al.
Published: (2025)
by: Yang, Liangwei, et al.
Published: (2025)
Efficient Post-Training Pruning of Large Language Models with Statistical Correction
by: Yu, Peiqi, et al.
Published: (2026)
by: Yu, Peiqi, et al.
Published: (2026)
PAT: Pruning-Aware Tuning for Large Language Models
by: Liu, Yijiang, et al.
Published: (2024)
by: Liu, Yijiang, et al.
Published: (2024)
Towards High-Fidelity Synthetic Multi-platform Social Media Datasets via Large Language Models
by: Tari, Henry, et al.
Published: (2025)
by: Tari, Henry, et al.
Published: (2025)
DLP: Dynamic Layerwise Pruning in Large Language Models
by: Chen, Yuli, et al.
Published: (2025)
by: Chen, Yuli, et al.
Published: (2025)
Prompt-based Depth Pruning of Large Language Models
by: Wee, Juyun, et al.
Published: (2025)
by: Wee, Juyun, et al.
Published: (2025)
Beware of Calibration Data for Pruning Large Language Models
by: Ji, Yixin, et al.
Published: (2024)
by: Ji, Yixin, et al.
Published: (2024)
Similar Items
-
SDMPrune: Self-Distillation MLP Pruning for Efficient Large Language Models
by: Zhu, Hourun, et al.
Published: (2025) -
Learning Compact Vision Tokens for Efficient Large Multimodal Models
by: Tang, Hao, et al.
Published: (2025) -
Adaptive MLP Pruning for Large Vision Transformers
by: Shen, Chengchao
Published: (2026) -
Who Reasons in the Large Language Models?
by: Shao, Jie, et al.
Published: (2025) -
Instruction-Following Pruning for Large Language Models
by: Hou, Bairu, et al.
Published: (2025)