Saved in:
Bibliographic Details
Main Authors: Li, Zhuo, Du, Guodong, Shi, Zesheng, Guo, Weiyang, Yao, Weijun, Zhou, Yuan, Zhang, Jiabo, Li, Jing
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.22205
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Large language models increasingly require specialization across diverse domains, yet existing approaches struggle to balance multi-domain capacities with strict memory and inference constraints. In this work, we introduce SkillWeave, a modular improvement framework that enables LLMs to specialize under fixed memory budgets. SkillWeave partitions full capabilities of a general-purpose model into skillpacks -- lightweight, domain-specific delta modules -- that reorganize and refine the model's internal knowledge. For efficient deployment, SkillWeave integrates SkillZip to compress skillpacks into compact and inference-ready format, enabling strong multi-domain performance with low-latency execution. On multi-task and agentic benchmarks, a 9B SkillWeave model outperforms several baselines and even surpasses a 32B monolithic LLM, while achieving up to 4x speedup.