Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Muttakhiroh, Iing, Fevens, Thomas
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2508.03571
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918115431940096
author	Muttakhiroh, Iing Fevens, Thomas
author_facet	Muttakhiroh, Iing Fevens, Thomas
contents	Large Language Models (LLMs) often suffer from performance degradation when faced with domain shifts, primarily due to catastrophic forgetting. In this work, we propose KILO (Knowledge-Instructed Learning for Continual Adaptation), a novel continual learning framework that integrates dynamic knowledge graphs with instruction tuning. By leveraging retrieved domain-specific knowledge as guidance during training, KILO enhances both adaptability to new domains and retention of previously acquired knowledge. We pretrain our model on WikiText-103 and evaluate sequential adaptation across four diverse target domains: BioASQ, SciQ, TweetEval, and MIND. Our experiments demonstrate that KILO consistently outperforms strong baselines, including continual fine-tuning, ERNIE 2.0, and CPT, in terms of backward transfer, forward transfer, F1 score, retention rate, and training efficiency. These results highlight the effectiveness of combining structured knowledge retrieval and instruction prompting to overcome domain shift challenges in continual learning scenarios.
format	Preprint
id	arxiv_https___arxiv_org_abs_2508_03571
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Tackling Distribution Shift in LLM via KILO: Knowledge-Instructed Learning for Continual Adaptation Muttakhiroh, Iing Fevens, Thomas Computation and Language Machine Learning Large Language Models (LLMs) often suffer from performance degradation when faced with domain shifts, primarily due to catastrophic forgetting. In this work, we propose KILO (Knowledge-Instructed Learning for Continual Adaptation), a novel continual learning framework that integrates dynamic knowledge graphs with instruction tuning. By leveraging retrieved domain-specific knowledge as guidance during training, KILO enhances both adaptability to new domains and retention of previously acquired knowledge. We pretrain our model on WikiText-103 and evaluate sequential adaptation across four diverse target domains: BioASQ, SciQ, TweetEval, and MIND. Our experiments demonstrate that KILO consistently outperforms strong baselines, including continual fine-tuning, ERNIE 2.0, and CPT, in terms of backward transfer, forward transfer, F1 score, retention rate, and training efficiency. These results highlight the effectiveness of combining structured knowledge retrieval and instruction prompting to overcome domain shift challenges in continual learning scenarios.
title	Tackling Distribution Shift in LLM via KILO: Knowledge-Instructed Learning for Continual Adaptation
topic	Computation and Language Machine Learning
url	https://arxiv.org/abs/2508.03571

Similar Items