Saved in:
Bibliographic Details
Main Authors: Muttakhiroh, Iing, Fevens, Thomas
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.03571
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918115431940096
author Muttakhiroh, Iing
Fevens, Thomas
author_facet Muttakhiroh, Iing
Fevens, Thomas
contents Large Language Models (LLMs) often suffer from performance degradation when faced with domain shifts, primarily due to catastrophic forgetting. In this work, we propose KILO (Knowledge-Instructed Learning for Continual Adaptation), a novel continual learning framework that integrates dynamic knowledge graphs with instruction tuning. By leveraging retrieved domain-specific knowledge as guidance during training, KILO enhances both adaptability to new domains and retention of previously acquired knowledge. We pretrain our model on WikiText-103 and evaluate sequential adaptation across four diverse target domains: BioASQ, SciQ, TweetEval, and MIND. Our experiments demonstrate that KILO consistently outperforms strong baselines, including continual fine-tuning, ERNIE 2.0, and CPT, in terms of backward transfer, forward transfer, F1 score, retention rate, and training efficiency. These results highlight the effectiveness of combining structured knowledge retrieval and instruction prompting to overcome domain shift challenges in continual learning scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2508_03571
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Tackling Distribution Shift in LLM via KILO: Knowledge-Instructed Learning for Continual Adaptation
Muttakhiroh, Iing
Fevens, Thomas
Computation and Language
Machine Learning
Large Language Models (LLMs) often suffer from performance degradation when faced with domain shifts, primarily due to catastrophic forgetting. In this work, we propose KILO (Knowledge-Instructed Learning for Continual Adaptation), a novel continual learning framework that integrates dynamic knowledge graphs with instruction tuning. By leveraging retrieved domain-specific knowledge as guidance during training, KILO enhances both adaptability to new domains and retention of previously acquired knowledge. We pretrain our model on WikiText-103 and evaluate sequential adaptation across four diverse target domains: BioASQ, SciQ, TweetEval, and MIND. Our experiments demonstrate that KILO consistently outperforms strong baselines, including continual fine-tuning, ERNIE 2.0, and CPT, in terms of backward transfer, forward transfer, F1 score, retention rate, and training efficiency. These results highlight the effectiveness of combining structured knowledge retrieval and instruction prompting to overcome domain shift challenges in continual learning scenarios.
title Tackling Distribution Shift in LLM via KILO: Knowledge-Instructed Learning for Continual Adaptation
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2508.03571