Saved in:
Bibliographic Details
Main Authors: Upadhyaya, Nishanth, Sridharamurthy, Raghavendra
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.16653
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • In this article, we introduce 'Internalized Self-Correction' (InSeC) for large language models (LLMs). While many approaches exist for self-reflection at inference time, we propose a novel method that combines ideas from negative sampling, self-reflection during training, and inference time. InSeC allows LLMs to correct themselves by introducing mistakes and their corresponding corrections during training, thereby converting the learning process into a true supervised learning task with both positive and negative examples. This approach can be extended to improve instruction following and correct hallucinations or incorrect sentences generated by LLMs.