Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Xu, Bowen, Wu, Shaoyu, Liu, Kai, Hu, Lulu
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2404.18410
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929698503655424
author	Xu, Bowen Wu, Shaoyu Liu, Kai Hu, Lulu
author_facet	Xu, Bowen Wu, Shaoyu Liu, Kai Hu, Lulu
contents	With the proliferation of large language models (LLMs), the comprehensive alignment of such models across multiple tasks has emerged as a critical area of research. Existing alignment methodologies primarily address single task, such as multi-turn dialogue, coding, mathematical problem-solving, and tool usage. Although there is a large amount of high-quality data available for those tasks, most of them provide only questions and answers without including the system prompt. Though a detailed analysis of the Qwen language model, we found that the system prompt has a significant impact on both training and inference processes of LLM. We attributes this phenomenon to overfitting to the system prompt. In address this issue, we introduce a novel technique termed Mixture-of-Instructions (MoI), which employs a strategy of instruction packing combined with diverse system prompts to boost the alignment efficiency of language models. We have also compiled a diverse set of seven benchmark datasets to rigorously evaluate the alignment efficacy of the MoI-enhanced language model. Our methodology was applied to the open-source Qwen-7B-chat model, culminating in the development of Qwen-SFT-MoI. This enhanced model demonstrates significant advancements in generative capabilities across coding, mathematics, and tool use tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_18410
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting Xu, Bowen Wu, Shaoyu Liu, Kai Hu, Lulu Computation and Language With the proliferation of large language models (LLMs), the comprehensive alignment of such models across multiple tasks has emerged as a critical area of research. Existing alignment methodologies primarily address single task, such as multi-turn dialogue, coding, mathematical problem-solving, and tool usage. Although there is a large amount of high-quality data available for those tasks, most of them provide only questions and answers without including the system prompt. Though a detailed analysis of the Qwen language model, we found that the system prompt has a significant impact on both training and inference processes of LLM. We attributes this phenomenon to overfitting to the system prompt. In address this issue, we introduce a novel technique termed Mixture-of-Instructions (MoI), which employs a strategy of instruction packing combined with diverse system prompts to boost the alignment efficiency of language models. We have also compiled a diverse set of seven benchmark datasets to rigorously evaluate the alignment efficacy of the MoI-enhanced language model. Our methodology was applied to the open-source Qwen-7B-chat model, culminating in the development of Qwen-SFT-MoI. This enhanced model demonstrates significant advancements in generative capabilities across coding, mathematics, and tool use tasks.
title	Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
topic	Computation and Language
url	https://arxiv.org/abs/2404.18410

Similar Items