Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Garg, Raghav, Sharma, Kapil, Singla, Shrey
Formato:	Preprint
Publicado:	2024
Materias:	Machine Learning Artificial Intelligence
Acceso en línea:	https://arxiv.org/abs/2406.18954
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866910504037908480
author	Garg, Raghav Sharma, Kapil Singla, Shrey
author_facet	Garg, Raghav Sharma, Kapil Singla, Shrey
contents	This paper shows that alignment methods can achieve superior adherence to guardrails compared to instruction fine-tuning alone in conversational agents, also known as bots, within predefined guidelines or 'guardrails'. It examines traditional training approaches such as instruction fine-tuning and the recent advancements in direct alignment methods like Identity Preference Optimization (IPO), and Kahneman-Tversky Optimization (KTO). The effectiveness of alignment techniques both pre and post-instruction tuning is highlighted, illustrating their potential to optimize conversational bots in domains that require strict adherence to specified rules, such as customer care.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_18954
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Alignment For Performance Improvement in Conversation Bots Garg, Raghav Sharma, Kapil Singla, Shrey Machine Learning Artificial Intelligence This paper shows that alignment methods can achieve superior adherence to guardrails compared to instruction fine-tuning alone in conversational agents, also known as bots, within predefined guidelines or 'guardrails'. It examines traditional training approaches such as instruction fine-tuning and the recent advancements in direct alignment methods like Identity Preference Optimization (IPO), and Kahneman-Tversky Optimization (KTO). The effectiveness of alignment techniques both pre and post-instruction tuning is highlighted, illustrating their potential to optimize conversational bots in domains that require strict adherence to specified rules, such as customer care.
title	Alignment For Performance Improvement in Conversation Bots
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2406.18954

Ejemplares similares