Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Roychowdhury, Sohini, Krema, Marko, Moore, Brian, Lai, Xingjian, Effedua, Dike, Jethwani, Bharat
Formato:	Preprint
Publicado:	2024
Materias:	Computation and Language Artificial Intelligence Computational Engineering, Finance, and Science
Acceso en línea:	https://arxiv.org/abs/2408.05365
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866915024063168512
author	Roychowdhury, Sohini Krema, Marko Moore, Brian Lai, Xingjian Effedua, Dike Jethwani, Bharat
author_facet	Roychowdhury, Sohini Krema, Marko Moore, Brian Lai, Xingjian Effedua, Dike Jethwani, Bharat
contents	Recent trends in Generative AI have emerged towards fine-tuning foundational large language models (LLMs) to create domain-specific LLMs for automation and chatbot-like applications. Specialized applications for analytics-heavy domains such as Financial report generation require specific writing styles that comprise compound and creative sentences with minimized hallucinations. In this work, we explore the self-corrective auto-regressive qualities of LLMs to learn creativity in writing styles with minimal prompting. We propose a novel two-stage fine-tuning (FT) strategy wherein in the first stage public domain financial reports are used to train for writing styles while allowing the LLM to hallucinate. In the second stage the examples of hallucinations are manually corrected and further used to fine-tune the LLM. The finally trained LLM learns to generate specific financial report sections using minimal instructions and tabular data inputs while ensuring low fine-tuning costs. Our proposed two-stage fine-tuning boosts the accuracy of financial questions answering by two-folds while reducing hallucinations by over 50%. Also, the fine-tuned model has lower perplexity, improved ROUGE, TER and BLEU scores, higher creativity and knowledge density with lower uncertainty and cross entropy than base LLMs. Thus, the proposed framework can be generalized to train creativity in LLMs by first allowing them to hallucinate.
format	Preprint
id	arxiv_https___arxiv_org_abs_2408_05365
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	FiSTECH: Financial Style Transfer to Enhance Creativity without Hallucinations in LLMs Roychowdhury, Sohini Krema, Marko Moore, Brian Lai, Xingjian Effedua, Dike Jethwani, Bharat Computation and Language Artificial Intelligence Computational Engineering, Finance, and Science Recent trends in Generative AI have emerged towards fine-tuning foundational large language models (LLMs) to create domain-specific LLMs for automation and chatbot-like applications. Specialized applications for analytics-heavy domains such as Financial report generation require specific writing styles that comprise compound and creative sentences with minimized hallucinations. In this work, we explore the self-corrective auto-regressive qualities of LLMs to learn creativity in writing styles with minimal prompting. We propose a novel two-stage fine-tuning (FT) strategy wherein in the first stage public domain financial reports are used to train for writing styles while allowing the LLM to hallucinate. In the second stage the examples of hallucinations are manually corrected and further used to fine-tune the LLM. The finally trained LLM learns to generate specific financial report sections using minimal instructions and tabular data inputs while ensuring low fine-tuning costs. Our proposed two-stage fine-tuning boosts the accuracy of financial questions answering by two-folds while reducing hallucinations by over 50%. Also, the fine-tuned model has lower perplexity, improved ROUGE, TER and BLEU scores, higher creativity and knowledge density with lower uncertainty and cross entropy than base LLMs. Thus, the proposed framework can be generalized to train creativity in LLMs by first allowing them to hallucinate.
title	FiSTECH: Financial Style Transfer to Enhance Creativity without Hallucinations in LLMs
topic	Computation and Language Artificial Intelligence Computational Engineering, Finance, and Science
url	https://arxiv.org/abs/2408.05365

Ejemplares similares