Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Vij, Anneketh, Liu, Changhao, Nair, Rahul Anil, Ho, Theodore Eugene, Shi, Edward, Bhowmick, Ayan
Formato:	Preprint
Publicado:	2025
Materias:	Computation and Language Artificial Intelligence
Acceso en línea:	https://arxiv.org/abs/2502.02028
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866910830157627392
author	Vij, Anneketh Liu, Changhao Nair, Rahul Anil Ho, Theodore Eugene Shi, Edward Bhowmick, Ayan
author_facet	Vij, Anneketh Liu, Changhao Nair, Rahul Anil Ho, Theodore Eugene Shi, Edward Bhowmick, Ayan
contents	This research presents an exploration and study of the recipe generation task by fine-tuning various very small language models, with a focus on developing robust evaluation metrics and comparing across different language models the open-ended task of recipe generation. This study presents extensive experiments with multiple model architectures, ranging from T5-small (Raffel et al., 2023) and SmolLM-135M(Allal et al., 2024) to Phi-2 (Research, 2023), implementing both traditional NLP metrics and custom domain-specific evaluation metrics. Our novel evaluation framework incorporates recipe-specific metrics for assessing content quality and introduces approaches to allergen substitution. The results indicate that, while larger models generally perform better on standard metrics, the relationship between model size and recipe quality is more nuanced when considering domain-specific metrics. SmolLM-360M and SmolLM-1.7B demonstrate comparable performance despite their size difference before and after fine-tuning, while fine-tuning Phi-2 shows notable limitations in recipe generation despite its larger parameter count. The comprehensive evaluation framework and allergen substitution systems provide valuable insights for future work in recipe generation and broader NLG tasks that require domain expertise and safety considerations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_02028
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study Vij, Anneketh Liu, Changhao Nair, Rahul Anil Ho, Theodore Eugene Shi, Edward Bhowmick, Ayan Computation and Language Artificial Intelligence This research presents an exploration and study of the recipe generation task by fine-tuning various very small language models, with a focus on developing robust evaluation metrics and comparing across different language models the open-ended task of recipe generation. This study presents extensive experiments with multiple model architectures, ranging from T5-small (Raffel et al., 2023) and SmolLM-135M(Allal et al., 2024) to Phi-2 (Research, 2023), implementing both traditional NLP metrics and custom domain-specific evaluation metrics. Our novel evaluation framework incorporates recipe-specific metrics for assessing content quality and introduces approaches to allergen substitution. The results indicate that, while larger models generally perform better on standard metrics, the relationship between model size and recipe quality is more nuanced when considering domain-specific metrics. SmolLM-360M and SmolLM-1.7B demonstrate comparable performance despite their size difference before and after fine-tuning, while fine-tuning Phi-2 shows notable limitations in recipe generation despite its larger parameter count. The comprehensive evaluation framework and allergen substitution systems provide valuable insights for future work in recipe generation and broader NLG tasks that require domain expertise and safety considerations.
title	Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2502.02028

Ejemplares similares