Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cai, Yida, Sun, Hao, Huang, Hsiu-Yuan, Wu, Yunfang
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.02079
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910470755057664
author	Cai, Yida Sun, Hao Huang, Hsiu-Yuan Wu, Yunfang
author_facet	Cai, Yida Sun, Hao Huang, Hsiu-Yuan Wu, Yunfang
contents	Information Extraction (IE) plays a crucial role in Natural Language Processing (NLP) by extracting structured information from unstructured text, thereby facilitating seamless integration with various real-world applications that rely on structured data. Despite its significance, recent experiments focusing on English IE tasks have shed light on the challenges faced by Large Language Models (LLMs) in achieving optimal performance, particularly in sub-tasks like Named Entity Recognition (NER). In this paper, we delve into a comprehensive investigation of the performance of mainstream Chinese open-source LLMs in tackling IE tasks, specifically under zero-shot conditions where the models are not fine-tuned for specific tasks. Additionally, we present the outcomes of several few-shot experiments to further gauge the capability of these models. Moreover, our study includes a comparative analysis between these open-source LLMs and ChatGPT, a widely recognized language model, on IE performance. Through meticulous experimentation and analysis, we aim to provide insights into the strengths, limitations, and potential enhancements of existing Chinese open-source LLMs in the domain of Information Extraction within the context of NLP.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_02079
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks Cai, Yida Sun, Hao Huang, Hsiu-Yuan Wu, Yunfang Computation and Language Information Extraction (IE) plays a crucial role in Natural Language Processing (NLP) by extracting structured information from unstructured text, thereby facilitating seamless integration with various real-world applications that rely on structured data. Despite its significance, recent experiments focusing on English IE tasks have shed light on the challenges faced by Large Language Models (LLMs) in achieving optimal performance, particularly in sub-tasks like Named Entity Recognition (NER). In this paper, we delve into a comprehensive investigation of the performance of mainstream Chinese open-source LLMs in tackling IE tasks, specifically under zero-shot conditions where the models are not fine-tuned for specific tasks. Additionally, we present the outcomes of several few-shot experiments to further gauge the capability of these models. Moreover, our study includes a comparative analysis between these open-source LLMs and ChatGPT, a widely recognized language model, on IE performance. Through meticulous experimentation and analysis, we aim to provide insights into the strengths, limitations, and potential enhancements of existing Chinese open-source LLMs in the domain of Information Extraction within the context of NLP.
title	Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks
topic	Computation and Language
url	https://arxiv.org/abs/2406.02079

Similar Items