Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mæhlum, Petter, Samuel, David, Norman, Rebecka Maria, Jelin, Elma, Bjertnæs, Øyvind Andresen, Øvrelid, Lilja, Velldal, Erik
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2404.18832
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913334496854016
author	Mæhlum, Petter Samuel, David Norman, Rebecka Maria Jelin, Elma Bjertnæs, Øyvind Andresen Øvrelid, Lilja Velldal, Erik
author_facet	Mæhlum, Petter Samuel, David Norman, Rebecka Maria Jelin, Elma Bjertnæs, Øyvind Andresen Øvrelid, Lilja Velldal, Erik
contents	Sentiment analysis is an important tool for aggregating patient voices, in order to provide targeted improvements in healthcare services. A prerequisite for this is the availability of in-domain data annotated for sentiment. This article documents an effort to add sentiment annotations to free-text comments in patient surveys collected by the Norwegian Institute of Public Health (NIPH). However, annotation can be a time-consuming and resource-intensive process, particularly when it requires domain expertise. We therefore also evaluate a possible alternative to human annotation, using large language models (LLMs) as annotators. We perform an extensive evaluation of the approach for two openly available pretrained LLMs for Norwegian, experimenting with different configurations of prompts and in-context learning, comparing their performance to human annotators. We find that even for zero-shot runs, models perform well above the baseline for binary sentiment, but still cannot compete with human annotators on the full dataset.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_18832
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments Mæhlum, Petter Samuel, David Norman, Rebecka Maria Jelin, Elma Bjertnæs, Øyvind Andresen Øvrelid, Lilja Velldal, Erik Computation and Language Sentiment analysis is an important tool for aggregating patient voices, in order to provide targeted improvements in healthcare services. A prerequisite for this is the availability of in-domain data annotated for sentiment. This article documents an effort to add sentiment annotations to free-text comments in patient surveys collected by the Norwegian Institute of Public Health (NIPH). However, annotation can be a time-consuming and resource-intensive process, particularly when it requires domain expertise. We therefore also evaluate a possible alternative to human annotation, using large language models (LLMs) as annotators. We perform an extensive evaluation of the approach for two openly available pretrained LLMs for Norwegian, experimenting with different configurations of prompts and in-context learning, comparing their performance to human annotators. We find that even for zero-shot runs, models perform well above the baseline for binary sentiment, but still cannot compete with human annotators on the full dataset.
title	It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments
topic	Computation and Language
url	https://arxiv.org/abs/2404.18832

Similar Items