Saved in:
Bibliographic Details
Main Authors: Mæhlum, Petter, Samuel, David, Norman, Rebecka Maria, Jelin, Elma, Bjertnæs, Øyvind Andresen, Øvrelid, Lilja, Velldal, Erik
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.18832
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913334496854016
author Mæhlum, Petter
Samuel, David
Norman, Rebecka Maria
Jelin, Elma
Bjertnæs, Øyvind Andresen
Øvrelid, Lilja
Velldal, Erik
author_facet Mæhlum, Petter
Samuel, David
Norman, Rebecka Maria
Jelin, Elma
Bjertnæs, Øyvind Andresen
Øvrelid, Lilja
Velldal, Erik
contents Sentiment analysis is an important tool for aggregating patient voices, in order to provide targeted improvements in healthcare services. A prerequisite for this is the availability of in-domain data annotated for sentiment. This article documents an effort to add sentiment annotations to free-text comments in patient surveys collected by the Norwegian Institute of Public Health (NIPH). However, annotation can be a time-consuming and resource-intensive process, particularly when it requires domain expertise. We therefore also evaluate a possible alternative to human annotation, using large language models (LLMs) as annotators. We perform an extensive evaluation of the approach for two openly available pretrained LLMs for Norwegian, experimenting with different configurations of prompts and in-context learning, comparing their performance to human annotators. We find that even for zero-shot runs, models perform well above the baseline for binary sentiment, but still cannot compete with human annotators on the full dataset.
format Preprint
id arxiv_https___arxiv_org_abs_2404_18832
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments
Mæhlum, Petter
Samuel, David
Norman, Rebecka Maria
Jelin, Elma
Bjertnæs, Øyvind Andresen
Øvrelid, Lilja
Velldal, Erik
Computation and Language
Sentiment analysis is an important tool for aggregating patient voices, in order to provide targeted improvements in healthcare services. A prerequisite for this is the availability of in-domain data annotated for sentiment. This article documents an effort to add sentiment annotations to free-text comments in patient surveys collected by the Norwegian Institute of Public Health (NIPH). However, annotation can be a time-consuming and resource-intensive process, particularly when it requires domain expertise. We therefore also evaluate a possible alternative to human annotation, using large language models (LLMs) as annotators. We perform an extensive evaluation of the approach for two openly available pretrained LLMs for Norwegian, experimenting with different configurations of prompts and in-context learning, comparing their performance to human annotators. We find that even for zero-shot runs, models perform well above the baseline for binary sentiment, but still cannot compete with human annotators on the full dataset.
title It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments
topic Computation and Language
url https://arxiv.org/abs/2404.18832