Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Wenderoth, Laura
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2411.01653
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916466166595584
author	Wenderoth, Laura
author_facet	Wenderoth, Laura
contents	This study explores the potential of using training dynamics as an automated alternative to human annotation for evaluating the quality of training data. The framework used is Data Maps, which classifies data points into categories such as easy-to-learn, hard-to-learn, and ambiguous (Swayamdipta et al., 2020). Swayamdipta et al. (2020) highlight that difficult-to-learn examples often contain errors, and ambiguous cases significantly impact model training. To confirm the reliability of these findings, we replicated the experiments using a challenging dataset, with a focus on medical question answering. In addition to text comprehension, this field requires the acquisition of detailed medical knowledge, which further complicates the task. A comprehensive evaluation was conducted to assess the feasibility and transferability of the Data Maps framework to the medical domain. The evaluation indicates that the framework is unsuitable for addressing datasets' unique challenges in answering medical questions.
format	Preprint
id	arxiv_https___arxiv_org_abs_2411_01653
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Diagnosing Medical Datasets with Training Dynamics Wenderoth, Laura Machine Learning Artificial Intelligence Computation and Language This study explores the potential of using training dynamics as an automated alternative to human annotation for evaluating the quality of training data. The framework used is Data Maps, which classifies data points into categories such as easy-to-learn, hard-to-learn, and ambiguous (Swayamdipta et al., 2020). Swayamdipta et al. (2020) highlight that difficult-to-learn examples often contain errors, and ambiguous cases significantly impact model training. To confirm the reliability of these findings, we replicated the experiments using a challenging dataset, with a focus on medical question answering. In addition to text comprehension, this field requires the acquisition of detailed medical knowledge, which further complicates the task. A comprehensive evaluation was conducted to assess the feasibility and transferability of the Data Maps framework to the medical domain. The evaluation indicates that the framework is unsuitable for addressing datasets' unique challenges in answering medical questions.
title	Diagnosing Medical Datasets with Training Dynamics
topic	Machine Learning Artificial Intelligence Computation and Language
url	https://arxiv.org/abs/2411.01653

Similar Items