Saved in:
Bibliographic Details
Main Authors: Husseini-Wüsthoff, Hümeyra, Riethdorf, Sabine, Schneeweiss, Andreas, Trumpp, Andreas, Pantel, Klaus, Wikman, Harriet, Nielsen, Maximilian, Werner, René
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2411.16332
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908771345760256
author Husseini-Wüsthoff, Hümeyra
Riethdorf, Sabine
Schneeweiss, Andreas
Trumpp, Andreas
Pantel, Klaus
Wikman, Harriet
Nielsen, Maximilian
Werner, René
author_facet Husseini-Wüsthoff, Hümeyra
Riethdorf, Sabine
Schneeweiss, Andreas
Trumpp, Andreas
Pantel, Klaus
Wikman, Harriet
Nielsen, Maximilian
Werner, René
contents Detection and differentiation of circulating tumor cells (CTCs) and non-CTCs in blood draws of cancer patients pose multiple challenges. While the gold standard relies on tedious manual evaluation of an automatically generated selection of images, machine learning (ML) techniques offer the potential to automate these processes. However, human assessment remains indispensable when the ML system arrives at uncertain or wrong decisions due to an insufficient set of labeled training data. This study introduces a human-in-the-loop (HiL) strategy for improving ML-based CTC detection. We combine self-supervised deep learning and a conventional ML-based classifier and propose iterative targeted sampling and labeling of new unlabeled training samples by human experts. The sampling strategy is based on the classification performance of local latent space clusters. The advantages of the proposed approach compared to naive random sampling are demonstrated for liquid biopsy data from patients with metastatic breast cancer.
format Preprint
id arxiv_https___arxiv_org_abs_2411_16332
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Cluster-based human-in-the-loop strategy for improving machine learning-based circulating tumor cell detection in liquid biopsy
Husseini-Wüsthoff, Hümeyra
Riethdorf, Sabine
Schneeweiss, Andreas
Trumpp, Andreas
Pantel, Klaus
Wikman, Harriet
Nielsen, Maximilian
Werner, René
Computer Vision and Pattern Recognition
Detection and differentiation of circulating tumor cells (CTCs) and non-CTCs in blood draws of cancer patients pose multiple challenges. While the gold standard relies on tedious manual evaluation of an automatically generated selection of images, machine learning (ML) techniques offer the potential to automate these processes. However, human assessment remains indispensable when the ML system arrives at uncertain or wrong decisions due to an insufficient set of labeled training data. This study introduces a human-in-the-loop (HiL) strategy for improving ML-based CTC detection. We combine self-supervised deep learning and a conventional ML-based classifier and propose iterative targeted sampling and labeling of new unlabeled training samples by human experts. The sampling strategy is based on the classification performance of local latent space clusters. The advantages of the proposed approach compared to naive random sampling are demonstrated for liquid biopsy data from patients with metastatic breast cancer.
title Cluster-based human-in-the-loop strategy for improving machine learning-based circulating tumor cell detection in liquid biopsy
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2411.16332