Saved in:
Bibliographic Details
Main Authors: Ingale, Kshitij, Hong, Sun Hae, Hu, Qiyuan, Zhang, Renyu, Osinski, Bo, Khoshdeli, Mina, Och, Josh, Nagpal, Kunal, Stumpe, Martin C., Joshi, Rohan P.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.15816
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916331935236096
author Ingale, Kshitij
Hong, Sun Hae
Hu, Qiyuan
Zhang, Renyu
Osinski, Bo
Khoshdeli, Mina
Och, Josh
Nagpal, Kunal
Stumpe, Martin C.
Joshi, Rohan P.
author_facet Ingale, Kshitij
Hong, Sun Hae
Hu, Qiyuan
Zhang, Renyu
Osinski, Bo
Khoshdeli, Mina
Och, Josh
Nagpal, Kunal
Stumpe, Martin C.
Joshi, Rohan P.
contents Molecular testing of tumor samples for targetable biomarkers is restricted by a lack of standardization, turnaround-time, cost, and tissue availability across cancer types. Additionally, targetable alterations of low prevalence may not be tested in routine workflows. Algorithms that predict DNA alterations from routinely generated hematoxylin and eosin (H&E)-stained images could prioritize samples for confirmatory molecular testing. Costs and the necessity of a large number of samples containing mutations limit approaches that train individual algorithms for each alteration. In this work, models were trained for simultaneous prediction of multiple DNA alterations from H&E images using a multi-task approach. Compared to biomarker-specific models, this approach performed better on average, with pronounced gains for rare mutations. The models reasonably generalized to independent temporal-holdout, externally-stained, and multi-site TCGA test sets. Additionally, whole slide image embeddings derived using multi-task models demonstrated strong performance in downstream tasks that were not a part of training. Overall, this is a promising approach to develop clinically useful algorithms that provide multiple actionable predictions from a single slide.
format Preprint
id arxiv_https___arxiv_org_abs_2407_15816
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images
Ingale, Kshitij
Hong, Sun Hae
Hu, Qiyuan
Zhang, Renyu
Osinski, Bo
Khoshdeli, Mina
Och, Josh
Nagpal, Kunal
Stumpe, Martin C.
Joshi, Rohan P.
Computer Vision and Pattern Recognition
Molecular testing of tumor samples for targetable biomarkers is restricted by a lack of standardization, turnaround-time, cost, and tissue availability across cancer types. Additionally, targetable alterations of low prevalence may not be tested in routine workflows. Algorithms that predict DNA alterations from routinely generated hematoxylin and eosin (H&E)-stained images could prioritize samples for confirmatory molecular testing. Costs and the necessity of a large number of samples containing mutations limit approaches that train individual algorithms for each alteration. In this work, models were trained for simultaneous prediction of multiple DNA alterations from H&E images using a multi-task approach. Compared to biomarker-specific models, this approach performed better on average, with pronounced gains for rare mutations. The models reasonably generalized to independent temporal-holdout, externally-stained, and multi-site TCGA test sets. Additionally, whole slide image embeddings derived using multi-task models demonstrated strong performance in downstream tasks that were not a part of training. Overall, this is a promising approach to develop clinically useful algorithms that provide multiple actionable predictions from a single slide.
title Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2407.15816