Saved in:
Bibliographic Details
Main Authors: Salmanpour, Mohammad R., Falahati, Sonya, Pouria, Amir Hossein, Mousavi, Amin, Mehrnia, Somayeh Sadat, Alizadeh, Morteza, Gorji, Arman, Farsangi, Zeinab, Safarian, Alireza, Maghsudi, Mehdi, Uribe, Carlos, Rahmim, Arman, Yuan, Ren
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.17039
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915563314348032
author Salmanpour, Mohammad R.
Falahati, Sonya
Pouria, Amir Hossein
Mousavi, Amin
Mehrnia, Somayeh Sadat
Alizadeh, Morteza
Gorji, Arman
Farsangi, Zeinab
Safarian, Alireza
Maghsudi, Mehdi
Uribe, Carlos
Rahmim, Arman
Yuan, Ren
author_facet Salmanpour, Mohammad R.
Falahati, Sonya
Pouria, Amir Hossein
Mousavi, Amin
Mehrnia, Somayeh Sadat
Alizadeh, Morteza
Gorji, Arman
Farsangi, Zeinab
Safarian, Alireza
Maghsudi, Mehdi
Uribe, Carlos
Rahmim, Arman
Yuan, Ren
contents Lung cancer remains the leading cause of cancer mortality, with CT imaging central to screening, prognosis, and treatment. Manual segmentation is variable and time-intensive, while deep learning (DL) offers automation but faces barriers to clinical adoption. Guided by the Knowledge-to-Action framework, this study develops a clinician-in-the-loop DL pipeline to enhance reproducibility, prognostic accuracy, and clinical trust. Multi-center CT data from 999 patients across 12 public datasets were analyzed using five DL models (3D Attention U-Net, ResUNet, VNet, ReconNet, SAM-Med3D), benchmarked against expert contours on whole and click-point cropped images. Segmentation reproducibility was assessed using 497 PySERA-extracted radiomic features via Spearman correlation, ICC, Wilcoxon tests, and MANOVA, while prognostic modeling compared supervised (SL) and semi-supervised learning (SSL) across 38 dimensionality reduction strategies and 24 classifiers. Six physicians qualitatively evaluated masks across seven domains, including clinical meaningfulness, boundary quality, prognostic value, trust, and workflow integration. VNet achieved the best performance (Dice = 0.83, IoU = 0.71), radiomic stability (mean correlation = 0.76, ICC = 0.65), and predictive accuracy under SSL (accuracy = 0.88, F1 = 0.83). SSL consistently outperformed SL across models. Radiologists favored VNet for peritumoral representation and smoother boundaries, preferring AI-generated initial masks for refinement rather than replacement. These results demonstrate that integrating VNet with SSL yields accurate, reproducible, and clinically trusted CT-based lung cancer prognosis, highlighting a feasible path toward physician-centered AI translation.
format Preprint
id arxiv_https___arxiv_org_abs_2510_17039
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Click, Predict, Trust: Clinician-in-the-Loop AI Segmentation for Lung Cancer CT-Based Prognosis within the Knowledge-to-Action Framework
Salmanpour, Mohammad R.
Falahati, Sonya
Pouria, Amir Hossein
Mousavi, Amin
Mehrnia, Somayeh Sadat
Alizadeh, Morteza
Gorji, Arman
Farsangi, Zeinab
Safarian, Alireza
Maghsudi, Mehdi
Uribe, Carlos
Rahmim, Arman
Yuan, Ren
Computer Vision and Pattern Recognition
F.2.2; I.2.7
Lung cancer remains the leading cause of cancer mortality, with CT imaging central to screening, prognosis, and treatment. Manual segmentation is variable and time-intensive, while deep learning (DL) offers automation but faces barriers to clinical adoption. Guided by the Knowledge-to-Action framework, this study develops a clinician-in-the-loop DL pipeline to enhance reproducibility, prognostic accuracy, and clinical trust. Multi-center CT data from 999 patients across 12 public datasets were analyzed using five DL models (3D Attention U-Net, ResUNet, VNet, ReconNet, SAM-Med3D), benchmarked against expert contours on whole and click-point cropped images. Segmentation reproducibility was assessed using 497 PySERA-extracted radiomic features via Spearman correlation, ICC, Wilcoxon tests, and MANOVA, while prognostic modeling compared supervised (SL) and semi-supervised learning (SSL) across 38 dimensionality reduction strategies and 24 classifiers. Six physicians qualitatively evaluated masks across seven domains, including clinical meaningfulness, boundary quality, prognostic value, trust, and workflow integration. VNet achieved the best performance (Dice = 0.83, IoU = 0.71), radiomic stability (mean correlation = 0.76, ICC = 0.65), and predictive accuracy under SSL (accuracy = 0.88, F1 = 0.83). SSL consistently outperformed SL across models. Radiologists favored VNet for peritumoral representation and smoother boundaries, preferring AI-generated initial masks for refinement rather than replacement. These results demonstrate that integrating VNet with SSL yields accurate, reproducible, and clinically trusted CT-based lung cancer prognosis, highlighting a feasible path toward physician-centered AI translation.
title Click, Predict, Trust: Clinician-in-the-Loop AI Segmentation for Lung Cancer CT-Based Prognosis within the Knowledge-to-Action Framework
topic Computer Vision and Pattern Recognition
F.2.2; I.2.7
url https://arxiv.org/abs/2510.17039