Saved in:
Bibliographic Details
Main Authors: Khosla, Nathan, Lesinski, Jake M., Haywood-Alexander, Marcus, deMello, Andrew J., Richards, Daniel A.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.04413
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916556216205312
author Khosla, Nathan
Lesinski, Jake M.
Haywood-Alexander, Marcus
deMello, Andrew J.
Richards, Daniel A.
author_facet Khosla, Nathan
Lesinski, Jake M.
Haywood-Alexander, Marcus
deMello, Andrew J.
Richards, Daniel A.
contents CRISPR-based diagnostics have gained increasing attention as biosensing tools able to address limitations in contemporary molecular diagnostic tests. To maximise the performance of CRISPR-based assays, much effort has focused on optimizing the chemistry and biology of the biosensing reaction. However, less attention has been paid to improving the techniques used to analyse CRISPR-based diagnostic data. To date, diagnostic decisions typically involve various forms of slope-based classification. Such methods are superior to traditional methods based on assessing absolute signals, but still have limitations. Herein, we establish performance benchmarks (total accuracy, sensitivity, and specificity) using common slope-based methods. We compare the performance of these benchmark methods with three different quadratic empirical distribution function statistical tests, finding significant improvements in diagnostic speed and accuracy when applied to a clinical data set. Two of the three statistical techniques, the Kolmogorov-Smirnov and Anderson-Darling tests, report the lowest time-to-result and highest total test accuracy. Furthermore, we developed a long short-term memory recurrent neural network to classify CRISPR-biosensing data, achieving 100% specificity on our model data set. Finally, we provide guidelines on choosing the classification method and classification method parameters that best suit a diagnostic assays needs.
format Preprint
id arxiv_https___arxiv_org_abs_2501_04413
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Machine Learning and statistical classification of CRISPR-Cas12a diagnostic assays
Khosla, Nathan
Lesinski, Jake M.
Haywood-Alexander, Marcus
deMello, Andrew J.
Richards, Daniel A.
Quantitative Methods
Machine Learning
CRISPR-based diagnostics have gained increasing attention as biosensing tools able to address limitations in contemporary molecular diagnostic tests. To maximise the performance of CRISPR-based assays, much effort has focused on optimizing the chemistry and biology of the biosensing reaction. However, less attention has been paid to improving the techniques used to analyse CRISPR-based diagnostic data. To date, diagnostic decisions typically involve various forms of slope-based classification. Such methods are superior to traditional methods based on assessing absolute signals, but still have limitations. Herein, we establish performance benchmarks (total accuracy, sensitivity, and specificity) using common slope-based methods. We compare the performance of these benchmark methods with three different quadratic empirical distribution function statistical tests, finding significant improvements in diagnostic speed and accuracy when applied to a clinical data set. Two of the three statistical techniques, the Kolmogorov-Smirnov and Anderson-Darling tests, report the lowest time-to-result and highest total test accuracy. Furthermore, we developed a long short-term memory recurrent neural network to classify CRISPR-biosensing data, achieving 100% specificity on our model data set. Finally, we provide guidelines on choosing the classification method and classification method parameters that best suit a diagnostic assays needs.
title Machine Learning and statistical classification of CRISPR-Cas12a diagnostic assays
topic Quantitative Methods
Machine Learning
url https://arxiv.org/abs/2501.04413