Saved in:
Bibliographic Details
Main Authors: Xu, Xuechun, Jaldén, Joakim
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2405.00833
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914780703358976
author Xu, Xuechun
Jaldén, Joakim
author_facet Xu, Xuechun
Jaldén, Joakim
contents Recent advancements in nanopore sequencing technology, particularly the R10 nanopore from Oxford Nanopore Technology, have necessitated the development of improved data processing methods to utilize their potential for more than 9-mer resolution fully. The processing of the ion currents predominantly utilizes neural network-based methods known for their high basecalling accuracy but face developmental bottlenecks at higher resolutions. In light of this, we introduce the Helicase Hidden Markov Model (HHMM), a novel framework designed to incorporate the dynamics of the helicase motor protein alongside the nucleotide sequence during nanopore sequencing. This model supports the analysis of millions of distinct states, enhancing our understanding of raw ion currents and their alignment with nucleotide sequences. Our findings demonstrate the utility of HHMM not only as a potent visualization tool but also as an effective base for developing advanced basecalling algorithms. This approach offers a promising avenue for leveraging the full capabilities of emerging high-resolution nanopore sequencing technologies.
format Preprint
id arxiv_https___arxiv_org_abs_2405_00833
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Modelling the nanopore sequencing process with Helicase HMMs
Xu, Xuechun
Jaldén, Joakim
Signal Processing
Genomics
Recent advancements in nanopore sequencing technology, particularly the R10 nanopore from Oxford Nanopore Technology, have necessitated the development of improved data processing methods to utilize their potential for more than 9-mer resolution fully. The processing of the ion currents predominantly utilizes neural network-based methods known for their high basecalling accuracy but face developmental bottlenecks at higher resolutions. In light of this, we introduce the Helicase Hidden Markov Model (HHMM), a novel framework designed to incorporate the dynamics of the helicase motor protein alongside the nucleotide sequence during nanopore sequencing. This model supports the analysis of millions of distinct states, enhancing our understanding of raw ion currents and their alignment with nucleotide sequences. Our findings demonstrate the utility of HHMM not only as a potent visualization tool but also as an effective base for developing advanced basecalling algorithms. This approach offers a promising avenue for leveraging the full capabilities of emerging high-resolution nanopore sequencing technologies.
title Modelling the nanopore sequencing process with Helicase HMMs
topic Signal Processing
Genomics
url https://arxiv.org/abs/2405.00833