Bibliographic Details
Main Authors: Sokolovskii, Roman; Agarwal, Parv; Croquevielle, Luis Alberto; Zhou, Zijian; Heinis, Thomas
Format: Preprint
Published: 2024
Subjects: Information Theory
Online Access: https://arxiv.org/abs/2406.04141
author Sokolovskii, Roman
Agarwal, Parv
Croquevielle, Luis Alberto
Zhou, Zijian
Heinis, Thomas
contents Encoding information in combinations of pre-synthesised deoxyribonucleic acid (DNA) strands (referred to as motifs) is an interesting approach to DNA storage that could potentially circumvent the prohibitive costs of nucleotide-by-nucleotide DNA synthesis. Based on our analysis of an empirical data set from HelixWorks, we propose two channel models for this setup (with and without interference) and analyse their fundamental limits. We propose a coding scheme that approaches those limits by leveraging all information available at the output of the channel, in contrast to earlier schemes developed for a similar setup by Preuss et al. We highlight an important connection between channel capacity curves and the fundamental trade-off between synthesis (writing) and sequencing (reading), and offer a way to mitigate an exponential growth in decoding complexity with the size of the motif library.
format Preprint
id arxiv_https___arxiv_org_abs_2406_04141
institution arXiv
publishDate 2024
record_format arxiv
title Coding Over Coupon Collector Channels for Combinatorial Motif-Based DNA Storage
topic Information Theory
url https://arxiv.org/abs/2406.04141