| Main author: | |
|---|---|
| Format: | Digital resource |
| Language: | |
| Published: | Zenodo, 2025 |
| Online access: | https://doi.org/10.5281/zenodo.14963685 |
Table of contents:
- <p><strong>EVEREST</strong> (pip<strong>E</strong>line for <strong>V</strong>iral ass<strong>E</strong>mbly and cha<strong>R</strong>act<strong>E</strong>ri<strong>S</strong>a<strong>T</strong>ion) is a comprehensive, end-to-end pipeline designed for virus discovery and characterization. Implemented in Nextflow, it processes Illumina single- and paired-end reads through five key phases: pre-processing, filtering, de novo assembly, refinement, and classification. The pipeline ensures high-quality data by trimming reads, removing host sequences, eliminating duplicates, and applying digital normalization. It then assembles viral genomes using a de novo assembly strategy, clusters similar contigs, captures viral genomes, and assesses their quality. Finally, <strong>EVEREST</strong> classifies viral contigs using the NCBI (nucleotide) and UniProt (amino acid) databases, providing a robust framework for identifying and characterizing viruses from sequencing data.</p>
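
The record does not say which tools EVEREST uses for duplicate removal or digital normalization, so the following is only a toy Python sketch of the two ideas, not the pipeline's Nextflow code. It assumes reads are plain strings, treats only exact copies as duplicates, and uses a median k-mer abundance rule for normalization; the function names and the `k` and `cutoff` values are hypothetical choices for illustration.

```python
from collections import Counter
from statistics import median


def kmers(seq, k):
    """Return every overlapping substring of length k in seq."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def remove_duplicates(reads):
    """Drop exact duplicate reads, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for read in reads:
        if read not in seen:
            seen.add(read)
            unique.append(read)
    return unique


def digital_normalization(reads, k=4, cutoff=2):
    """Keep a read only while the median count of its k-mers is below cutoff.

    Counts come from the reads kept so far, so regions that are already
    well covered stop contributing new reads. k and cutoff are illustrative
    values, not EVEREST settings.
    """
    counts = Counter()
    kept = []
    for read in reads:
        read_kmers = kmers(read, k)
        if not read_kmers:
            continue  # read is shorter than k, nothing to count
        if median(counts[km] for km in read_kmers) < cutoff:
            kept.append(read)
            counts.update(read_kmers)
    return kept


raw_reads = ["ACGTACGT"] * 5 + ["TTTTGGGG"]
deduplicated = remove_duplicates(raw_reads)
normalized = digital_normalization(raw_reads)
```

Exact-match deduplication collapses the five identical reads into one, while normalization applied to the raw reads keeps redundant reads only until their k-mers reach the cutoff. A real pipeline would work on FASTQ records with quality scores and paired-end mates, which this sketch ignores.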