Saved in:
Bibliografiske detaljer
Hovedforfatter: Patricia
Format: Recurso digital
Sprog:
Udgivet: Zenodo 2025
Online adgang:https://doi.org/10.5281/zenodo.14963685
Tags: Tilføj Tag
Ingen Tags, Vær først til at tagge denne postø!
Indholdsfortegnelse:
  • <p><strong>EVEREST</strong> (pip<strong>E</strong>line for <strong>V</strong>iral ass<strong>E</strong>mbly and cha<strong>R</strong>act<strong>E</strong>ri<strong>S</strong>a<strong>T</strong>ion) is a comprehensive, end-to-end pipeline designed for virus discovery and characterization. Implemented in Nextflow, it processes Illumina single- and paired-end reads through five key phases: pre-processing, filtering, de novo assembly, refinement, and classification. The pipeline ensures high-quality data by trimming, removing host sequences, eliminating duplicates, and applying digital normalization. It then assembles viral genomes using a de novo assembly strategy, clusters similar contigs, captures viral genomes, and assesses their quality. Finally, <strong>EVEREST</strong> classifies viral contigs using the NCBI (nucleotide) and Uniprot (amino acid) databases, providing a robust framework for identifying and characterizing viruses from sequencing data.</p>