Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Artículo científico |
| Language: | en |
| Published: |
Bioinformatics advances
2025
|
| Online Access: | https://pubmed.ncbi.nlm.nih.gov/39735572/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- SeqLengthPlot v2.0: an all-in-one, easy-to-use tool for visualizing and retrieving sequence lengths from FASTA files. Domínguez-Pérez, Dany Agüero-Chapin, Guillermin Leone, Serena Modica, Maria Vittoria Accurate sequence length profiling is essential in bioinformatics, particularly in genomics and proteomics. Existing tools like SeqKit and the Trinity toolkit provide basic sequence statistics but often fall short in offering comprehensive analytics and plotting options. For instance, SeqKit is a very complete and fast tool for sequence analysis, delivering useful metrics (e.g. number of sequences, average, minimum, and maximum lengths) and can return sequences either shorter or longer (but not both at once) for a given length. Similarly, Trinity's Perl-based scripts provide detailed contig length distributions (e.g. N50, median, and average lengths) but do not include the total number of sequences or offer graphical representations of the data. Given that key sequence analysis tasks are often distributed across multiple tools, we introduce , an all-in-one, easy-to-use Python-based tool. Through a simple command-line interface, this straightforward tool enables users to split input FASTA files (nucleotide and protein) into two distinct files based on a customizable sequence length cutoff. It also automatically retrieves the resulting FASTA files, generates length distribution plots, and provides comprehensive statistical summaries. SeqLengthPlot_v2.0.2 can be accessed at https://github.com/danydguezperez/SeqLengthPlot/releases/tag/v2.0.2.