| Main authors: | , , |
|---|---|
| Format: | Digital resource |
| Language: | |
| Published: | Zenodo 2025 |
| Online access: | https://doi.org/10.5281/zenodo.15100011 |
Table of contents:
- <div> <p>Four files used in the paper “CREMSA: Compressed Indexing of (Ultra) Large Multiple Sequence Alignments” are made available here for reproducibility:</p> <ul> <li><code>random_datasets_len10000_num30000.zip</code>: An archive of artificial FASTA files generated as described in the paper.</li> <li><code>HIV1_ALL_2022_genome_DNA.fasta.xz</code>: A multiple sequence alignment of 5,381 HIV-1 genomes, <a href="https://www.hiv.lanl.gov/content/sequence/NEWALIGN/align.html">retrieved from the Los Alamos National Laboratory</a> in March 2025.</li> <li><code>nextstrain_groups_LANL-HIV-DB_HIV_genome_timetree.jsonl.gz</code>: A JSONL file, as produced by <a href="https://nextstrain.org/groups/LANL-HIV-DB/HIV/genome">Nextstrain</a>, describing the phylogeny of 3,090 of the 5,381 HIV-1 genomes in the previous file.</li> <li><code>MFS_1.fasta.xz</code>: A multiple sequence alignment of 214,283 protein sequences of the Major Facilitator Superfamily (MFS), <a href="https://www.ebi.ac.uk/interpro/download/pfam/">retrieved from Pfam</a> in March 2025.</li> </ul> </div>
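
The two `.fasta.xz` files listed above are xz-compressed alignments, so they can be streamed without decompressing them to disk first. The following is a minimal sketch, not part of the deposited material: it assumes plain FASTA text encoded as UTF-8, and the helper names `read_fasta_xz` and `alignment_shape` are hypothetical, chosen only for illustration. It uses Python's standard `lzma` module to iterate over records and to check that every row of the alignment has the same width.

```python
import lzma
from typing import Iterator, Tuple


def read_fasta_xz(path: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an xz-compressed FASTA file.

    Assumption: the file is plain FASTA text encoded as UTF-8.
    """
    header = None
    chunks = []
    with lzma.open(path, "rt", encoding="utf-8") as handle:
        for raw_line in handle:
            line = raw_line.strip()
            if line.startswith(">"):
                # A new record starts: emit the previous one, if any.
                if header is not None:
                    yield header, "".join(chunks)
                header = line[1:].strip()
                chunks = []
            elif line and header is not None:
                # Sequence data may be wrapped over several lines.
                chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def alignment_shape(path: str) -> Tuple[int, int]:
    """Return (number of sequences, alignment width).

    Raises ValueError if two rows differ in length, which would mean
    the file is not a valid multiple sequence alignment.
    """
    count = 0
    width = None
    for header, seq in read_fasta_xz(path):
        if width is None:
            width = len(seq)
        elif len(seq) != width:
            raise ValueError(
                f"{header!r} has length {len(seq)}, expected {width}"
            )
        count += 1
    return count, width or 0
```

Calling `alignment_shape("HIV1_ALL_2022_genome_DNA.fasta.xz")` on a downloaded copy would report how many sequences the file holds and how wide the alignment is, which can be compared against the counts stated in the description.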