Name | Description | \(\sigma\) | No. of sequences | Ref. sequence length | r |
---|
chr19 | Human Chromosome 19 | 5 | 103 | 59,126,939 | 33,799,549 |
sars-cov2 | SARS-CoV2 genome | 14 | 205,813 | 29,783 | 6,207,939 |
- In column 3, we specify the alphabet size \(\sigma\), in column 4 the number of sequences in the dataset, in column 5 the reference sequence length, and in column 6 the number of runs r in the BWT. The total dataset has size 6 GB