Skip to main content

Table 3 Datasets used in the experiments

From: Suffix sorting via matching statistics

Name

Description

\(\sigma\)

No. of sequences

Ref. sequence length

r

chr19

Human Chromosome 19

5

103

59,126,939

33,799,549

sars-cov2

SARS-CoV2 genome

14

205,813

29,783

6,207,939

  1. In column 3, we specify the alphabet size \(\sigma\), in column 4 the number of sequences in the dataset, in column 5 the reference sequence length, and in column 6 the number of runs r in the BWT. The total dataset has size 6 GB