Table 1 Summary statistics for the tested collections. The row “Integers in colors” reports the total number of reference IDs required to encode all colors, i.e., the sum of the set sizes over all colors, \(\sum_{i} |C_i|\).

From: Fulgor: a fast and compact k-mer index for large-scale matching and color queries

 

| Collection | E. coli (EC) | S. Enterica (SE) | SE | SE | SE | SE | Gut Bacteria (GB) |
|---|---|---|---|---|---|---|---|
| Genomes | 3,682 | 5,000 | 10,000 | 50,000 | 100,000 | 150,000 | 30,691 |
| Distinct colors (\(\times 10^6\)) | 5.59 | 2.69 | 4.24 | 13.92 | 19.36 | 23.61 | 227.80 |
| Integers in colors (\(\times 10^9\)) | 5.74 | 5.77 | 15.68 | 133.49 | 303.53 | 490.04 | 10.04 |
| \(k\)-mers in dBG (\(\times 10^6\)) | 170.65 | 104.69 | 239.88 | 806.23 | 1,018.69 | 1,194.44 | 13,936.86 |
| Unitigs in dBG (\(\times 10^6\)) | 9.31 | 4.95 | 8.24 | 30.64 | 41.16 | 49.60 | 566.39 |
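
The "Integers in colors" quantity defined in the caption, \(\sum_{i} |C_i|\), can be illustrated with a minimal sketch. It assumes each color \(C_i\) is held as a Python set of integer reference IDs. The function name `integers_in_colors` and the toy data are hypothetical and are not taken from the paper or from the Fulgor code base.

```python
from typing import Iterable, Set


def integers_in_colors(colors: Iterable[Set[int]]) -> int:
    """Return the sum of the set sizes over all colors, i.e. sum_i |C_i|."""
    return sum(len(color) for color in colors)


# Toy example (made-up data): three distinct colors over four references.
toy_colors = [{0, 1, 2}, {1}, {0, 3}]
total = integers_in_colors(toy_colors)
print(total)  # 3 + 1 + 2 = 6
```

Each reference ID is counted once per color that contains it, so a reference shared by many colors adds to the total many times.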