Table 1 WordCluster predictions of CpG clusters*

From: WordCluster: detecting clusters of DNA words and genomic elements

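| Method | Number of islands | Length (bp) ± SD | GC (%) ± SD | OE ± SD |
|--------|-------------------|------------------|-------------|---------|
| cpg50  | 198703 | 273.2 ± 246.4 | 63.8 ± 7.5 | 0.855 ± 0.265 |
| cpgISc | 194725 | 218.7 ± 200.1 | 65.6 ± 7.7 | 0.916 ± 0.273 |
| cpgISg | 204238 | 202.6 ± 183.8 | 66.3 ± 7.5 | 0.930 ± 0.274 |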

*Basic statistics of CpG island predictions using three different distance models: cpg50 (median), cpgISc (chromosome intersection) and cpgISg (genome intersection). For each model, the number of predicted islands, their length, their G+C content and their observed-to-expected (OE) ratio are shown. Note that the original cpg50 algorithm predicts 198702 islands, i.e. one fewer than WordCluster with the median model. This difference is due to the changes introduced in the handling of N-runs (see main text).
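As a rough illustration of the per-island statistics reported in the table, the sketch below computes the G+C content and a CpG observed-to-expected ratio for a single sequence. It assumes the commonly used definition OE = (number of CpG × sequence length) / (number of C × number of G); the table does not state which formula WordCluster applies, so both the formula and the function names `gc_content` and `cpg_obs_exp` are assumptions made for illustration only.

```python
def gc_content(seq: str) -> float:
    """Return the G+C content of seq as a percentage of its length."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def cpg_obs_exp(seq: str) -> float:
    """Return the CpG observed-to-expected ratio of seq.

    Assumed definition (not confirmed by the source):
        OE = (#CpG * length) / (#C * #G)
    Sequences without any C or any G yield 0.0 to avoid division by zero.
    """
    seq = seq.upper()
    n_c = seq.count("C")
    n_g = seq.count("G")
    if n_c == 0 or n_g == 0:
        return 0.0
    # "CG" cannot overlap with itself, so str.count finds every CpG dinucleotide.
    n_cpg = seq.count("CG")
    return n_cpg * len(seq) / (n_c * n_g)


if __name__ == "__main__":
    island = "CGCGAT"  # toy sequence, not real data
    print(f"GC = {gc_content(island):.1f} %, OE = {cpg_obs_exp(island):.3f}")
```

Applying these two functions to every predicted island and then taking the mean and standard deviation would give summary values of the kind shown in the GC and OE columns, under the stated assumption about the OE definition.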