Skip to main content

Table 1 WordCluster predictions of CpG clusters*

From: WordCluster: detecting clusters of DNA words and genomic elements

Method # Length ± SD GC ± SD OE ± SD
cpg50 198703 273.2 ± 246.4 63.8 ± 7.5 0.855 ± 0.265
cpgISc 194725 218.7 ± 200.1 65.6 ± 7.7 0.916 ± 0.273
cpgISg 204238 202.6 ± 183.8 66.3 ± 7.5 0.930 ± 0.274
  1. *Basic statistic of CpG island predictions using three different distance models: cpgISg (genome intersection), cpg50 (Median) and cpgISc (chromosome intersection). The number of predicted islands, the length, the G+C content and the observed to expected ratios are shown. Note that the original cpg50 algorithm predicts 198702 islands, i.e. one less than WordCluster with the median model. This is due to the changes introduced regarding the N-runs (see main text).