Skip to main content

Table 2 Biological meaning of WordCluster predictions*

From: WordCluster: detecting clusters of DNA words and genomic elements

Method #islands #TSS overlap #R13 overlap #Alu overlap #PhastCons overlap
cpg50 198703 12432 (6.3%) 30660 (15.4%) 80323 (40.4%) 48787 (24.6%)
cpgISc 194724 11926 (6.1%) 34567 (17.8%) 70144 (36.0%) 48930 (25.1%)
cpgISg 204238 12156 (6.0%) 37616 (18.4%) 70456 (34.5%) 52335 (25.6%)
  1. *Comparison of three WordCluster predictions of CG clusters (CpG islands) using three different distance models: cpgISg (genome intersection), cpg50 (median) and cpgISc (chromosome intersection). The overlap with two gene regions (TSS and R13), Alu elements and phylogenetically conserved PhastCons elements have been measured and both absolute numbers and percentages are given.