Skip to main content

Advertisement

Table 4 Exact P-values for a selection of PROSITE patterns of high complexities using the complete proteome of Escherichia coli (NC_000913.faa). We use an order 1 homogeneous Markov model estimated over the data set.

From: Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data

PROSITE signature n Exact
PILI_CHAPERONE 10 3.27 × 10-46
SIGMA54_INTERACT × 2 12 1.58 × 10-42
EFACTOR_GTP 8 4.43 × 10-20
ALDEHYDE_DEHYDR_CYS 11 5.63 × 10-9
ADH_ZINC 12 8.93 × 10-16
THIOLASE_1 5 5.76 × 10-9
SUGAR_TRANSPORT_1 18 3.75 × 10-8
FGGY_KINASES_2 5 2.14 × 10-4
PTS_EIIA_TYPE_2_HIS 8 7.19 × 10-19
MOLYBDOPTERIN_PROK_3 11 2.59 × 10-35
SUGAR_TRANSPORT_2 10 1.22 × 10-5