Skip to main content

Table 3 P-values for a selection of PROSITE patterns of low (or moderate) complexities using the complete proteome of Escherichia coli (NC_000913.faa).

From: Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data

PROSITE signature

n

Exact

SSA with no offset

SSA (offset)

RGD

215

5.35 × 10-1

5.91 × 10-1

5.55 × 10-1(2)

ER_TARGET

72

4.01 × 10-2

5.21 × 10-2

4.70 × 10-2(2)

PPASE

3

2.60 × 10-2

2.76 × 10-2

2.63 × 10-2(6)

ALDEHYDE_DEHYDR_GLU

12

1.99 × 10-5

2.41 × 10-5

1.95 × 10-5(7)

PROKAR_NTER_METHYL

10

6.79 × 10-3

8.01 × 10-3

5.10 × 10-3(20)

GLY_RADICAL_1

6

1.58 × 10-6

1.86 × 10-6

1.60 × 10-6(8)

PEP_ENZYMES_PHOS_SITE

4

1.49 × 10-10

1.74 × 10-10

1.49 × 10-10(12)

PUR_PYR_PR_TRANSFER

7

2.15 × 10-14

2.75 × 10-14

2.10 × 10-14(12)