Skip to main content

Table 4 Evaluation of contiguous motifs on protein synthetic data.

From: Evaluating deterministic motif significance measures in protein databases

Motif

Supp

ZScore

LogOdd

Pratt

IG

SSN

1

3710

1

2130

1

IYKQ

1

1533

2

11817

1

NDFNE

1

1

1

13483

1

PLMPES

1

1

2

4973

1

MRKMVTAG

1

1

6

9818

1

TKYEETGAFK

1

1

43

7350

1

DRTGMHSIFFLP

1

1

3

11721

1

MTENKVGESICPAAPN

1

1

29

9589

1

R m

1

0.0015

0.0919

1.128E-4

1

  1. Ranking results for eight synthetic protein datasets. Each dataset contains 50 sequences of length 300. Target motifs have a support of 100%. Motifs are ranked with Information-theoretic measures and support. Last row gives the R m values of each measure, where the best results are obtained by IG and support.