Table 1 Number of simple and structured motifs on biological datasets

From: Direct vs 2-stage approaches to structured motif finding

| UASH-URS1_5 | SISMA_smile | SISMA_speller |
| --- | --- | --- |
| Box (7,1) | 1,452 (10) | 420 (10) |
| Box (10,2) | 5,472 (5) | 92 (5) |
| Structured Motif: (7,1) - [1,50] - (10,2) | 16,662 (5) | 14 (5) |

| UASH-URS1_10 | SISMA_smile | SISMA_speller |
| --- | --- | --- |
| Box (3,1) | 64 (1000) | 64 (1000) |
| Box (5,2) | 1,024 (1000) | 942 (1000) |
| Box (9,1) | 103 (10) | 55 (10) |
| Structured Motif: (3,1) - [1,1] - (5,2) - [1,200] - (9,1) | 2,309,173 (70) | 7,241 (70) |

| KAR4P | SISMA_smile | SISMA_speller |
| --- | --- | --- |
| Box (3,1) | 64 (2000) | 64 (2000) |
| Box (4,1) | 256 (1000) | 256 (1000) |
| Box (2,1) | 16 (6000) | 16 (6000) |
| Structured Motif: (3,1) - [2,2] - (4,1) - [2,2] - (3,1) - [1,1] - (2,1) | 101,750 (50) | 858 (50) |

  1. The table is divided into three sub-tables, one for each dataset. The following applies to each sub-table. There is one row for each box type involved, plus one row for the type of structured motif to be found. There is one column for each of the two versions of our SISMA algorithm (SISMA_smile and SISMA_speller). Each cell reports two values: (1) the number of simple or structured motifs in the input sequences that conform to the given specification, and (2) in parentheses, the (approximate) average number of occurrences of each simple or structured motif found.
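
For readers who want to work with these values programmatically, the cell format described in the note can be read with a small parser. The following is only a minimal sketch in Python, not part of the SISMA software: the names `Cell` and `parse_cell` are hypothetical, and it assumes every cell has the form "count (average)", with optional thousands separators and optional spaces inside the parentheses.

```python
import re
from typing import NamedTuple


class Cell(NamedTuple):
    """One table cell (hypothetical helper type, not from the paper)."""
    motifs: int           # number of motifs conforming to the specification
    avg_occurrences: int  # approximate average occurrences per motif


# Matches e.g. "1,452 (10)" or "2,309,173 ( 70)".
_CELL_PATTERN = re.compile(r"^\s*([\d,]+)\s*\(\s*([\d,]+)\s*\)\s*$")


def parse_cell(text: str) -> Cell:
    """Split a cell into its motif count and its parenthesised average."""
    match = _CELL_PATTERN.match(text)
    if match is None:
        raise ValueError(f"unrecognised cell: {text!r}")
    count, average = (int(group.replace(",", "")) for group in match.groups())
    return Cell(count, average)


if __name__ == "__main__":
    print(parse_cell("2,309,173 ( 70)"))
```

For instance, `parse_cell("16,662 (5)").motifs` gives the SISMA_smile motif count for the UASH-URS1_5 structured motif as an integer, which makes it straightforward to compare the two columns of a row.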