Algorithms for Molecular Biology

Table 1 Number of simple and structured motifs on biological datasets

From: Direct vs 2-stage approaches to structured motif finding

UASH-URS1_5
		SISMA_smile	SISMA_speller
Box (7,1)		1,452 (∼10)	420 (∼10)
Box (10,2)		5,472 (∼5)	92 (∼5)
Structured Motif
(7,1) - [1,50] - (10,2)		16,662 (∼5)	14 (∼5)
UASH-URS1_10
		SISMA_smile	SISMA_speller
Box (3,1)		64 (∼1000)	64 (∼1000)
Box (5,2)		1,024 (∼1000)	942 (∼1000)
Box (9,1)		103 (∼10)	55 (∼10)
Structured Motif
(3,1) - [ 1,1 ] - (5,2) - [ 1,200 ] - (9,1)		2,309,173 (∼ 70)	7,241 (∼ 70)
KAR4P
		SISMA_smile	SISMA_speller
Box (3,1)		64 (∼2000)	64 (∼2000)
Box (4,1)		256 (∼1000)	256 (∼1000)
Box (2,1)		16 (∼6000)	16 (∼6000)
Structured Motif
(3,1) - [2,2]- (4,1) - [2,2] -(3,1) -[1,1] - (2,1)		101,750 (∼ 50)	858 (∼ 50)

The table is divided in three (sub)tables, one for each dataset. The following information apply to each sub-table. There is a row corresponding to each box type involved and one more row corresponding to the type of structured motifs to be found. Also, there is a column for each of the two versions of our SISMA algorithm (SISMA_Smile and SISMA_Speller). Each cell reports two pieces of information: (1) the number of simple/structured motifs in the input sequences that conform to the given specifications, and (2) the corresponding (approximate) average number of occurrences of each simple/structured motif found.

Back to article page

ISSN: 1748-7188

Contact us

General enquiries: journalsubmissions@springernature.com