RNA-RNA interaction prediction using genetic algorithm

Montaseri, Soheila; Zare-Mirakabad, Fatemeh; Moghadam-Charkari, Nasrollah

doi:10.1186/1748-7188-9-17

Research
Open access
Published: 29 June 2014

RNA-RNA interaction prediction using genetic algorithm

Soheila Montaseri¹,
Fatemeh Zare-Mirakabad^2,3 &
Nasrollah Moghadam-Charkari⁴

Algorithms for Molecular Biology volume 9, Article number: 17 (2014) Cite this article

3513 Accesses
6 Citations
1 Altmetric
Metrics details

Abstract

Background

RNA-RNA interaction plays an important role in the regulation of gene expression and cell development. In this process, an RNA molecule prohibits the translation of another RNA molecule by establishing stable interactions with it. In the RNA-RNA interaction prediction problem, two RNA sequences are given as inputs and the goal is to find the optimal secondary structure of two RNAs and between them. Some different algorithms have been proposed to predict RNA-RNA interaction structure. However, most of them suffer from high computational time.

Results

In this paper, we introduce a novel genetic algorithm called GRNAs to predict the RNA-RNA interaction. The proposed algorithm is performed on some standard datasets with appropriate accuracy and lower time complexity in comparison to the other state-of-the-art algorithms. In the proposed algorithm, each individual is a secondary structure of two interacting RNAs. The minimum free energy is considered as a fitness function for each individual. In each generation, the algorithm is converged to find the optimal secondary structure (minimum free energy structure) of two interacting RNAs by using crossover and mutation operations.

Conclusions

This algorithm is properly employed for joint secondary structure prediction. The results achieved on a set of known interacting RNA pairs are compared with the other related algorithms and the effectiveness and validity of the proposed algorithm have been demonstrated. It has been shown that time complexity of the algorithm in each iteration is as efficient as the other approaches.

Background

Major successes have been achieved in the treatment of some cancers, including colon, breast and pancreatic by suppressing the gene expression involved in the development of these diseases using RNA-RNA interaction. The interaction between two RNAs is known as the newest and the most efficient method for gene silencing. It has been shown that the small interfering RNAs (siRNAs) can be used for silencing their target mRNAs [1]. Furthermore, small RNAs (sRNAs) play an important role in the regulation of gene expression. They usually bind to their target mRNAs to prevent their translation [2].

In RNA-RNA Interaction Prediction (RRIP) problem, two RNA sequences are given as inputs and the goal is to find the minimum free energy Secondary Structure of Interacting RNAs (SSIR). To tackle this problem, some algorithms have been proposed by research groups. Andronescu et al. [3] proposed a method based on dynamic programming in which two RNA sequences are concatenated as a single sequence and its secondary structure is calculated [3]. Another approach calculates the partition function of a secondary structure complex of multiple interacting RNAs [4]. This method rigorously extends those models of secondary structure to the multi-stranded case. The tools such as RNAhybrid [5], UNAFold [6] and RNAduplex from ViennaRNA package[7] reduce computational time complexity by ignoring all the internal base pairings in both RNAs. RNAup [8, 9] extends the standard partition function approach to RNA secondary structures and employs the single (unpaired) regions on each RNA to find the interaction between them. RNAplex [10, 11] finds the possible hybridization sites for RNA in the large RNA databases based on a slight simplification of the energy model. In this model, the loop energy is assumed to be a function of the loop size.

Recently, a novel algorithm based on the multiple context free grammars was introduced in [12]. Accordingly, two real values called transition and emission probabilities are specified for each rule of the grammar. Then, a derivation tree is constructed for the grammar based on the rules with high probability.

In heuristic based approaches, inRNAs [13] firstly predicts the loop regions in the native structure of each sequence, and then finds the optimal non-conflicting interaction between two RNAs. IntaRNA [14] combines the accessibility of target sites as well as the existence of a user-definable seed to find RNA-RNA interaction. Minimizing the joint free energy between two RNA molecules under a number of energy models with growing complexity was introduced in [2]. Another interesting heuristic approach for this problem was presented in [15]. This algorithm employs some dot matrices representation of all possible base pairs for finding the secondary structure of each RNA and between the two RNAs.

An approximation algorithm was presented in [1], where an RNA-RNA interaction graph is created in which every edge represents a possible bond in or between two RNAs. A set of edges is found to maximize the number of bonds. A statistical sampling algorithm was introduced in [16] based on some modifications to the grammars. It calculates the interaction probabilities for any given single region on RNA. RactIP [17] predicts RNA-RNA interaction using integer programming. Accordingly, it uses the approximate information of the internal and external base pairing probabilities of joint structures as an objective function of integer programming. PETcofold [18] employs covariance information in the internal and external base pairs to predict SSIR of two multiple alignments of RNA sequences. InteRNA [19] reduces the time and space complexity of RRIP problem described by Alkan et al. [2] using dynamic programming sparsification.

One of the pitfalls of the most existing algorithms is their high computational time to predict RNA-RNA interaction, while a number of them have not been performed on some RNA pairs to predict binding sites between two single regions of RNAs. Alkan et al. [2] proved that RNA-RNA interaction prediction is an NP-complete problem.

In this paper, we propose a new genetic algorithm called GRNAs as an appropriate solution for the RRIP problem. This algorithm can be performed on some standard RNA pairs with high accuracy. In this method, at first, all possible stems in each RNA as well as all possible hybrid regions between two RNAs are extracted from a dot matrix. The initial population consists of some individuals, where each of them is an SSIR obtained from some randomly extracted stems and hybrid regions of the dot matrix. The minimum free energy is computed for each individual as a fitness value. For each generation, some individuals are selected to mate based on their fitness values and form a new population. Then, mutation operation is done on a few individuals. The population generation terminates when the free energy of an individual is minimum enough. Finally, one of the best individuals is selected as an optimal SSIR. The algorithm is conducted on some real datasets and compared with some other algorithms to investigate efficiency and validity of the proposed method. The time and space complexity of the proposed method in each iteration is 0(l² + |P|), where l and |P| indicate the sum of the length of two RNAs and the length of an individual, respectively. The results show that the accuracy of the algorithm is as efficient as the other related methods.

The rest of this paper is organized as follows. In Section 2, some definitions and notations are described. In Section 3, a genetic algorithm called GRNAs is presented to predict RNA-RNA interaction. The results and conclusion are discussed in Sections 4 and 5, respectively.

Definitions and notations

An RNA molecule is composed of a long, usually single-stranded chain of nucleotide units; adenine (A), cytosine (C), guanine (G) and uracil (U). Thus, R = r₁r₂ … r_n in 5 ' - 3 ' direction is an RNA sequence, where |R| = n and r_i ∈ {A, C, G, U} (1 ≤ i ≤ n). The RNA structure is formed by the creation of hydrogen bonds between Watson-Crick complementary bases (A - U and C - G) and a Wobble base pair (G - U).

In an RNA secondary structure, each base interacts with at most one other base, and no base pairs cross each other. Two bases r_i and r_j (1 ≤ i < j ≤ n) of the base pair (r_i, r_j) are represented by ' (' and ') ', respectively and each unpaired base is declared by '. '. A stem consists of subsequent base pairs and a loop is one sequence of consecutive unpaired bases.

A secondary structure of two interacting RNAs, R₁ and R₂, contains the set of stems in each RNA and the hybrid regions between two RNAs as well as loops. Each hybrid region consists of subsequent hybrid base pairs between two RNAs. Two bases r_i ∈ R₁ and r_j ∈ R₂ of the hybrid base pair (r_i, r_j) are represented by ' [' and '] ', respectively.

Example. Let R₁ = CGGUUUGAGGUCCG and R₂ = ACUACCGAAAAGUU be two RNA sequences. The SSIR of the two RNAs is shown as follows;

\begin{array}{c} 5' - CGGUUUGAGGUCCG - 3' & 5' - ACUACCGAAAAGUU - 3' \\ ((([[[.. [[[))) & (((]]] ..]]]))) \end{array}

In this example, each RNA has one stem. In the left hand RNA, one stem is found by the production of bonding between CGG and the reverse CCG (GCC). There are two hybrid regions between the sequences R₁ and R₂. The first one is produced by binding between UUU and the reverse AAA (AAA). The second one is generated by binding between GGU and the reverse ACC (CCA).

A new genetic algorithm for RNA-RNA interaction prediction

Genetic algorithm is an optimization method based on evolutionary biology that is widely used to solve search and optimization problems [20–22]. In this section, a new genetic algorithm, GRNAs, is presented to predict RNA-RNA interaction. In the following, initial population, fitness function, crossover and mutation operations are introduced.

Initial population

In the proposed algorithm, two RNA sequences R ' = r ' ₁r ' ₂ … r ' _n (|R ' | = n) and R " = r " ₁r " ₂ … r " _m (|R " | = m) are given as inputs. The two RNAs R ' and R " are converted to the sequence R = r₁r₂ … r_l as follows;

r_{i} = \{\begin{array}{c} r'_{i} & 1 \leq i \leq n, \\ r "_{i - n - 1} & n + 2 \leq i \leq l \\ N & i = n + 1, \end{array},

where N is an arbitrary character to distinguish between two sequences and l = m + n + 1.

A dot matrix $M_{l \times l}^{R}$ is made, where the axes in the dot matrix correspond to the two sequences R and reverse R,as follows;

M^{R} [i, j] = \{\begin{array}{c} 1 & if (r_{i}, r_{l - j + 1}) \in \{(A, U), (U, A), (C, G), (G, C), (U, G), (G, U)\}, \\ 0 & else, \end{array}

where r_i and r_l-j+1 (1 ≤ i, j ≤ l) are the i-th and l-j+1-th nucleotides in the sequence R = r₁r₂ … r_l, respectively. Each right-skewed consecutive value of 1’s which is parallel to the main diagonal in the dot matrix is selected as a sub-diagonal. Each sub-diagonal shows a possible stem in each RNA or hybrid region between two RNAs. Set D^R shows all sub-diagonals in the dot matrix as follows;

D^{R} = \{< i, j, t > \begin{array}{c} | & 1 \leq i \leq l & 1 \leq j \leq l & 1 \leq t \leq l - 1 \end{array}\},

where i and j indicate the start position of the row and the column of a sub-diagonal with t+1 consecutive 1’s, respectively. Hence, each <i,j,t> is a set of consecutive base pairs as follows;

< i, j, t > = \{(r_{i}, r_{l - j + 1}), \dots, (r_{i + t}, r_{l - j - t + 1})\},

According to the prior knowledge, we know that Watson-Crick base pairs occur more than Wobble in RNA structures. In this regard, we compute the percent of G-U pairs on our dataset approximated 14%. So, G-U pairs are removed from the sub-diagonals including more than 14% G-U pairs. For each d₁ = < i₁, j₁, t₁ > ∈ D^R and d₂ = < i₂, j₂, t₂ > ∈ D^R, d₁ ∝ d₂ is defined as;

d_{1} \propto d_{2} = \{(r_{i}, r_{j}) \in d_{1} | \exists (r_{k}, r_{g}) \in d_{2}, i = k \lor i = g \lor j = k \lor j = g\},

where d₁ ∝ d₂ represents all base pairs in d₁ overlapping with d₂.

For each individual P, a |D^R| -tuple is randomly made as;

I = < x_{1}, x_{2}, \dots, x_{|D^{R}|} >, \cdot x_{k} \in \{0, 1\},

where x_k (1 ≤ k ≤ |D^R|) indicates the k-th sub-diagonal in I. In other words, individual P contains those sub-diagonals that their related x in I is equal to 1. Here, x_k = 1 means sub-diagonal d_k ∈ P, while x_k = 0 points to d_k ∉ P. Then, the individual P is constructed as follows;

P = \{C_{k} | x_{k} = 1, C_{k} \neq ϕ\},

(1)

where the set C_k (modified k-th sub-diagonal) is obtained as follows;

C_{k} = \{(r_{i}, r_{j}) \in d_{k}\} - \begin{array}{c} \cup_{\begin{array}{c} _{1 \leq t \leq k - 1} \\ C_{t} \in P \end{array}} & d_{k} \propto C_{t} \end{array} .

Here, C_k ∈ P is a set of base pairs in d_k without any common base pairs in the previous sub-diagonals, d_t(1 ≤ t < k), of individual P. Finally, C_k is modified by removing the lonely base pairs from it as follows;

C_{k} = C_{k} - \{(r_{i}, r_{j}) \in C_{k} | (r_{i + 1}, r_{j - 1}) \notin C_{k} & (r_{i - 1}, r_{j + 1}) \notin C_{k}\} .

Notice that a set of the produced individuals creates an initial population.

Fitness function

For each individual P, let S and H represent two RNAs secondary structures and binding sites between the two RNAs, respectively. Therefore, the fitness function is defined as follows;

Fitness (P) = MFE (S) + MFE (H),

where for C ∈ {S, H}, MFE(C) denotes the minimum free energy of structure C. We apply RNAeval.exe [7] to compute minimum free energies of secondary structures and binding sites separately.

Crossover

Crossover operation is performed between the individuals with the rate of 0.9. The good and mediocre individuals are transferred to the next population. The remaining individuals are consecutively selected for crossover operation.

Let P₁ and P₂ are selected as parents. For each individual P_i, 1 ≤ i ≤ 2, a |D^R| -tuple is defined as follows:

I_{i} = < x_{i 1}, x_{i 2}, \dots, x_{i | D^{R} |} >, x_{ij} = \{\begin{array}{c} 1 & if C_{j} \in P_{i}, \\ 0 & else . \end{array}

In this procedure, a random position k, 1 ≤ k ≤ |D^R|, is selected and I₁ and I₂ are crossed. Then, I ' ₁ and I ' ₂ are formed as follows:

\begin{array}{l} I'_{1} = < x_{11}, x_{12}, \dots, x_{1 k}, x_{2 (k + 1)}, \dots, x_{2 | D^{R} |} >, \\ I'_{2} = < x_{21}, x_{22}, \dots, x_{2 k}, x_{1 (k + 1)}, \dots, x_{1 | D^{R} |} > . \end{array}

In the following, two new individuals P ' ₁ and P ' ₂ are generated from I ' ₁ and I ' ₂ similar to the described method in the initial population (refers to the Equation 1), respectively.

Mutation

Mutation operation is done with the rate of 0.1 on a few randomly selected individuals in each generation. Assume that P is an individual selected for mutation. For the individual P, a |D^R| -tuple is obtained by:

I = < x_{1}, x_{2}, \dots, x_{|D^{R}|} >, x_{i} = \{\begin{array}{c} 1 & if C_{i} \in P, \\ 0 & else . \end{array}

An item x_j, where x_j = 1 is randomly selected and replaced to 0. Then, another x_k (x_k= 0) is replaced to 1. The new individual, P, is obtained from the changed I based on the proposed method in initial population (refers to the Equation 1). Finally, if C_k ∉ P (all the base pairs of d_k have overlapping with the existence sub-diagonals in P), the other x_i (x_i = 0) is selected to replace with 1. This process continues until C_k ∈ P or the defined number of generations is reached. When mutation is performed on a number of individuals, they will be increasingly sorted based on their fitness values.

Termination of the GRNAs algorithm

The GRNAs algorithm terminates when the best individual in definite generations will not be changed or the defined number of generations be reached. After the termination of the algorithm, one of the best individuals is selected as the best folding of two RNAs and the best interaction between the two RNAs.

Time and space complexity

We have obtained the time and space complexity of GRNAs in each iteration. Making the dot matrix needs the complexity of O(l²) where l exposes sum of the length of the two RNAs. Let h and |P| be the number of individuals and the length of an individual P. The time complexity of creating the initial population is O(h. |P| ²). We set h = 40 and |P| = max{|R ' |, |R " |}, so h can be ignored. Sorting individuals based on their fitness values requires O(|P|. h. log h). Crossover and mutation operations take O(|P| ²) and O(|P|), respectively. Thus, the time complexity in each iteration in the proposed algorithm is O(h. |P|(|P| + log h)). The maximum number of iteration is at most I = 20. Therefore, the time complexity of the algorithm is O(I. h. |P|(|P| + log h) + l²) that is simplified with O(l² + |P|).

On the other hand, for storing the h individuals of length |P| we need O(h. |P|) space complexity. Furthermore, the population in the algorithm uses both dot matrix and an array of sub-diagonals. Hence, the storage complexity of these two types is O(l²), where l denotes sum of the length of the two RNAs. Thus the total space complexity of GRNAs is O(h|P| + l²) which is simplified with O(l² + |P|).

Results and discussion

The GRNAs has been performed on a machine with two-Core Intel(R) Duo processor T6670 2.20 GHz and 4 GB RAM to predict the interaction structure between two RNAs. The proposed genetic algorithm is performed on two well-known datasets of RNA-RNA interactions. The first set contains: R1inv-R2inv, Tar-Tar*, DIS-DIS, CopA-CopT and IncRNA₅₄-RepZ in the Escherichia coli bacteria [12]. The joint secondary structures of this dataset include kissing hairpins. We evaluate the performance of joint secondary structure prediction of this dataset.

Also, this algorithm is carried on the second set of datasets with their binding sites including some RNA pairs called: DsrA-Rpos, GcvB-argT, GcvB-dppA, GcvB-gltI, GcvB-livK, GcvB-livJ, GcvB-oppA, GcvB-STM4351, IstR-tisAB, MicA-ompA, MicA-lamB, MicC-ompC, MicF-ompF, OxyS-fhlA, RyhB-sdhD, RyhB-sodB, SgrS-ptsG and Spot42-galK [14]. This dataset is used to appraise the performance of RNA-RNA interaction prediction in binding sites.

To evaluate the prediction accuracy of the GRNAs, F-measure (F) and Matthews Correlation Coefficient (MCC) [18] are calculated using sensitivity (Sn) and positive predictive value (PPV). Assume that the number of correctly predicted base pairs, the number of false predicted base pairs and the number of unpredicted base pairs are indicated by TP, FP and FN, respectively. So, Sn, PPV, F, and MCC are defined as follows:

Sn = TP / (TP + FN), PPV = TP / (TP + FP),

F = (2 \times Sn \times PPV) / (Sn + PPV),

MCC = \sqrt{Sn \times PPV .}

Table 1 shows the results of joint secondary structure prediction of our algorithm, GRNAs, in Matthews Correlation Coefficient [18]. It is also compared to the state-of-the-art methods such as PETcofold [18], the sparsified version of inteRNA [19], Pairfold [3], RactIP[17] and RNAcofold [7]. The MCC evaluates the joint structure, i.e. both the binding sites between the two RNAs and the secondary structure of each single RNA. In two pairs MicA-ompA and OxyS-fhlA, PETcofold has the best MCC value and in other two pairs RyhB-uof-fur and RyhB-sodB, GRNAs has the highest MCC value.

Table 1 The results of joint secondary structure prediction of GRNAs in MCC in comparison to the PETcofold and other joint structure prediction methods such as RNAcofold, inteRNA, Pairfold and RactIP

Full size table

We also compared GRNAs with four state-of-the-art methods: inRNAs, IntaRNA, RNAup and RactIP. Table 2 shows the results of prediction in binding sites in sensitivity and positive predictive values on the datasets [12, 14] using the proposed approach and mentioned methods. Here, only external base pairs are considered to measure accuracy. According to the second half of Table 2, the average positive predictive value on datasets [12, 14] is 96.03%. This table shows that our method is comparable with the existing methods. Table 3 indicates the accuracy of GRNAs in F-measure with considering binding sites. In GRNAs, the average F-measure is 89%. The results shown in Tables 2 and 3 indicate that GRNAs works as efficient as the other methods in average sensitivity, positive predictive value and F-measure.

Table 2 The results of binding sites prediction of GRNAs in sensitivity and positive predictive value on the datasets[12, 14] in comparison to inRNAs, IntaRNA, RNAup and RactIP

Full size table

Table 3 The results of binding sites prediction of GRNAs in F-measure on the datasets[12, 14] in comparison to inRNAs, IntaRNA, RNAup and RactIP

Full size table

Our genetic algorithm randomly selects the sub-diagonals to make individuals. Therefore different individuals with variety sub-diagonals are constructed. Due to the nature of proposed genetic approach, some of the RNA-RNA interactions can be predicted more accurate than the other algorithms. For example the accuracy rate of Tar-Tar* is obtained 100%, while maximum accuracy of the other approaches is 90.9%.

We compare the computational time complexity of GRNAs and state-of-the-are methods. The time and space complexity of several algorithms (TIRNA [15], App (approximation algorithm to predict SSIR) [14], ripalign [23], and other methods in the Tables 1, 2 and 3) are given in Table 4. As it is shown, the time complexity of GRNAs in each iteration is O(l² + |P|), where l and |P| indicate sum of the length of the two RNAs and the length of an individual, respectively. Also, Space complexity of the proposed method is O(l² + |P|).

Table 4 Comparison of time and space complexity of some algorithms

Full size table

Conclusion

In this paper, a new genetic algorithm was introduced for solving RNA-RNA interaction prediction problem. In this algorithm, all possible stems in each RNA and hybrid regions between two RNAs are extracted from a dot matrix showing all possible base pairs. Initial population is formed based on some stems and hybrid regions of the dot matrix. Minimum free energy is considered as a fitness function. Crossover operation is done between some consecutive individuals in the population. Mutation is taken on a few randomly selected individuals. Population generation continues until the minimum free energy of the best individual becomes minimal enough. Finally, one of the best individuals is selected to form RNA-RNA interaction structure. The proposed algorithm was tested on several RNA-RNA interaction datasets. The experimental results indicate a high accuracy of GRNAs. Furthermore, time and space complexity of GRNAs is as efficient as the other related studies.

Availability

The program of GRNAs is available at http://mostafa.ut.ac.ir/grnas.

References

Mneimneh S: On the approximation of optimal structures for RNA-RNA interaction. Trans Comput Biol Bioinform. 2009, 6: 682-688.
Article CAS Google Scholar
Alkan C, Karakoc E, Nadeau JH, Sahinalp C, Zhang K: RNA-RNA interaction prediction and antisense RNA target search. J Comput Biol. 2006, 13: 267-282.
Article CAS PubMed Google Scholar
Andronescu M, Zhang ZC, Condon A: Secondary structure prediction of interacting RNA molecules. J Mol Biol. 2005, 345: 987-1001.
Article CAS PubMed Google Scholar
Dirks R, Bios J, Schaeffer JM, Winfree E, Pierce N: Thermodynamic Analysis of Interacting Nucleic Acid Strands. Soc Ind Appl Math. 2007, 49: 65-88.
Google Scholar
Rehmsmeier M, Steffen P, Hochsmann M, Giegerich R: Fast and effective prediction of microRNA/target duplexes. RNA. 2004, 10: 1507-1517.
Article CAS PubMed PubMed Central Google Scholar
Markham NR, Zuker M: UNAFold: Software for Nucleic Acid Folding and Hybridization. Methods Mol Biol. 2008, 453: 3-31.
Article CAS PubMed Google Scholar
Bernhart S, Tafer H, Mückstein U, Flamm C, Peter F, Stadler P, Hofacker I: Partition Function and Base Pairing Probabilities of RNA Heterodimers. Algorithms Mol Biol. 2006, 1: 3-
Article PubMed PubMed Central Google Scholar
Mückstein U, Tafer H, Bernhart S, Hernandez-Rosales M, Vogel J, Stadler P, Hofacker I: Translational control by RNA-RNA interaction: Improved computation of RNA-RNA binding thermodynamics. Bioinform Res Dev. 2009, 13: 114-127.
Article Google Scholar
Mückstein U, Tafer H, Hackermuller J, Bernhart SH, Stadler PF, Hofacker IL: Thermodynamics of RNA-RNA binding. Bioinformatics. 2006, 22: 177-1182.
Article Google Scholar
Tafer H, Hofacker I: RNAplex: a fast tool for RNA-RNA interaction search. Bioinformatics. 2008, 24: 2657-2663.
Article CAS PubMed Google Scholar
Tafer H, Amman F, Eggenhofer F, Stadler PF, Hofacker IL: Fast accessibility-based prediction of RNA-RNA interactions. Bioinformatics. 2011, 27 (14): 1934-1940.
Article CAS PubMed Google Scholar
Kato Y, Akutsu T, Seki H: A grammatical approach to RNA-RNA interaction prediction. Pattern Recogn. 2009, 42: 531-538. 10.1016/j.patcog.2008.08.004.
Article Google Scholar
Salari R, Backofen R, Sahinalp SC: Fast prediction of RNA-RNA interaction. Algorithms Mol Biol. 2010, 5: 5-
Article PubMed PubMed Central Google Scholar
Busch A, Richter AS, Backofen R: IntaRNA: efficient prediction of bacterial sRNA targets incorporating target site accessibility and seed regions. Bioinformatics. 2008, 24: 2849-2856.
Article CAS PubMed PubMed Central Google Scholar
Montaseri S, Moghadam-Charkari N, Zare-Mirakabad F: A heuristic approach to RNA-RNA interaction prediction. J Theor Biol. 2012, 300: 206-211.
Article CAS PubMed Google Scholar
Huang FWD, Qin J, Reidys CM, Stadler PF: Target prediction and a statistical sampling algorithm for RNA-RNA interaction. Bioinformatics. 2010, 26: 175-181.
Article CAS PubMed PubMed Central Google Scholar
Kato Y, Sato K, Hamada M, Watanabe Y, Asai K, Akutsu T: RactIP: fast and accurate prediction of RNA-RNA interaction using integer programming. Bioinformatics. 2010, 26: i460-i466.
Article CAS PubMed PubMed Central Google Scholar
Seemann SE, Richter AS, Gesell T, Backofen R, Gorodkin J: PETcofold: predicting conserved interactions and structures of two multiple alignments of RNA sequences. Bioinformatics. 2011, 2: 211-219.
Article Google Scholar
Salari R, Mohl M, Will S, Sahinalp SC, Backofen R: Time and space efficient RNA-RNA interaction prediction via sparse folding. 2010, RECOMB'10, Research in Computational Molecular Biology
Chapter Google Scholar
Goldberg DE: Genetic Algorithms in Search, Optimization and Machine learning, Reading. 1989, MA: Addison-Wesley
Google Scholar
Holland HJ: Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology. 1992, MA: Control and Artificial Intelligence. MIT Press. Cambridge
Google Scholar
Miettinen K, Neittaanmaki P, Periaux J: Evolutionary Programming. Genetic Programming and Industrial Applications. Evolutionary Algorithms in Engineering and Computer Science: Recent Advances in Genetic Algorithms, Evolution Strategies. 1999, New York: Wiley
Google Scholar
Li AX, Marz M, Qin J, Reidys CM: RNA–RNA interaction prediction based on multiple sequence alignments. Bioinformatics. 2011, 4: 456-463.
Article Google Scholar

Download references

Acknowledgment

This research was in part supported by a grant from IPM. Also, we would like to thank Professor Mohammad Ganjtabesh for improving the proposed algorithm of the paper.

Author information

Authors and Affiliations

Department of Mathematics, Statistics and Computer Sciences, University of Tehran, Tehran, Iran
Soheila Montaseri
Faculty of Mathematics & Computer Science, Amirkabir University of Technology, Tehran, Iran
Fatemeh Zare-Mirakabad
School of Biological Science, Institute for Research in Fundamental Sciences (IPM), P.O. Box: 19395- 5746, Tehran, Iran
Fatemeh Zare-Mirakabad
Faculty of Electrical & Computer Engineering, Tarbiat Modares University, Tehran, Iran
Nasrollah Moghadam-Charkari

Authors

Soheila Montaseri
View author publications
You can also search for this author in PubMed Google Scholar
Fatemeh Zare-Mirakabad
View author publications
You can also search for this author in PubMed Google Scholar
Nasrollah Moghadam-Charkari
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fatemeh Zare-Mirakabad.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

SM participated in the design of the algorithm, performed the experiments and drafted the manuscript. FZM contributed in the design of the algorithm, performed the experiments, drafted the manuscript and supervised the project. NMC contributed in the design of the algorithm and drafted the manuscript. All authors contributed to the writing of the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Montaseri, S., Zare-Mirakabad, F. & Moghadam-Charkari, N. RNA-RNA interaction prediction using genetic algorithm. Algorithms Mol Biol 9, 17 (2014). https://doi.org/10.1186/1748-7188-9-17

Download citation

Received: 19 January 2013
Accepted: 18 June 2014
Published: 29 June 2014
DOI: https://doi.org/10.1186/1748-7188-9-17

RNA-RNA interaction prediction using genetic algorithm

Abstract

Background

Results

Conclusions

Background

Definitions and notations

A new genetic algorithm for RNA-RNA interaction prediction

Initial population

Fitness function

Crossover

Mutation

Termination of the GRNAs algorithm

Time and space complexity

Results and discussion

Conclusion

Availability

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Rights and permissions

About this article

Cite this article

Keywords

Algorithms for Molecular Biology

Contact us

RNA-RNA interaction prediction using genetic algorithm

Abstract

Background

Results

Conclusions

Background

Definitions and notations

A new genetic algorithm for RNA-RNA interaction prediction

Initial population

Fitness function

Crossover

Mutation

Termination of the GRNAs algorithm

Time and space complexity

Results and discussion

Conclusion

Availability

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Algorithms for Molecular Biology

Contact us