Abstract
Background: Structural alignment of ribose nucleic acid (RNA) is one of the most challenging multi-objective real world problems from the field of bioinformatics.
Objective: RNA molecules are less stable; hence they require inclusion of most stable secondary structure during their alignment. Therefore, the structural alignment requires the consideration of similarity score and structure score, as two objectives. Trade-off between these two objectives exists since obtaining optimum similarity score at concurrent optimum structure score is not possible. This paper presents artificial bee colony algorithm based three level multi-objective approach for performing structural alignment of RNA sequences, namely MO-3LABC.
Methods: Algorithm firstly builds the secondary structure of all sequences at minimum free energy (MFE). Then sequence alignment is performed in level one at average percent sequence identity (APSI) score based gap length, optimized by ABC algorithm. Level two now builds the secondary structure of these aligned sequences based on base-pair probability and co-variation. Now the scores of level one and level two move towards level three for multi-objective optimization at Pareto optimality criteria with few additional strategies.
Results: The results of MO-3LABC are compared with an already established efficient strategy MO-TLPSO; multi-objective two level strategy based on particle swarm optimization. The outputs are compared for pairwise and multiple sequence alignment datasets at prediction accuracy and solution quality criteria.
Conclusion: MO-3LABC is found significantly better than MO-TLPSO at all the four evaluation criteria for both the datasets.
Keywords: RNA structural alignment, artificial bee colony algorithm, non-dominated solutions, multiple sequence alignment, multi-objective optimization, pareto optimal solutions, conflicting objectives.
Graphical Abstract
[http://dx.doi.org/10.1093/bioinformatics/btq439] [PMID: 20679330]
[http://dx.doi.org/10.1093/bioinformatics/btt188] [PMID: 23620356]
[http://dx.doi.org/10.1186/1471-2105-6-73] [PMID: 15790387]
[http://dx.doi.org/10.1186/1471-2105-8-271] [PMID: 17662141]
[http://dx.doi.org/10.1186/1471-2105-8-130] [PMID: 17445273]
[http://dx.doi.org/10.1371/journal.pcbi.0030193] [PMID: 17937495]
[http://dx.doi.org/10.1093/bioinformatics/btm146] [PMID: 17459961]
[http://dx.doi.org/10.1371/journal.pcbi.0030065] [PMID: 17432929]
[http://dx.doi.org/10.1093/bioinformatics/btm525] [PMID: 18006551]
[http://dx.doi.org/10.1186/1471-2105-9-521] [PMID: 19061486]
[http://dx.doi.org/10.1093/bioinformatics/bti577] [PMID: 16020472]
[http://dx.doi.org/10.1093/bioinformatics/btm272] [PMID: 17537756]
[http://dx.doi.org/10.1093/bioinformatics/btn177] [PMID: 18586747]
[http://dx.doi.org/10.1093/bioinformatics/btp580] [PMID: 19808876]
[http://dx.doi.org/10.1371/journal.pcbi.0030149] [PMID: 17696604]
[http://dx.doi.org/10.1109/ICNN.1995.488968]
[http://dx.doi.org/10.1016/j.swevo.2017.02.002]
[http://dx.doi.org/10.1016/j.swevo.2015.09.008]
[http://dx.doi.org/10.1073/pnas.0401799101] [PMID: 15123812]
[http://dx.doi.org/10.1093/bioinformatics/bti550] [PMID: 15972285]
[http://dx.doi.org/10.1186/1748-7188-1-19] [PMID: 17062125]
[http://dx.doi.org/10.1016/j.asoc.2007.05.007]
[http://dx.doi.org/10.1007/s10462-012-9328-0]
[http://dx.doi.org/10.1109/4235.996017]
[http://dx.doi.org/10.1093/nar/gki541] [PMID: 15860779]
[http://dx.doi.org/10.1093/nar/22.22.4673] [PMID: 7984417]
[http://dx.doi.org/10.1093/nar/gkf436] [PMID: 12136088]
[http://dx.doi.org/10.1093/bioinformatics/btl142] [PMID: 16613908]