Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE3

Mauro      Leoncini; Manuela      Montangero; Karina      Panucia Tillán

doi:10.2174/157489361002150518122428

Abstract

Ensemble methods represent a relatively new approach to motif discovery that combines the results returned by "third-party" finders with the aim of achieving a better accuracy than that obtained by the single tools. Besides the choice of the external finders, another crucial element for the success of an ensemble method is the particular strategy adopted to combine the finders' results, a.k.a. learning function.

Results appeared in the literature seem to suggest that ensemble methods can provide noticeable improvements over the quality of the most popular tools available for motif discovery.

With the goal of better understanding potentials and limitations of ensemble methods, we developed a general software architecture whose major feature is the flexibility with respect to the crucial aspects of ensemble methods mentioned above. The architecture provides facilities for the easy addition of virtually any third-party tool for motif discovery whose code is publicly available, and for the definition of new learning functions. We present a prototype implementation of our architecture, called CE³ (Customizable and Easily Extensible Ensemble).

Using CE³, and available ensemble methods, we performed experiments with three well-known datasets. The results presented here are varied. On the one hand, they confirm that ensemble methods cannot be just considered as the universal remedy for "in-silico" motif discovery. On the other hand, we found some encouraging regularities that may help to find a general set up for CE³ (and other ensemble methods as well) able to guarantee substantial improvements over single finders in a systematic way.

Keywords: DNA binding, ensemble methods, motif discovery, software tool, transcription factor, XML.

« Previous Next »

Graphical Abstract

Rights & Permissions Print Cite

Article Metrics

25

4

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/157489361002150518122428	Print ISSN 1574-8936
Publisher Name Bentham Science Publisher	Online ISSN 2212-392X

Current Bioinformatics

Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE³

Abstract

Graphical Abstract

Current Bioinformatics

Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE3

Abstract Play Pause

Graphical Abstract

Related Journals

Related Books

Investigating Power and Limitations of Ensemble Motif Finders Using Metapredictor CE³

Abstract