Abstract
Drug discovery and (nano)material design projects demand the in silico analysis of large datasets of compounds and their corresponding properties/activities, as well as the retrieval and virtual screening of additional structures in an effort to identify new potent hits. This is a demanding procedure that requires combining various tools with different input and output formats. To automate the required data analysis, we have developed tools that facilitate a variety of important tasks and allow the construction of workflows that simplify the handling, processing and modeling of cheminformatics data, providing time- and cost-efficient solutions that are reproducible and easier to maintain. We therefore present a toolbox of >25 processing modules, the Enalos+ nodes, that provide useful operations within the KNIME platform for users interested in the nanoinformatics and cheminformatics analysis of chemical and biological data. With a user-friendly interface, the Enalos+ nodes provide a broad range of important functionalities, including data mining and retrieval from large available databases and tools for robust and predictive model development and validation. The Enalos+ nodes are available through KNIME as add-ins and offer valuable tools for extracting useful information and analyzing experimental and virtual screening results in a chem- or nanoinformatics framework. In addition, in an effort to (i) allow big data analysis through the Enalos+ KNIME nodes, (ii) accelerate time-demanding computations performed within the Enalos+ KNIME nodes and (iii) propose new time- and cost-efficient nodes integrated within the Enalos+ toolbox, we have investigated and verified the advantage of GPU calculations within the Enalos+ nodes. Demonstration datasets, tutorial and educational videos allow the user to easily understand the functions of the nodes that can be applied for in silico analysis of data.
Keywords: Enalos+ KNIME nodes, chemoinformatics-aided material design, Nanoinformatics, Enalos Suite, Chemical database, KNIME, Efficient data mining, PubChem.