Abstract
It is of great use to find out and clear up the interactions between enzymes and small molecules, for understanding the molecular and cellular functions of organisms. In this study, we developed a novel method for the prediction of enzyme-small molecules interactions based on machine learning approach. The biochemical and physicochemical description of proteins and the functional group composition of small molecules are used for representing enzyme-small molecules pairs. Tested by jackknife cross-validation, our predictor achieved an overall accuracy of 87.47%, showing an acceptable efficiency. The 39 features selected by feature selection were analyzed for further understanding of enzyme-small molecule interactions.
Keywords: Biochemical and physicochemical description of proteins, Chemical functional group, Feature selection, Forward Feature Selection, Incremental Feature Selection, Minimum Redundancy Maximum Relevance, jackknife cross-validation, NNA, KEGG, mRMR, FFS, hydrophobicity, FRET, RTK signaling, NMR