Abstract
Aim: The study aimed to reconstruct the protein-protein interaction network for the identification of essential proteins.
Background: In a living organism, essential proteins play an indispensable role in its survival and development. Hence, how to identify essential proteins from the protein interaction network (PIN) has become a hot topic in the field of bioinformatics. However, existing methods’ accuracies for identifying essential proteins are still limited due to the false positives of the protein-protein interaction data.
Objective: The objective of the study was to propose an efficient algorithm for the reconstruction of a protein-protein interaction network.
Methods: In this paper, a method for the refinement of PIN based on three kinds of biological data (subcellular localization data, protein complex data, and gene expression data) is proposed. Through evaluating each interaction within the original PIN, a refined clean PIN could be obtained. To verify the effectiveness of the refined PIN for the identification of essential proteins, we applied eight networkbased essential protein discovery methods (DC, BC, CC, LC, HC, SC, LAC, and NC) to it.
Results: Based on the obtained experimental results, we demonstrated that the precision for identifying essential proteins could be greatly improved by refining the original PIN using our method.
Conclusion: Our method could effectively enhance the protein-protein interaction network and improve the accuracy of identifying essential proteins. In the future, we plan to integrate more biological information to enhance our refinement method and apply it to more species and more PIN-based discovery tasks, like the identification of protein complexes or functional modules.
[http://dx.doi.org/10.1038/msb.2009.89] [PMID: 19953084]
[http://dx.doi.org/10.1093/nar/gkn858]
[http://dx.doi.org/10.1038/nchembio.2007.24] [PMID: 17710100]
[http://dx.doi.org/10.1109/BIBM.2018.8621551]
[http://dx.doi.org/10.1186/1471-2164-7-165] [PMID: 16817963]
[http://dx.doi.org/10.1016/j.compbiolchem.2014.01.011] [PMID: 24569026]
[http://dx.doi.org/10.1186/s40246-020-00263-7] [PMID: 32252824]
[http://dx.doi.org/10.1007/978-3-030-51862-2_4]
[http://dx.doi.org/10.1038/nature00935] [PMID: 12140549]
[http://dx.doi.org/10.1111/j.1440-1711.2005.01332.x] [PMID: 15877598]
[http://dx.doi.org/10.1046/j.1365-2958.2003.03697.x] [PMID: 14507372]
[http://dx.doi.org/10.1109/ICASSP40776.2020.9052965]
[http://dx.doi.org/10.1016/j.compbiolchem.2020.107324] [PMID: 32623358]
[http://dx.doi.org/10.1109/TCBB.2017.2665482] [PMID: 28186903]
[http://dx.doi.org/10.1109/ACCESS.2020.2964571]
[http://dx.doi.org/10.1038/35075138] [PMID: 11333967]
[http://dx.doi.org/10.1093/molbev/msi072] [PMID: 15616139]
[http://dx.doi.org/10.1371/journal.pcbi.1000140] [PMID: 18670624]
[http://dx.doi.org/10.1142/S0219720013410023] [PMID: 23796179]
[http://dx.doi.org/10.1109/TNB.2014.2337912] [PMID: 25122840]
[http://dx.doi.org/10.1016/j.tig.2007.04.005] [PMID: 17512629]
[http://dx.doi.org/10.1021/pr8008786] [PMID: 19231892]
[http://dx.doi.org/10.1186/1471-2105-11-505] [PMID: 20939873]
[http://dx.doi.org/10.1126/science.1158684] [PMID: 18719252]
[http://dx.doi.org/10.1371/journal.pcbi.1000817] [PMID: 20585543]
[http://dx.doi.org/10.1155/JBB.2005.96] [PMID: 16046814]
[http://dx.doi.org/10.1016/S0022-5193(03)00071-7] [PMID: 12782116]
[http://dx.doi.org/10.1103/PhysRevE.71.056103] [PMID: 16089598]
[http://dx.doi.org/10.1103/PhysRevLett.87.278701] [PMID: 11800921]
[http://dx.doi.org/10.1080/15427951.2013.865686]
[http://dx.doi.org/10.1016/j.compbiolchem.2011.04.002] [PMID: 21704260]
[http://dx.doi.org/10.1109/TCBB.2011.147] [PMID: 22084147]
[http://dx.doi.org/10.1186/s12859-016-1115-5] [PMID: 27586883]
[http://dx.doi.org/10.1002/pmic.201200277] [PMID: 23225755]
[http://dx.doi.org/10.1093/nar/30.1.303] [PMID: 11752321]
[http://dx.doi.org/10.1093/nar/gkj148] [PMID: 16381839]
[http://dx.doi.org/10.1093/nar/26.1.73] [PMID: 9399804]
[http://dx.doi.org/10.1093/nar/gkr974] [PMID: 22075990]
[http://dx.doi.org/10.1093/nar/gkr1029] [PMID: 22110037]
[http://dx.doi.org/10.1093/nar/gkr1030] [PMID: 22127867]
[http://dx.doi.org/10.1093/nar/gkp952] [PMID: 19910365]
[http://dx.doi.org/10.1371/journal.pone.0131418] [PMID: 26125187]
[http://dx.doi.org/10.1093/nar/30.1.207] [PMID: 11752295] [PMCID: PMC99122]
[http://dx.doi.org/10.1371/journal.pcbi.0010066] [PMID: 16322766]
[http://dx.doi.org/10.1371/journal.pone.0130743] [PMID: 26115027]
[http://dx.doi.org/10.1186/s12918-018-0573-y] [PMID: 29745838]
[http://dx.doi.org/10.1186/1752-0509-6-15] [PMID: 22405054]
[http://dx.doi.org/10.1093/molbev/msh004] [PMID: 14595100]
[http://dx.doi.org/10.1109/INFOCOM.2018.8486345]