Abstract
Background: Understanding the subcellular location of proteins is essential for studying molecular and protein functions. Intracellular proteins must interact with appropriate molecules at the right time and in the right subcellular location to fulfill their functions. Therefore, the precise prediction of protein subcellular location can help elucidate complex cellular physiological response processes and is of great importance to research human diseases and pathophysiology.
Methods: Traditional approaches to protein subcellular pattern analysis are primarily based on feature concatenation and classifier design. However, highly complex structures and poor performance are prominent shortcomings of these traditional approaches. In this paper, we report the development of an end-to-end pixel-enlightened neural network (IDRnet) based on Interactive Pointwise Attention (IPA) for the prediction of protein subcellular locations using immunohistochemistry (IHC) images. Patch splitting was adopted to reduce interference caused by tissue microarrays, such as bubbles, edges, and blanks. The IPA unit was constructed with a Depthwise and Pointwise convolution (DP) unit, and a pointwise pixel-enlightened algorithm was applied to modify and enrich protein subcellular location information.
Results: IDRnet was able to achieve 97.33% accuracy in single-label IHC patch images and 88.59% subset accuracy in mixed-label IHC patch images, and outperformed other mainstream deep learning models. In addition, Gradient-weighted Class Activation Mapping (Grad-CAM) was adopted to visualize the spatial information of proteins in the feature map, which helped to explain and understand the IHC image's abstract features and concrete expression form.
Graphical Abstract
[http://dx.doi.org/10.1242/jcs.089110] [PMID: 22010196]
[http://dx.doi.org/10.1200/jco.2016.34.2_suppl.199]
[http://dx.doi.org/10.1126/science.1260419] [PMID: 25613900]
[http://dx.doi.org/10.1016/j.cell.2011.02.013] [PMID: 21376230]
[http://dx.doi.org/10.1093/bioinformatics/btaa656] [PMID: 32692832]
[http://dx.doi.org/10.1126/science.aal3321] [PMID: 28495876]
[http://dx.doi.org/10.1093/bioinformatics/btp002] [PMID: 19164304]
[http://dx.doi.org/10.1098/rsif.2011.0843] [PMID: 22552919]
[http://dx.doi.org/10.1093/bioinformatics/btw723] [PMID: 27993784]
[http://dx.doi.org/10.1093/bioinformatics/btx431] [PMID: 29036616]
[http://dx.doi.org/10.1021/pr7007626] [PMID: 18435555]
[http://dx.doi.org/10.1093/bib/bbaa313] [PMID: 33320936]
[PMID: 33989156]
[http://dx.doi.org/10.1093/bioinformatics/btab127] [PMID: 33630066]
[http://dx.doi.org/10.1007/s11704-016-6309-5]
[http://dx.doi.org/10.11999/JEIT190657]
[http://dx.doi.org/10.1093/bioinformatics/btt320] [PMID: 23740749]
[http://dx.doi.org/10.1016/j.neucom.2013.10.034]
[http://dx.doi.org/10.1093/bioinformatics/btv521] [PMID: 26363175]
[http://dx.doi.org/10.1186/s12859-019-3136-3] [PMID: 31655541]
[http://dx.doi.org/10.1109/TPAMI.2013.50] [PMID: 23787338]
[http://dx.doi.org/10.1109/MSP.2012.2205597]
[http://dx.doi.org/10.1186/s12859-020-03731-y] [PMID: 32907537]
[http://dx.doi.org/10.1109/TCBB.2019.2917429] [PMID: 31107658]
[http://dx.doi.org/10.1093/bib/bbab278] [PMID: 34337652]
[http://dx.doi.org/10.1007/s11704-017-6538-2]
[http://dx.doi.org/10.1145/3065386]
[http://dx.doi.org/10.1093/bioinformatics/btz909] [PMID: 31804670]
[http://dx.doi.org/10.1007/s12539-021-00452-5] [PMID: 34138403]
[http://dx.doi.org/10.1534/g3.116.033654] [PMID: 28391243]
[http://dx.doi.org/10.1093/nar/gky1049] [PMID: 30395287]
[http://dx.doi.org/10.1002/prot.26244] [PMID: 34546597]
[http://dx.doi.org/10.1093/bioinformatics/btz844] [PMID: 31722369]
[http://dx.doi.org/10.48550/arXiv.1704.04861]
[http://dx.doi.org/10.1609/aaai.v34i07.7016]
[http://dx.doi.org/10.48550/arXiv.1412.6980]
[http://dx.doi.org/10.1109/ICCV.2017.74]