Abstract
Background: Using deep learning to automatically locate and identify diseases in chest X-rays (CXRs) faces numerous difficulties. The two most prominent are the lack of data labeled with disease locations and poor model transferability between datasets. This study aims to tackle these problems.
Methods: We built a new form of bounding box dataset and developed a two-stage deep learning model for disease localization and identification in CXRs. Unlike all previous datasets, this dataset marks anomalous regions in CXRs but not the corresponding diseases. This design reduces annotation labor and the potential for errors associated with image labeling. The two-stage model combines the robustness of a region proposal network, a feature pyramid network, and multi-instance learning techniques. We trained and validated our model with the new bounding box dataset and the CheXpert dataset. Then, we tested its classification and localization performance on an external dataset, the official test split of ChestX-ray14.
Results: For classification, the mean area under the receiver operating characteristic curve (AUC) of our model on the CheXpert validation dataset was 0.912, which was 0.021 higher than that of the baseline model. The mean AUC of our model on the external testing set was 0.784, whereas the state-of-the-art model achieved 0.773. The localization results were comparable to those of state-of-the-art models.
Conclusion: Our model exhibits good transferability between datasets. The new bounding box dataset proved useful and offers an alternative technique for compiling disease localization datasets.
Keywords: Chest X-ray, Disease localization, CNN, Deep learning, Region proposal, Computer aided diagnosis.