Abstract
Background: The acquisition and exchange of meaningful, integrated, and accurate information are at the forefront of the combat against COVID-19; still, there are many countries whose health systems are disrupted. Moreover, no one is adequately equipped for COVID-19 contingencies. Many organizations have established static information systems to manage the information. This fact presents numerous issues, including delays, inconsistencies, and inaccuracies in COVID-19 information collected for pandemic control and monitoring.
Objective: This paper presents a semantic representation of COVID-19 data, a domain ontology to facilitate measurement, clarification, linking, and sharing. We automatically generate a computer- intelligible knowledge base from COVID-19 case information, which contains machineunderstandable information. Furthermore, we have anticipated an ontology population algorithm from tabular data that delivers interoperable, consistent, and accurate content with COVID-19 information.
Methods: We utilized the tabula package to extract the tables from PDF files and user NLP libraries to sort and rearrange tables. The proposed algorithm was then applied to all instances to automatically add to the input ontology using the Owlready Python module. Moreover, to evaluate the performance, SPARQL queries were used to retrieve answers to competency questions.
Results: When there is an equivalence relationship, the suggested algorithm consistently finds the right alignments and performs at its best or very close to it in terms of precision. Moreover, a demonstration of algorithm performance and a case study on COVID-19 data to information management and visualization of the populated data are also presented.
Conclusion: This paper presents an ontology learning/matching tool for ontology and populating instances automatically to ontology by emphasizing the importance of a unit's distinguishing features by unit matching.
Graphical Abstract
[http://dx.doi.org/10.1186/s13326-021-00245-1] [PMID: 34275487]
[http://dx.doi.org/10.1007/s00354-021-00136-0] [PMID: 34667368]
[http://dx.doi.org/10.1145/371920.372105]
[http://dx.doi.org/10.1177/0165551519827892]
[http://dx.doi.org/10.2174/1872212113666190211141415]
[http://dx.doi.org/10.2174/2213275910801030162]
[http://dx.doi.org/10.1145/1142473.1142595]
[http://dx.doi.org/10.1109/ACCESS.2020.2973928]
[http://dx.doi.org/10.1007/s10694-019-00891-z]
[http://dx.doi.org/10.1145/1031171.1031289]
[http://dx.doi.org/10.1007/11891451_22]
[http://dx.doi.org/10.1006/knac.1993.1008]
[http://dx.doi.org/10.1016/j.websem.2005.10.001]
[http://dx.doi.org/10.5121/ijwest.2010.1301]
[http://dx.doi.org/10.1007/s00354-021-00129-z] [PMID: 34305259]
[http://dx.doi.org/10.1186/s13638-017-0993-1] [PMID: 29263717]
[http://dx.doi.org/10.1007/978-3-642-16248-0_55]
[http://dx.doi.org/10.1371/journal.pone.0179488] [PMID: 28644863]
[http://dx.doi.org/10.4018/IJKSS.2020070102]
[http://dx.doi.org/10.1115/DETC2009-86544]
[http://dx.doi.org/10.1016/j.knosys.2012.06.002]
[http://dx.doi.org/10.1136/amiajnl-2011-000163] [PMID: 21508414]
[http://dx.doi.org/10.1109/ALPIT.2007.30]
[http://dx.doi.org/10.1007/11926078_21]
[http://dx.doi.org/10.1007/3-540-45816-6_32]
[http://dx.doi.org/10.1017/S0269888900007797]
[http://dx.doi.org/10.1007/978-3-319-09846-3]
[http://dx.doi.org/10.5121/ijwest.2017.8401]
[http://dx.doi.org/10.1016/j.artmed.2017.07.002] [PMID: 28818520]
[http://dx.doi.org/10.1016/j.dss.2014.01.001]
[http://dx.doi.org/10.1145/508791.509008]