Generic placeholder image

Current Bioinformatics

Editor-in-Chief

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

Research Article

SCV Filter: A Hybrid Deep Learning Model for SARS-CoV-2 Variants Classification

Author(s): Han Wang and Jingyang Gao*

Volume 19, Issue 5, 2024

Published on: 16 November, 2023

Page: [434 - 445] Pages: 12

DOI: 10.2174/1574893618666230809121509

Price: $65

Abstract

Background: The high mutability of severe acute respiratory syndrome coronavirus 2(SARS-CoV-2) makes it easy for mutations to occur during transmission. As the epidemic continues to develop, several mutated strains have been produced. Researchers worldwide are working on the effective identification of SARS-CoV-2.

Objective: In this paper, we propose a new deep learning method that can effectively identify SARSCoV- 2 Variant sequences, called SCVfilter, which is a deep hybrid model with embedding, attention residual network, and long short-term memory as components.

Methods: Deep learning is effective in extracting rich features from sequence data, which has significant implications for the study of Coronavirus Disease 2019 (COVID-19), which has become prevalent in recent years. In this paper, we propose a new deep learning method that can effectively identify SARS-CoV-2 Variant sequences, called SCVfilter, which is a deep hybrid model with embedding, attention residual network, and long short-term memory as components.

Results: The accuracy of the SCVfilter is 93.833% on Dataset-I consisting of different variant strains; 90.367% on Dataset-II consisting of data collected from China, Taiwan, and Hong Kong; and 79.701% on Dataset-III consisting of data collected from six continents (Africa, Asia, Europe, North America, Oceania, and South America).

Conclusion: When using the SCV filter to process lengthy and high-homology SARS-CoV-2 data, it can automatically select features and accurately detect different variant strains of SARS-CoV-2. In addition, the SCV filter is sufficiently robust to handle the problems caused by sample imbalance and sequence incompleteness.

Other: The SCVfilter is an open-source method available at https://github.com/deconvolutionw/ SCVfilter.

[1]
Centers for Disease Control and Prevention. SARS-CoV-2 variant classifications and definitions. https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/variant-surveillance/variant-info.html2021
[2]
Tao K, Tzou PL, Nouhin J, et al. The biological and clinical significance of emerging SARS-CoV-2 variants. Nat Rev Genet 2021; 22(12): 757-73.
[http://dx.doi.org/10.1038/s41576-021-00408-x] [PMID: 34535792]
[3]
Singh OP, Vallejo M, El-Badawy IM, Aysha A, Madhanagopal J, Mohd Faudzi AA. Classification of SARS-CoV-2 and non-SARS-CoV-2 using machine learning algorithms. Comput Biol Med 2021; 136: 104650.
[http://dx.doi.org/10.1016/j.compbiomed.2021.104650] [PMID: 34329865]
[4]
Arslan H. COVID-19 prediction based on genome similarity of human SARS-CoV-2 and bat SARS-CoV-like coronavirus. Comput Ind Eng 2021; 161: 107666.
[http://dx.doi.org/10.1016/j.cie.2021.107666] [PMID: 34511707]
[5]
Lopez-Rincon A, Tonda A, Mendoza-Maldonado L, et al. Classification and specific primer design for accurate detection of SARS-CoV-2 using deep learning. Scientific Reports 2020; 11: 947.
[http://dx.doi.org/10.1101/2020.03.13.990242]
[6]
Lopez-Rincon A, Tonda A, Mendoza-Maldonado L, et al. Classification and specific primer design for accurate detection of SARS-CoV-2 using deep learning. Sci Rep 2021; 11(1): 947.
[http://dx.doi.org/10.1038/s41598-020-80363-5] [PMID: 33441822]
[7]
Whata A, Chimedza C. Deep Learning for SARS COV-2 Genome Sequences. IEEE Access 2021; 9: 59597-611.
[http://dx.doi.org/10.1109/ACCESS.2021.3073728] [PMID: 34812391]
[8]
Liu J. SARS-Cov-2 RNA sequence classification based on territory information. arXiv:210103323 2021.
[9]
Akkaya UM, Kalkan H. Classification of DNA sequences with kmers based vector representations. 2021 Innovations in Intelligent Systems and Applications Conference (ASYU). 06-08 October 2021; Elazig, Turkey. 2021.
[http://dx.doi.org/10.1109/ASYU52992.2021.9599084]
[10]
Tasdelen A, Sen B. A hybrid CNN-LSTM model for pre-miRNA classification. Sci Rep 2021; 11(1): 14125.
[http://dx.doi.org/10.1038/s41598-021-93656-0] [PMID: 34239004]
[11]
Soliman N, Abdelhaleem S, El-Shafai W, et al. Hybrid approach for taxonomic classification based on deep learning. Intelligent Automation and Soft Computing 2021; 32: 1881-91.
[http://dx.doi.org/10.32604/iasc.2022.017683]
[12]
Iuchi H, Matsutani T, Yamada K, et al. Representation learning applications in biological sequence analysis. Comput Struct Biotechnol J 2021; 19: 3198-208.
[http://dx.doi.org/10.1016/j.csbj.2021.05.039] [PMID: 34141139]
[13]
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 27-30 June 2016; Las Vegas, NV, USA. 2016.
[http://dx.doi.org/10.1109/CVPR.2016.90]
[14]
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput 1997; 9(8): 1735-80.
[http://dx.doi.org/10.1162/neco.1997.9.8.1735] [PMID: 9377276]
[15]
Woo S, Park J, Lee JY. Cbam: Convolutional block attention module. Proceedings of the European conference on computer vision (ECCV). 3-19.
[16]
Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. arXiv:170603762 2017.

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy