Author: Arslan, Hilal; Arslan, Hasan
Title: A new COVID-19 detection method from human genome sequences using CpG island features and KNN classifier
Cord-id: f2uaxzy2
Document date: 2021_1_9
ID: f2uaxzy2
Document: Several viral epidemics have emerged in the last two decades, including those caused by the severe acute respiratory syndrome coronavirus and the Middle East respiratory syndrome coronavirus. The coronavirus disease 2019 (COVID-19) is a pandemic caused by a novel betacoronavirus called severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). Following the rapid spread of COVID-19, many researchers have quickly turned to investigating its diagnosis and treatment. Distinguishing COVID-19 from other types of coronaviruses is difficult because of their genetic similarity. In this study, we propose a new, efficient COVID-19 detection method based on the K-nearest neighbors (KNN) classifier, using the complete genome sequences of human coronaviruses from the dataset recorded in the 2019 Novel Coronavirus Resource. We also describe two features based on CpG islands that efficiently detect COVID-19 cases. With these features, a genome sequence of approximately 30,000 nucleotides can be represented by only two real numbers. The KNN method is a simple and effective non-parametric technique for solving classification problems; however, its performance depends on the distance measure used. We evaluate 19 distance metrics, grouped into five categories, to improve the performance of the KNN algorithm. Several performance measures are computed to evaluate the proposed method. The proposed method achieves 98.4% precision, 99.2% recall, 98.8% F-measure, and 98.4% accuracy in a few seconds when any L1-type metric is used as the distance measure in the KNN.
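The pipeline described in the abstract (two CpG-based numbers per genome, a KNN vote under an L1 distance, then precision, recall, and F-measure) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which two CpG island features are used, so the sketch assumes the two quantities commonly used to characterize CpG islands, GC content and the observed/expected CpG ratio. The function names `cpg_features`, `manhattan`, `knn_predict`, and `precision_recall_f1` are hypothetical, and only the Python standard library is used.

```python
from collections import Counter


def cpg_features(seq):
    """Map a nucleotide string to two real numbers.

    ASSUMPTION: GC content and observed/expected CpG ratio stand in for
    the paper's two CpG island features, which the abstract does not name.
    """
    seq = seq.upper()
    n = len(seq)
    c = seq.count("C")
    g = seq.count("G")
    cpg = seq.count("CG")  # "CG" cannot overlap itself, so this counts every CpG
    gc_content = (c + g) / n if n else 0.0
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return (gc_content, obs_exp)


def manhattan(a, b):
    """L1 (Manhattan) distance between two equal-length feature tuples."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train, query, k=3):
    """Majority vote among the k training points nearest to `query`.

    `train` is a list of (feature_tuple, label) pairs.
    """
    nearest = sorted(train, key=lambda item: manhattan(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def precision_recall_f1(y_true, y_pred, positive):
    """Precision, recall, and F-measure for the class `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return (precision, recall, f1)


if __name__ == "__main__":
    # Toy, made-up sequences for illustration only; real genomes are
    # roughly 30,000 nucleotides long.
    train = [
        (cpg_features("ATATATCGATAT"), "other"),
        (cpg_features("ATTTATATCGAA"), "other"),
        (cpg_features("GCGCGCATGCGC"), "target"),
        (cpg_features("CGCGGCGCATCG"), "target"),
    ]
    print(knn_predict(train, cpg_features("GCGCATGCGCGC"), k=3))
```

Swapping `manhattan` for another distance function is the only change needed to compare metrics in this sketch, which mirrors how the abstract frames the distance measure as the main tunable component of the KNN.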