Selected article for: "alignment method and dna sequence"

Author: Gomes, Juliana Carneiro; Masood, Aras Ismael; de S. Silva, Leandro Honorato; Ferreira, Janderson; Júnior, Agostinho A. F.; dos Santos Rocha, Allana Lais; Castro, Letícia; da Silva, Nathália R. C.; Fernandes, Bruno J. T.; dos Santos, Wellington Pinheiro
Title: Optimizing the molecular diagnosis of Covid-19 by combining RT-PCR and a pseudo-convolutional machine learning approach to characterize virus DNA sequences
  • Cord-id: st6s7l0b
  • Document date: 2020_9_28
  • ID: st6s7l0b
    Snippet: The proliferation of the SARS-Cov-2 virus to the whole world caused more than 250,000 deaths worldwide and over 4 million confirmed cases. The severity of Covid-19, the exponential rate at which the virus proliferates, and the rapid exhaustion of the public health resources are critical factors. The RT-PCR with virus DNA identification is still the benchmark Covid-19 diagnosis method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences
    Document: The proliferation of the SARS-Cov-2 virus to the whole world caused more than 250,000 deaths worldwide and over 4 million confirmed cases. The severity of Covid-19, the exponential rate at which the virus proliferates, and the rapid exhaustion of the public health resources are critical factors. The RT-PCR with virus DNA identification is still the benchmark Covid-19 diagnosis method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences with overlap in a pseudo-convolutional approach, and represented by co-occurrence matrices. This technique analyzes the DNA sequences obtained by the RT-PCR method, eliminating sequence alignment. Through the proposed method, it is possible to identify virus sequences from a large database: 347,363 virus DNA sequences from 24 virus families and SARS-Cov-2. Experiments with all 24 virus families and SARS-Cov-2 (multi-class scenario) resulted 0.822222 ± 0.05613 for sensitivity and 0.99974 ± 0.00001 for specificity using Random Forests with 100 trees and 30% overlap. When we compared SARS-Cov-2 with similar-symptoms virus families, we got 0.97059 ± 0.03387 for sensitivity, and 0.99187 ± 0.00046 for specificity with MLP classifier and 30% overlap. In the real test scenario, in which SARS-Cov-2 is compared to Coronaviridae and healthy human DNA sequences, we got 0.98824 ± 001198 for sensitivity and 0.99860 ± 0.00020 for specificity with MLP and 50% overlap. Therefore, the molecular diagnosis of Covid-19 can be optimized by combining RT-PCR and our pseudo-convolutional method to identify SARS-Cov-2 DNA sequences faster with higher specificity and sensitivity.

    Search related documents:
    Co phrase search for related documents
    • accuracy show and low sensitivity: 1, 2, 3
    • accuracy show and low sensitivity rate: 1
    • accuracy show and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • accuracy show and machine learning model: 1, 2
    • accuracy variance and machine learning: 1, 2
    • activation function and acute stage: 1
    • activation function and machine learning: 1, 2, 3, 4, 5, 6
    • activation function and machine learning model: 1
    • actual positive and low sensitivity: 1
    • actual positive and low sensitivity rate: 1
    • actual positive and lower sensitivity: 1
    • actual positive and machine learning: 1, 2, 3
    • acute stage and low sensitivity: 1
    • acute stage and lower sensitivity: 1
    • acute stage and machine learning: 1, 2
    • long exist and low sensitivity: 1
    • long exist and machine learning: 1
    • low sensitivity and lower sensitivity: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
    • low sensitivity and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21