Selected article for: "distance measure and propose distance measure"

Author: Balasubramanian, Karthi; Nagaraj, Nithin
Title: Automatic Identification of SARS Coronavirus using Compression-Complexity Measures
  • Cord-id: ljli6a2z
  • Document date: 2020_3_27
  • ID: ljli6a2z
    Snippet: Finding vaccine or specific antiviral treatment for global pandemic of virus diseases (such as the ongoing COVID-19) requires rapid analysis, annotation and evaluation of metagenomic libraries to enable a quick and efficient screening of nucleotide sequences. Traditional sequence alignment methods are not suitable and there is a need for fast alignment-free techniques for sequence analysis. Information theory and data compression algorithms provide a rich set of mathematical and computational to
    Document: Finding vaccine or specific antiviral treatment for global pandemic of virus diseases (such as the ongoing COVID-19) requires rapid analysis, annotation and evaluation of metagenomic libraries to enable a quick and efficient screening of nucleotide sequences. Traditional sequence alignment methods are not suitable and there is a need for fast alignment-free techniques for sequence analysis. Information theory and data compression algorithms provide a rich set of mathematical and computational tools to capture essential patterns in biological sequences. In 2013, our research group (Nagaraj et al., Eur. Phys. J. Special Topics 222(3-4), 2013) has proposed a novel measure known as Effort-To-Compress (ETC) based on the notion of compression-complexity to capture the information content of sequences. In this study, we propose a compression-complexity based distance measure for automatic identification of SARS coronavirus strains from a set of viruses using only short fragments of nucleotide sequences. We also demonstrate that our proposed method can correctly distinguish SARS-CoV-2 from SARS-CoV-1 viruses by analyzing very short segments of nucleotide sequences. This work could be extended further to enable medical practitioners in automatically identifying and characterizing SARS coronavirus strain in a fast and efficient fashion using short and/or incomplete segments of nucleotide sequences. Potentially, the need for sequence assembly can be circumvented. Note The main ideas and results of this research were first presented at the International Conference on Nonlinear Systems and Dynamics (CNSD-2013) held at Indian Institute of Technology, Indore, December 12, 2013. In this manuscript, we have extended our preliminary analysis to include SARS-CoV-2 virus as well.

    Search related documents:
    Co phrase search for related documents
    • academic research and acute respiratory syndrome: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
    • acid sequence and acute respiratory: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • acid sequence and acute respiratory syndrome: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • acid sequence and lz complexity: 1, 2
    • acid sequence and lz complexity measure: 1
    • acid sequence and lz lempel ziv complexity: 1
    • acute respiratory and lossless compression: 1
    • acute respiratory syndrome and lossless compression: 1
    • lossless compression and lz complexity: 1, 2
    • lossless compression and lz complexity measure: 1
    • lossless compression and lz lempel ziv complexity: 1
    • lz complexity and lz lempel ziv complexity: 1, 2, 3, 4, 5, 6