Author: Katebi, Mohsen; Feijao, Pedro; Booth, Julius; Mansouri, Mehrdad; La, Sean; Sweeten, Alex; Miraskarshahi, Reza; Nguyen, Matthew; Wong, Johnathan; Hsiao, William; Chauve, Cedric; Chindelevitch, Leonid
Title: PathOGiST: A Novel Method for Clustering Pathogen Isolates by Combining Multiple Genotyping Signals Cord-id: 6f2s40pn Document date: 2020_2_1
ID: 6f2s40pn
Snippet: In this paper we study the problem of clustering bacterial isolates into epidemiologically related groups from next-generation sequencing data. Existing methods for this problem mainly use a single genotyping signal, and either use a distance-based method with a pre-specified number of clusters, or a phylogenetic tree-based method with a pre-specified threshold. We propose PathOGiST, an algorithmic framework for clustering bacterial isolates by leveraging multiple genotypic signals and calibrate
Document: In this paper we study the problem of clustering bacterial isolates into epidemiologically related groups from next-generation sequencing data. Existing methods for this problem mainly use a single genotyping signal, and either use a distance-based method with a pre-specified number of clusters, or a phylogenetic tree-based method with a pre-specified threshold. We propose PathOGiST, an algorithmic framework for clustering bacterial isolates by leveraging multiple genotypic signals and calibrated thresholds. PathOGiST uses different genotypic signals, clusters the isolates based on these individual signals with correlation clustering, and combines the clusterings based on the individual signals through consensus clustering. We implemented and tested PathOGiST on three different bacterial pathogens - Escherichia coli, Yersinia pseudotuberculosis, and Mycobacterium tuberculosis - and we conclude by discussing further avenues to explore.
Search related documents:
Co phrase search for related documents, hyperlinks ordered by date