Author: Xiong, Jie; Dittmer, D. P.; Marron, J. S.
Title: "Virus hunting" using radial distance weighted discrimination Cord-id: gpnhzlad Document date: 2016_2_9
ID: gpnhzlad
Snippet: Motivated by the challenge of using DNA-seq data to identify viruses in human blood samples, we propose a novel classification algorithm called "Radial Distance Weighted Discrimination" (or Radial DWD). This classifier is designed for binary classification, assuming one class is surrounded by the other class in very diverse radial directions, which is seen to be typical for our virus detection data. This separation of the two classes in multiple radial directions naturally motivates the development
Document: Motivated by the challenge of using DNA-seq data to identify viruses in human blood samples, we propose a novel classification algorithm called "Radial Distance Weighted Discrimination" (or Radial DWD). This classifier is designed for binary classification, assuming one class is surrounded by the other class in very diverse radial directions, which is seen to be typical for our virus detection data. This separation of the two classes in multiple radial directions naturally motivates the development of Radial DWD. While classical machine learning methods such as the Support Vector Machine and linear Distance Weighted Discrimination can sometimes give reasonable answers for a given data set, their generalizability is severely compromised by their linear separating boundary. Radial DWD addresses this challenge by using a spherical separating boundary, which is more appropriate in this particular case. Simulations show that in appropriate radial contexts, this gives much better generalizability than linear methods, and also much better than conventional kernel-based (nonlinear) Support Vector Machines, because the latter essentially use much of the information in the data to determine the shape of the separating boundary. The effectiveness of Radial DWD is demonstrated on real virus detection data.
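To make the idea of a spherical separating boundary concrete, here is a minimal Python sketch. It is not the Radial DWD optimization described in the paper: the center is simply taken as the mean of the inner class and the radius as the midpoint between the two classes' distances to that center, both of which are assumptions made for illustration. The toy data, the function names (`simulate`, `fit_sphere`, `predict_inner`), and all parameter values are likewise hypothetical; the data only mimic the stated setting in which one class is surrounded by the other in many different radial directions.

```python
import numpy as np


def simulate(n, dim, rng, outer_radius=5.0, noise=0.1):
    """Toy data (an assumption, not the paper's simulation design).

    The inner class clusters near the origin. Each outer point is pushed
    away from the origin along one randomly chosen coordinate axis, so the
    outer class surrounds the inner class in many radial directions.
    """
    inner = rng.normal(0.0, noise, size=(n, dim))
    outer = rng.normal(0.0, noise, size=(n, dim))
    axes = rng.integers(0, dim, size=n)
    outer[np.arange(n), axes] += outer_radius
    return inner, outer


def fit_sphere(inner, outer):
    """Heuristic spherical boundary: NOT the Radial DWD objective.

    Center: mean of the inner class.
    Radius: midpoint between the largest inner distance and the
    smallest outer distance to that center.
    """
    center = inner.mean(axis=0)
    d_in = np.linalg.norm(inner - center, axis=1)
    d_out = np.linalg.norm(outer - center, axis=1)
    radius = 0.5 * (d_in.max() + d_out.min())
    return center, radius


def predict_inner(x, center, radius):
    """Return True for points falling inside the sphere."""
    return np.linalg.norm(x - center, axis=1) <= radius


rng = np.random.default_rng(0)
train_inner, train_outer = simulate(200, 50, rng)
center, radius = fit_sphere(train_inner, train_outer)

# Fresh test data drawn from the same toy model.
test_inner, test_outer = simulate(200, 50, rng)
accuracy = 0.5 * (
    predict_inner(test_inner, center, radius).mean()
    + (~predict_inner(test_outer, center, radius)).mean()
)
print(f"radius={radius:.2f}, balanced test accuracy={accuracy:.3f}")
```

The sphere is described by only a center and a radius, so the data are not spent on learning the shape of the boundary. In this toy setup the outer points used for testing mostly lie along the same axes as those used for training, so the sketch illustrates the geometry only and does not reproduce the generalization comparison reported in the paper.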