Author: Alejandro A Schäffer; Eneida Hatcher; Linda Yankie; Lara Shonkwiler; J Rodney Brister; Ilene Karsch-Mizrachi; Eric P Nawrocki
Title: VADR: validation and annotation of virus sequence submissions to GenBank Document date: 2019_11_22
ID: besvz92f_39
Snippet: To test the joint hypothesis that the VADR operational semantics are more cautious and more rigorous than what was done before its development (pre-2018), we collected a set of test sequences that were last modified no later than December 31, 2017. Specifically, we ran each of the following two Entrez Nucleotide queries on June 24 AND 50:20000[slen] Distinct "complete genome" and "partial" sequence datasets were created to distinguish behavior on.....
Document: To test the joint hypothesis that the VADR operational semantics are more cautious and more rigorous than what was done before its development (pre-2018), we collected a set of test sequences that were last modified no later than December 31, 2017. Specifically, we ran each of the following two Entrez Nucleotide queries on June 24 AND 50:20000[slen] Distinct "complete genome" and "partial" sequence datasets were created to distinguish behavior on likely complete genomes from behavior on likely partial sequences. To separate the likely complete genomes, we used the empirical cumulative distribution functions of how many sequences returned by the above queries have each length, x, and are (not) annotated as "complete genome". The selected thresholds were > 7380nt This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available for use under a CC0 license.
Search related documents:
Co phrase search for related documents- CC0 license and Government work: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
- complete genome and Government work: 1
- complete genome and likely complete genome: 1
- complete genome and partial genome: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
- complete genome and partial sequence: 1, 2, 3, 4, 5, 6
- distribution function and Government work: 1
- likely complete genome and partial genome: 1
Co phrase search for related documents, hyperlinks ordered by date