Selected article for: "NCBI protein database and protein database"

Author: Mathias Kuhring; Joerg Doellinger; Andreas Nitsche; Thilo Muth; Bernhard Y. Renard
Title: An iterative and automated computational pipeline for untargeted strain-level identification using MS/MS spectra from pathogenic samples
  • Document date: 2019_10_24
  • ID: k7hm3aow_16
    Snippet: For the viral samples we used all viral NCBI RefSeq proteins (314156 entries, via Entrez on July 10, 2017) in the iterative workflow and additionally all viral non-RefSeq proteins (1987275 entries in total, via Entrez on July 11, 2017) for the unique-PSMs-and Pipasic-based strategies. Viral spectra were filtered beforehand using corresponding host proteomes (all isoforms) including the UniProt Homo sapiens reference proteome (93588 entries, UP000.....
    Document: For the viral samples we used all viral NCBI RefSeq proteins (314156 entries, via Entrez on July 10, 2017) in the iterative workflow and additionally all viral non-RefSeq proteins (1987275 entries in total, via Entrez on July 11, 2017) for the unique-PSMs-and Pipasic-based strategies. Viral spectra were filtered beforehand using corresponding host proteomes (all isoforms) including the UniProt Homo sapiens reference proteome (93588 entries, UP000005640, May 23, 2017) for the cowpox sample and Gallus gallus Red jungle fowl reference proteome (29732 entries, UP000000539, May 16, 2017) for the bronchitis sample. For the iterative examination of the bacillus samples, all bacterial NCBI RefSeq proteins (65996005 entries, release 82) were downloaded via FTP (ftp.ncbi.nlm.nih.gov/refseq/release/bacteria/). Since downloading all NCBI Protein entries for bacteria (taxid 2) is impractical and requires too much time using the Entrez API, we used the NCBI Blast NR database as most comprehensive, common and readily available protein resource. This database can be obtained in fasta format via FTP (ftp.ncbi.nlm.nih.gov/blast/db/FASTA/, release 19.07.2017) and the bacteria subset (93760438 entries) was extracted with in-house scripts to create the databases for the unique-PSMs-and Pipasic-based strategies.

    Search related documents:
    Co phrase search for related documents
    • bacillus sample and cowpox sample: 1
    • bacillus sample and NCBI Blast NR database: 1
    • bronchitis sample and cowpox sample: 1, 2
    • Entrez api and iterative workflow: 1
    • Entrez api and NCBI Blast NR database: 1