Author: Valeria Lulla; Andrew E. Firth
Title: A hidden gene in astroviruses encodes a cell-permeabilizing protein involved in virus release Document date: 2019_6_6
ID: avq3zwmc_36
Snippet: Mammalian astrovirus nucleotide sequences were downloaded from the National Center for Biotechnology Information (NCBI) on 26 July 2018. Patent sequence records and sequences with ≥20 ambiguous nucleotide codes (e.g. "N"s) were removed. For the full-genome analyses, only sequences covering all or nearly all of ORF1a, ORF1b and ORF2 were retained, giving 221 sequences (listed in Fig. S1 ). For the ORF2 analyses, only sequences covering all or ne.....
Document: Mammalian astrovirus nucleotide sequences were downloaded from the National Center for Biotechnology Information (NCBI) on 26 July 2018. Patent sequence records and sequences with ≥20 ambiguous nucleotide codes (e.g. "N"s) were removed. For the full-genome analyses, only sequences covering all or nearly all of ORF1a, ORF1b and ORF2 were retained, giving 221 sequences (listed in Fig. S1 ). For the ORF2 analyses, only sequences covering all or nearly all of ORF2 were retained, giving 415 sequences (listed in Supplementary Dataset 1) . To identify the correct 5′ end of ORF1b, we identified the AAAAAAC frameshift site. To identify the correct initiation site of ORF2, we identified the highly conserved sgRNA promoter nucleotides 29 and selected the next ORF2-frame AUG codon as the ORF2 start site in representative reference sequences; for the other sequences, the ORF2 start site was identified by amino acid alignment to one of the reference sequences. ORF1b and ORF2 sequences were extracted, translated to amino acid sequences, aligned with MUSCLE 30 The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/661579 doi: bioRxiv preprint acid rate matrices, with 1,000,000 (221-sequence trees; Fig. S1 and Fig. S2 ) or 5,000,000 (415sequence tree; Fig. S4 ) generations, discarding the first 25% as burn-in (other parameters were left at defaults). Trees were visualized with FigTree (http://tree.bio.ed.ac.uk/software/figtree/).
Search related documents:
Co phrase search for related documents- amino acid alignment and nucleotide sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
- amino acid alignment and reference sequence: 1, 2
- amino acid alignment and start site: 1
- amino acid and astrovirus nucleotide sequence: 1
- amino acid and AUG codon: 1, 2, 3, 4, 5, 6, 7, 8
- amino acid and genome analysis: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75
- amino acid and initiation site: 1, 2, 3, 4
- amino acid and nucleotide code: 1
- amino acid and nucleotide sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77
- amino acid and ORF1b end: 1, 2
- amino acid and patent sequence: 1
- amino acid and rate matrix: 1
- amino acid and reference sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52
- amino acid and representative reference sequence: 1
- amino acid and start site: 1, 2, 3, 4, 5, 6, 7, 8, 9
- astrovirus nucleotide sequence and genome analysis: 1
- astrovirus nucleotide sequence and nucleotide sequence: 1, 2, 3
- astrovirus nucleotide sequence and ORF1b end: 1
- AUG codon and genome analysis: 1
Co phrase search for related documents, hyperlinks ordered by date