Sequence of DPV Sabia virus

Sabia virus glycoprotein precursor and nucleocapsid protein genes, complete cds.

ACC No: U41071

Dated: 2000-03-04 | Length: 3366 | CRC: -146544977

                !!NA_SEQUENCE 1.0
ID   SVU41071   standard; genomic RNA; VRL; 3366 BP.
XX
AC   U41071;
XX
SV   U41071.1
XX
DT   10-AUG-1996 (Rel. 48, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 2)
XX
DE   Sabia virus glycoprotein precursor and nucleocapsid protein genes, complete
DE   cds.
XX
KW   .
XX
OS   Sabia virus
OC   Viruses; ssRNA negative-strand viruses; Arenaviridae; Arenavirus;
OC   New world arenaviruses.
XX
RN   [1]
RP   1-3366
RX   MEDLINE; 96295431.
RX   PUBMED; 8661442.
RA   Gonzalez J.P., Bowen M.D., Nichol S.T., Rico-Hesse R.;
RT   "Genetic characterization and phylogeny of Sabia virus, an emergent
RT   pathogen in Brazil";
RL   Virology 221(2):318-324(1996).
XX
RN   [2]
RP   1-3366
RA   Gonzalez J.P.J., Bowen M.D., Nichol S.T., Rico-Hesse R.;
RT   ;
RL   Submitted (21-NOV-1995) to the EMBL/GenBank/DDBJ databases.
RL   Michael D. Bowen, Special Pathogens Branch, DVRD/NCID, Centers for Disease
RL   Control and Prevention, 1600 Clifton Rd., N.E., Atlanta, GA 30333, USA
XX
DR   GOA; Q90038.
DR   SPTREMBL; Q90037; Q90037.
DR   SPTREMBL; Q90038; Q90038.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .3366
FT                   /db_xref="taxon:45709"
FT                   /mol_type="genomic RNA"
FT                   /organism="Sabia virus"
FT                   /strain="SPH114202"
FT   CDS             59. .1525
FT                   /codon_start=1
FT                   /db_xref="SPTREMBL:Q90037"
FT                   /note="cleaved to yield G1 and G2 glycoproteins"
FT                   /product="glycoprotein precursor"
FT                   /protein_id="AAC55091.1"
FT                   /translation="MGQLFSFFEEVPNIIHEAINIALIAVSLIAALKGMINLWKSGLFQ
FT                   LIFFLTLAGRSCSFRIGRSTELQNITFDMLKVFEDHPTSCMVNHSTYYVHENKNATWCL
FT                   EVSVTDVTLLMAEHDRQVLNNLSNCVHPAVEHRSRMVGLLEWIFRALKYDFNHDPTPLC
FT                   QKQTSTVNETRVQINITEGFGSHGFEDTILQRLGVLFGSRIAFSNIQDLGKKRFLLIRN
FT                   STWKNQCEMNHVNSMHLMLANAGRSSGSRRPLGIFSWTITDAVGNDMPGGYCLERWMLV
FT                   TSDLKCFGNTALAKCNLDHDSEFCDMLKLFEFNKKAIETLNDNTKNKVNLLTHSINALI
FT                   SDNLLMKNRLKELLNTPYCNYTKFWYVNHTASGEHSLPRCWLVRNNSYLNESEFRNDWI
FT                   IESDHLLSEMLNKEYIDRQGKTPLTLVDICFWSTLFFTTTLFLHLVGFPTHRHIRGEPC
FT                   PLPHRLNSRGGCRCGKYPELKKPITWHKNH"
FT   misc_feature    1528. .1542
FT                   /note="intergenic region hairpin structure 1"
FT   misc_feature    1545. .1573
FT                   /note="intergenic region hairpin structure 2"
FT   misc_feature    1576. .1613
FT                   /note="intergenic region hairpin structure 3"
FT   CDS             complement(1620. .3308)
FT                   /codon_start=1
FT                   /db_xref="GOA:Q90038"
FT                   /db_xref="SPTREMBL:Q90038"
FT                   /product="nucleocapsid protein"
FT                   /protein_id="AAC55092.1"
FT                   /translation="MSNSKEIPSFRWTQSLRRGLSEFTTPVKTDVLRDAKMILDGLDFN
FT                   QVSLVQRILRKSKRNDGDLDKLRDLNKEVDNLMSMKSSQRDTILKLGDLNKSELMDLAS
FT                   DLEKLKRKVGQTERSASGGVYLGNLSQSQLTKRSDLLRKLGFQQQQVRSPGVVRIWDVA
FT                   DPNRLNNQFGSVPALTIACMTKQSDNTMGDVVQALTSLGLLYTVKFPNLIDLEKLTAEH
FT                   DCLQIVTKDESGLNISGYNYSLSAAVKAGATLLDGGNMLETIRITPDNFSQIIKTTLSI
FT                   KKKEGMFVDEKPGNRNPYENLLYKICLSGEGWPYIGSRSQIKGRSWENTTVDLSTKPQQ
FT                   GPRTPEKAGQNIRLSHLTELQESVVREAMGKIDPTLTTWIDIEGTSNDPVELALYQPDT
FT                   GNYILCYRKPHDEKGFKNGSRHSHGMLLKDLESAQPGLLSYVIGLLPQNMVLTTQGSDD
FT                   IRRLVDTHGRKDLKIVDIKLASEQARKFEEPIWSDFGHLCKKHNGVIVPKKKKDKDIPQ
FT                   SSEPHCALLDCLMFQSAIAGQPPQTKLEGLLPDALLFTLEAAFTI"
XX
SQ   Sequence 3366 BP; 907 A; 713 C; 727 G; 1019 T; 0 other;

U41071  Length: 3366  September 9, 2003 10:41  Type: N  Check: 8625  ..

       1  gcgcaccggg gatcctaggc gttttttagt cacgcttaaa tctttgattg
      51  cgtcaatcat gggtcaattg ttcagctttt ttgaagaagt tccgaatatc
     101  atccatgagg ctatcaacat agctctgata gcagtgagct taattgctgc
     151  cttgaaaggg atgattaact tgtggaagag tggccttttc caattgatat
     201  tctttttgac tctagcagga agatcgtgtt cttttagaat tggaaggagc
     251  acagaattgc aaaacataac gtttgatatg ttgaaggtat tcgaggacca
     301  ccccacatct tgcatggtga atcattccac ctactatgtc catgaaaaca
     351  aaaatgccac ttggtgtctt gaggtgtctg tgactgatgt taccctgctc
     401  atggctgaac atgatcgtca agtcctcaac aatctgtcaa actgtgtgca
     451  ccctgcagtt gagcacagaa gcaggatggt tggcttactt gagtggattt
     501  ttagagccct aaagtatgac ttcaatcatg atccaacacc gttgtgtcaa
     551  aagcaaactt caacagtgaa tgaaacacgt gtgcagataa acatcactga
     601  ggggtttggg tctcacgggt ttgaagatac catccttcaa agactagggg
     651  ttctattcgg ttcaagaatt gcattttcaa atatccagga tttaggtaaa
     701  aaaaggtttt tattgattag aaattcaact tggaaaaatc aatgcgaaat
     751  gaatcatgta aactccatgc atttaatgtt ggcgaatgct ggtcgctctt
     801  ctggttctag aagaccactc ggcattttct cctggacaat aactgatgca
     851  gtgggcaatg acatgcctgg tggttattgt cttgaaagat ggatgctagt
     901  gacgtcagat cttaagtgct ttggaaacac agcactagca aaatgtaacc
     951  ttgaccacga ttcggaattc tgtgacatgt tgaaattgtt tgagttcaac
    1001  aaaaaagcga tagagacatt gaatgacaat acaaaaaaca aggtaaactt
    1051  gctgacccac tcaattaatg cattaatatc tgacaactta ctgatgaaga
    1101  atcgacttaa agaattgttg aacactcctt attgtaatta caccaaattt
    1151  tggtatgtca atcacacagc atcaggggaa cactcattgc cacggtgctg
    1201  gcttgttaga aataatagct acttgaatga aagtgaattt agaaatgatt
    1251  ggattattga gagtgatcat ttattgtctg aaatgctcaa taaagaatac
    1301  atagatagac agggaaagac accgttgact ttggtggata tctgtttctg
    1351  gagcactttg tttttcacaa caacactgtt tcttcacctg gtaggctttc
    1401  caactcatag acacatacgt ggtgaaccct gcccactacc ccataggctc
    1451  aacagtagag gaggatgtag atgtgggaaa taccctgaac taaaaaagcc
    1501  aatcacctgg cacaagaacc actagaagga caatcattgt ctcccaccga
    1551  cccgggcaat gcccgggtcg gtgtggcccc ccagtccgcg gcaaatgccg
    1601  cggactgggg agcaccaatc tagatggtga atgctgcctc cagtgtgaag
    1651  agcaatgcat caggcaataa accttccagt ttggtttgag gtggttggcc
    1701  tgctatggct gactgaaaca ttagacaatc aagtagggca cagtgtggct
    1751  ctgaggactg tgggatgtct ttgtctttct ttttctttgg cacaataact
    1801  ccattgtgtt tcttacagag gtgaccaaaa tctgaccaga ttggctcctc
    1851  aaactttctc gcctgttcag atgccaattt aatgtcgaca atctttaagt
    1901  ctttgcgacc gtgtgtatct actaagcgcc ttatatcatc tgaaccttgg
    1951  gtggtgagga ccatgttttg aggaaggagc cctataacat agctgagcaa
    2001  gcctggctgt gcagattcta ggtcctttag caacatccca tgtgaatgcc
    2051  tgctaccatt tttgaacccc ttctcatcat gtggtttcct ataacagagg
    2101  atataattac ctgtgtctgg ttggtacaat gctaattcaa ccggatcatt
    2151  actggtaccc tcaatgtcaa tccatgttgt cagagttggg tcaatcttac
    2201  ccattgcctc tctcacaact gactcttgca actcagtcaa gtgggagagt
    2251  ctaatgttct gacctgcctt ttctggtgtt ctcggccctt gttggggctt
    2301  tgtgcttaaa tcaacagtgg tgttttccca tgacctaccc ttgatctggg
    2351  atctggagcc aatgtaaggc caaccttctc ctgaaaggca gattttgtac
    2401  agaaggtttt cataagggtt tctatttcca ggtttctcat ctacaaacat
    2451  gccttccttt ttctttatgg atagggttgt ctttatgatc tgagaaaagt
    2501  tgtcaggagt gatccttatg gtttccagca tgttaccacc atccagaagc
    2551  gttgcaccag ctttaacagc tgcagaaaga ctatagttat atcctgagat
    2601  gttcaagccg ctctcatctt tagtcactat ttgaagacag tcatgttctg
    2651  ctgtaagttt ttctaggtca atcaggttgg ggaacttaac tgtataaaga
    2701  agtcccaaag atgttagtgc ctgaacaaca tcccccatgg tattgtcact
    2751  ttgtttagtc atacaagcga ttgtcagtgc agggacagat ccaaattgat
    2801  tattcagcct gttcggatca gctacgtccc aaatccttac aacccctgga
    2851  gacctcactt gctgctgttg aaaaccaagt ttccttaaaa gatcagacct
    2901  tttggtgagc tgtgattggg aaaggtttcc caggtacaca cctcctgagg
    2951  ctgatctttc tgtttgtcca acttttcttt tcagtttctc caggtctgat
    3001  gcaagatcca tcagttcaga tttgttgaga tcaccaagtt ttaagattgt
    3051  gtctctttgg gaactcttca tgctcatcag gttgtccact tctttattta
    3101  ggtctctcag tttatcaaga tctccatcat tccttttaga ctttctaagg
    3151  atcctttgaa caagagagac ttgattgaaa tcaagaccat caagtatcat
    3201  tttggcatcc ctcagaacat cggtcttcac cggtgttgtg aactcactga
    3251  gccctcttct cagggattga gtccatctga agctggggat ttcctttgag
    3301  ttgctcattg tggaaaagac ttgctgacca gtaaaggtag acaatttgcc
    3351  taggatccac tgtgcg