Sequence of DPV Morogoro virus

Morogoro virus strain 3017/2004 segment S glycoprotein precursor (GPC) and nucleoprotein (NP) genes, complete cds.

ACC No: EU914103

Dated: 2009-07-31 | Length: 3383 | CRC: 878947918

                
ID   EU914103; SV 1; linear; genomic RNA; STD; VRL; 3383 BP.
XX
AC   EU914103;
XX
DT   31-JUL-2009 (Rel. 101, Created)
DT   31-JUL-2009 (Rel. 101, Last updated, Version 1)
XX
DE   Morogoro virus strain 3017/2004 segment S glycoprotein precursor (GPC) and
DE   nucleoprotein (NP) genes, complete cds.
XX
KW   .
XX
OS   Morogoro virus
OC   Viruses; ssRNA negative-strand viruses; Arenaviridae; Arenavirus;
OC   Old world arenaviruses; unclassified Old world arenaviruses.
XX
RN   [1]
RP   1-3383
RA   Gunther S., Leirs H.;
RT   "A Mopeia-related arenavirus in Mastomys natalensis in Morogoro, Tanzania";
RL   Unpublished.
XX
RN   [2]
RP   1-3383
RA   Gunther S., Leirs H.;
RT   ;
RL   Submitted (20-JUL-2008) to the EMBL/GenBank/DDBJ databases.
RL   Virology, Bernhard-Nocht-Institute for Tropical Medicine,
RL   Bernhard-Nocht-Str. 74, Hamburg 20359, Germany
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .3383
FT                   /organism="Morogoro virus"
FT                   /segment="S"
FT                   /host="Mastomys natalensis"
FT                   /strain="3017/2004"
FT                   /mol_type="genomic RNA"
FT                   /country="Tanzania:Morogoro"
FT                   /collection_date="2004"
FT                   /db_xref="taxon:573900"
FT   gene            46. .1515
FT                   /gene="GPC"
FT   CDS             46. .1515
FT                   /codon_start=1
FT                   /gene="GPC"
FT                   /product="glycoprotein precursor"
FT                   /protein_id="ACJ24973.1"
FT                   /translation="MGQIVTFFQEVPHIIEEVMNIVLITLSLLAILKGIYNIMTCGIIG
FT                   LLTFLFLCGRSCSSIYKDNYQFLSLDLDMSGLNATMPLSCSKNNSHHYIQVRNDTGLEL
FT                   TLTNTSLLDHKFCNLSDAHKRNLYDKALMSIVTTFHLNIPNFNQYEVMSCDFNGGKITV
FT                   QYNLSHSSYVDAANHCGTIANGIMDTFRRMYWSNALSPSEYISGTTCIQTAYQYLIIQN
FT                   TTWEDHCVFSRPSPMGFLSLLSQRTKNFYISRRLLGLFTWTLSDSEGNDMPGGYCLTRS
FT                   MLIGMDLKCFGNTAVAKCNQQHDEEFCDMLRLFDFNKQAISRLKSEAQQSLNLITKAVN
FT                   SLINDQLIMKNHLRDLMGIPYCNYTKFWYLNDTRSGATSLPRCWLISNGSYLNETQFSR
FT                   DIEQEANNMLTDMLRKEYEKRQSTTPLGLVDLFVFSTSFYLISVFLHLIKIPTHRHIRG
FT                   KPCPKPHRINHMAICSCGFYKQPGIPTQWKR"
FT   gene            complement(1620. .3332)
FT                   /gene="NP"
FT   CDS             complement(1620. .3332)
FT                   /codon_start=1
FT                   /gene="NP"
FT                   /product="nucleoprotein"
FT                   /protein_id="ACJ24974.1"
FT                   /translation="MSNSKEVKSFLWTQSLRRELSGFCTNVKVQVIKDAQALLHGLDFS
FT                   EVSNVQRLMRKEKRDDSDLKRLRDLNQAVNNLVELKSTQQKNVLRVGTLTPDDLLVLAA
FT                   DLDRLKAKVIRSERPLAAGVYMGNLTAQQLEQRKVLLQMVGMGGGPLGREPPRDGIVRI
FT                   WDVRNPELLNNQFGTMPSLTIACMCKQGQTDLNDVIQSLTDLGLVYTAKYPNMSDLEKL
FT                   TQAHPILGVIEPKKSAINISGYNFSLSAAVKAGACLIDGGNMLETIRVSARNLDGILKA
FT                   TLKVKRSLGMFVSDTPGDRNPYENLLYKLCLSGEGWPYIASRTSILGRAWDNTTVDLSG
FT                   DGTQAPKPAGGNSTRVAQAQGMSAGLTYSQTMELKDCMLQLDPNAKTWVDIEGRAEDPV
FT                   EVAIYQPSNGQYIHFYREPTDIKQFKQDAKHSHGIDIQDLFSVQPGLTSAVIEGLPRNM
FT                   VLTCQGVDDIRKLLDSQGRRDIKLIDVSMQKEEARKYEDSIWDEYKHLCTMHTGIVTQK
FT                   KKRGGKEEVTPHCALMDCLMFEAATVGSSKLTTPRPVLSKDLVFRMSTPKVVL"
XX
SQ   Sequence 3383 BP; 874 A; 839 C; 701 G; 969 T; 0 other;

eu914103 Length: 3383  31-JUL-2009  Type: N  Check: 7417  ..

       1  atttttggtt gcgctttgct ttgttcacaa attagtgatc aagaaatggg
      51  tcagattgtg accttctttc aggaagtgcc acatataata gaggaagtga
     101  tgaatatagt cctcataact ctatcactct tggccattct aaaaggaatc
     151  tacaacatca tgacctgcgg catcattggc ctcctgacat tcctcttttt
     201  gtgtgggaga tcctgctcaa gcatctataa agacaattat cagtttctct
     251  cactggatct ggatatgtca ggactcaatg caacaatgcc tctctcttgc
     301  tcaaagaaca attcacacca ctacatacaa gtgaggaacg atacagggct
     351  agagttaact ctcacaaaca ccagcctctt ggatcacaag ttttgtaacc
     401  tttcagatgc tcacaagagg aacctctatg acaaggcttt aatgtcaatc
     451  gtaacaacat tccacctcaa tatccctaat ttcaatcagt atgaggtcat
     501  gtcttgtgac ttcaatggtg ggaaaatcac tgtccagtac aacctatcgc
     551  actcatccta tgttgatgct gcaaaccact gtggaaccat cgccaatggt
     601  atcatggata cctttaggag gatgtactgg tccaatgcac tgtcaccttc
     651  tgaatacatc tctggaacaa catgcatcca gacagcatat cagtacttga
     701  tcattcagaa cacaacgtgg gaggatcact gtgttttctc caggccatcc
     751  cccatgggat tccttagcct attatctcaa cggacaaaga acttttacat
     801  atcaaggagg ctgttgggct tgttcacttg gactctaagt gattctgaag
     851  gaaatgatat gcctggtggc tactgtctga caagatctat gctaataggg
     901  atggacctga agtgctttgg caacactgct gttgcgaagt gcaatcaaca
     951  acatgatgaa gaattctgtg atatgctcag attatttgat ttcaacaagc
    1001  aagccatctc aaggcttaag tcagaagcac agcagagcct aaacctaata
    1051  acaaaggcag tgaactcatt gatcaatgat cagctgataa tgaaaaatca
    1101  tcttagggat ttgatgggga ttccttattg caattataca aaattttggt
    1151  acttgaacga cacgaggtca ggcgccactt cgctgccaag atgttggttg
    1201  atttctaatg ggtcttactt aaatgaaact caatttagta gggacataga
    1251  gcaggaggcc aacaacatgc tgacagatat gttaaggaaa gaatatgaga
    1301  aaaggcaaag cacaacacca cttggtttag tggacctctt tgtcttctct
    1351  acaagcttct atcttataag tgtatttttg cacctcatta aaatcccaac
    1401  acatcgacat ataagaggca agccctgccc caaaccacac aggattaatc
    1451  acatggcaat ctgctcatgt ggcttttaca aacaacctgg catcccaact
    1501  caatggaaga gatagtccga tggaccctcc gagacgcacc gcccgaggcg
    1551  gtgcgtctcg ggggcccgcc ggcccccgca gcccaccacc ttgcggtggt
    1601  gggctgcggg ggtgtccatt tacaggacaa cctttggtgt gctcatccta
    1651  aagaccagat ccttgctcag cactggcctt ggtgttgtca gcttcgaact
    1701  ccccacagta gcggcctcaa acattagaca gtccatcaag gcacaatggg
    1751  gagtcacttc ctccttgcca cccctctttt tcttctgagt gactatgcct
    1801  gtgtgcatag tgcagaggtg cttgtactca tcccagatgc tgtcttcata
    1851  tttccttgcc tcttctttct gcatagacac atcgatcaat ttgatgtctc
    1901  ttctcccttg tgagtccaac agcttcctta tgtcatcaac accttggcag
    1951  gtgagcacca tgtttctagg tagcccctct ataacagcac ttgtcagccc
    2001  tggctgaaca gagaacaagt cttggatgtc aatcccatga gaatgttttg
    2051  cgtcctgctt gaattgcttg atgtcagtgg gttctctata aaagtgaatg
    2101  tactgcccgt tggatggctg atatatggcc acttcaacag ggtcctcagc
    2151  tctaccctct atatcgaccc aggttttggc atttgggtcc aattgcagca
    2201  tgcagtcctt cagctccatt gtctgagaat aggttaatcc agctgacatg
    2251  ccctgtgcct gtgcaaccct ggtagaattg cctccagcag gcttgggtgc
    2301  ctgtgtcccg tccccactta ggtcaactgt tgtgttgtcc catgccctcc
    2351  cgaggatgga tgttcttgat gcaatgtagg gccaaccctc cccagagaga
    2401  cacaacttgt aaaggagatt ctcataggga ttcctatccc ctggcgtgtc
    2451  tgagacgaac atgcccaaag acctctttac ctttagggta gctttgagga
    2501  ttccatcaag gtttctggct gaaaccctga tggtttcaag catgtttcca
    2551  ccatcaatca aacatgcacc agccttcaca gctgcagaca ggctgaagtt
    2601  gtatccagat atgtttatgg cacttttttt gggttctata actcccagga
    2651  ttgggtgtgc ttgggtcagt ttctccagat cagacatatt cggatatttt
    2701  gctgtgtaca caagaccaag gtctgttaac gactggatga catcatttaa
    2751  gtcagtctga ccttgtttgc acatgcatgc tattgttaga cttggcatcg
    2801  tgccaaactg attgttaagc aattctgggt ttctaacatc ccagattctt
    2851  acaatgccat ctctaggtgg ctccctaccc agtggaccac ctcccattcc
    2901  aaccatttgc aacagaacct ttctttgttc taattgttgt gctgtcagat
    2951  ttcccatgta gacacctgct gccagcggcc tttcacttct gatgaccttg
    3001  gctttgagtc tgtccaagtc ggctgcaaga acaagcaagt catcaggagt
    3051  taatgttcca acccttaaga cattcttttg ctgagtggat ttcaactcaa
    3101  caagattgtt aacagcctga tttagatctc ttaatctttt tagatctgag
    3151  tcatccctct tttctttcct catcaacctc tgcacattgc tgacctctga
    3201  gaaatccagc ccgtgaagaa gggcttgagc atccttgatg acttgcactt
    3251  tcacattcgt gcagaagcct gagagctctc tccttagact ctgagtccaa
    3301  aggaaagatt tcacctcctt tgagttagac atcttcacac cttgttcctc
    3351  agcaacttga tctgcaattg gcgcagtcaa taa