Sequence of DPV Lassa virus

Nigeria Lassa virus S (small) RNA for glycoprotein precursor (GPC) and nucleocapsid protein

ACC No: X52400

Dated: 1999-02-10 | Length: 3417 | CRC: -1143295472

                !!NA_SEQUENCE 1.0
ID   ARLVGPCN   standard; RNA; VRL; 3417 BP.
XX
AC   X52400; M36544;
XX
SV   X52400.1
XX
DT   18-JUN-1990 (Rel. 24, Created)
DT   10-FEB-1999 (Rel. 58, Last updated, Version 11)
XX
DE   Nigeria Lassa virus S (small) RNA for glycoprotein precursor (GPC) and
DE   nucleocapsid protein
XX
KW   envelope protein; glycoprotein; N protein; nucleocapsid protein.
XX
OS   Lassa virus
OC   Viruses; ssRNA negative-strand viruses; Arenaviridae; Arenavirus;
OC   1-LCMV-LASV complex.
XX
RN   [1]
RP   1-3417
RA   Clegg J.C.S.;
RT   ;
RL   Submitted (02-APR-1990) to the EMBL/GenBank/DDBJ databases.
RL   Clegg J.C.S., PHLS Centre for Applied Microbiology and Research, Porton
RL   Down, Salisbury SP4 0JG, UK.
XX
RN   [2]
RP   1-3417
RA   Clegg J.C.S., Wilson S.M., Oram J.D.;
RT   ;
RL   Virus Res. 18:151-164(1990).
XX
DR   SWISS-PROT; P04935; NCAP_LASSG.
DR   SWISS-PROT; P17332; VGLY_LASSG.
XX
CC   See also  for overlapping sequence.
XX
CC   Data kindly reviewed (02-JUL-1990) by Clegg J.C.S.
XX
CC   Entry M36544 was deleted, because it was a duplication of X52400.
CC   M36544 is now a secondary accession number of X52400.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .3417
FT                   /db_xref="taxon:11620"
FT                   /organism="Lassa virus"
FT                   /strain="GA391"
FT   CDS             71. .1543
FT                   /db_xref="SWISS-PROT:P17332"
FT                   /note="glycoprotein precursor (AA 1-490)"
FT                   /protein_id="CAA36645.1"
FT                   /translation="MGQIVTFFQEVPHVIEEVMNIVLIALSILAILKGLYNVATCGLIG
FT                   LVTFLLLSGRSCSLIYKGTYELQTLELNMETLNMTMPLSCTKNNSHHYIRVGNETGLEL
FT                   TLTNTSILNHKFCNLSDAHKRNLYDHSLMSIISTFHLSIPNFNQYEAMSCDFNGGKITV
FT                   QYNLSHSFAVDAAGHCGTLANGVLQTFMRMAWGGSYIALDSGRGNWDCIMTSYQYLIIQ
FT                   NTTWDDHCQFSRPSPIGYLGLLSQRTRDIYISRRLLGTFTWTLSDSEGNETPGGYCLTR
FT                   WMLIEAELKCFGNTAVAKCNEKHDEEFCDMLRLFDFNKQAIRRLKTEAQMSIQLINKAV
FT                   NALINDQLIMKNHLRDIMGIPYCNYSRYWYLNHTSTGKTSLPRCWLISNGSYLNETKFS
FT                   DDIEQQADNMITEMLQKEYIDRQGKTPLGLVDLFVFSTSFYLISIFLHLVKIPTHRHIV
FT                   GKPCPKPHRLNHMGICSCGLYKQPGVPVRWKR"
FT   CDS             complement(1603. .3315)
FT                   /db_xref="SWISS-PROT:P04935"
FT                   /note="nucleocapsid protein (AA 1-570)"
FT                   /protein_id="CAA36646.1"
FT                   /translation="MSASKEVRSFLWTQSLRRELSGYCSNIKLQVVKDAQALLHGLDFS
FT                   EVSNVQRLMRKQKRDDGDLKRLRDLNQAVNNLVELKSTQQKSVLRVGTLSSDDLLILAA
FT                   DLEKLKSKVTRTERPLSSGVYMGNLSSQQLDQRRALLNMIGMTGVSGGGKGASDGIVRV
FT                   WDVKNAELLNNQFGTMPSLTLACLTKQGQVDLNDAVQALTDLGLIYTAKYPNSSDLDRL
FT                   SQSHPILNMIDTKKSSLNISGYNFSLGAAVKAGACMLDGGNMLETIKVSPQTMDGILKS
FT                   ILKVKKSLGMFVSDTPGERNPYENILYKICLSGDGWPYIASRTSIVGRAWENTVVDLEQ
FT                   DNKPQKIGNGGSNKSLQSAGFAAGLTYSQLMTLKDFKCFNLIPNAKTWMDIEGRPEDPV
FT                   EIALYQPSSGCYVHFFREPTDLKQFKQDAKYSHGIDVTDLFAAQPGLTSAVIEALPRNM
FT                   VITCQGSEDIRKLLESQGRRDIKLIDITLSKADSRKFENAVWDQFKDLCHMHTGVVVEK
FT                   KKRGGKEEITPHCALMDCIMFDAAVSGGLDAKVLRVVLPRDMVFRTSTPKVVL"
FT   misc_feature    1555. .1596
FT                   /note="intergenic hairpin region"
XX
SQ   Sequence 3417 BP; 946 A; 799 C; 707 G; 965 T; 0 other;

X52400  Length: 3417  September 30, 2002 12:40  Type: N  Check: 402  ..

       1  gcaccgggga tcctaggcat ttaggattgc gcattttaaa acctcctttt
      51  tggaaagtgt cgcaatcagg atgggacaga ttgtgacatt cttccaagaa
     101  gttcctcatg ttattgagga agtgatgaat attgtcctta ttgcactatc
     151  catcctagca attctgaagg gactatacaa tgttgccacg tgtggcttga
     201  tagggcttgt cacattcctt ctactttcag gaaggtcatg ctcactgatc
     251  tacaaaggga cttacgagct gcaaaccctt gagttgaata tggagactct
     301  taatatgacc atgccgctat cctgcacaaa gaacaacagt catcattata
     351  taagggtggg gaatgagaca ggacttgagc tcaccttgac caacaccagc
     401  attcttaatc acaaattctg taacctctct gatgcccaca aaaggaatct
     451  ttatgatcac agcctcatga gtatcatctc tacctttcat ctgtccattc
     501  ccaacttcaa tcaatatgag gcaatgagct gcgatttcaa tggggggaaa
     551  atcactgtgc aatacaacct gagtcatagc ttcgcagtgg atgcagcagg
     601  tcactgcggc acacttgcaa atggtgtctt acaaacattt atgagaatgg
     651  cttggggagg gagttatatt gctcttgatt ctggacgcgg taactgggac
     701  tgtataatga ccagttacca atatctaatc attcagaata caacctggga
     751  tgaccactgc caattttcca gaccatcacc tattggctac cttgggcttc
     801  tctcacaaag aactagagac atatacatta gcagaaggtt gttggggaca
     851  ttcacctgga cactatcaga ctcagaggga aatgaaacac cagggggata
     901  ctgccttact agatggatgt tgattgaggc cgaattaaag tgctttggaa
     951  acactgcagt agccaagtgc aatgagaaac atgatgaaga attttgtgac
    1001  atgctaaggt tgttcgattt caacaaacag gccataagga ggctcaaaac
    1051  agaggcccaa atgagcatac agctgatcaa caaggctgtc aatgctttaa
    1101  taaatgatca gctcatcatg aagaaccact tgagagacat catgggcata
    1151  ccatattgta actacagcag atattggtac cttaaccaca catcaacagg
    1201  aaagacctca ctaccaaggt gttggcttat ctcaaatgga tcatatctca
    1251  atgagaccaa gttttcagat gacatcgaac aacaagctga caacatgata
    1301  acagagatgc tacaaaagga atacatagat agacagggca aaactccact
    1351  ggggttagtt gacctatttg tttttagcac aagtttctat ctgataagca
    1401  tctttctcca cctggtcaaa ataccaaccc atagacacat tgtaggtaaa
    1451  ccttgcccaa aaccccacag gctgaaccac atgggcatct gctcctgtgg
    1501  tctatacaaa cagccaggtg tgcctgtcag atggaagagg tgaaatccca
    1551  cagggccccc gtgacccacc gccaattggc ggtgggtcac gggggcgtcc
    1601  atctacagga cgactttagg tgttgaagtt ctgaacacca tgtctctagg
    1651  gagcacaact ctcaggactt ttgcatcaag tcctcctgaa actgctgcat
    1701  caaacataat gcaatccatc agtgcacaat gaggagttat ttcctcttta
    1751  ccacctctct tctttttctc cacaactacc ccagtgtgca tgtgacatag
    1801  atccttgaat tgatcccaaa cagcattctc aaactttctt gaatctgctt
    1851  tactaagagt gatgtcaatc agttttatgt ctctcctccc ttgtgactca
    1901  aggagttttc tgatatcctc tgatccttgg caagtgatga ccatgttccg
    1951  aggaagggct tctatcactg cactggttaa cccaggttgg gcagcaaaca
    2001  aatcagtcac atcaatacca tgtgaatact ttgcatcttg tttgaattgc
    2051  ttcaaatctg ttggctccct aaagaaatgt acatagcaac ccgagctcgg
    2101  ttgataaagg gctatctcaa ctgggtcttc tggtcttcct tcaatatcca
    2151  tccaggtttt tgcgttggga atcaagttga agcacttgaa atctttgaga
    2201  gtcatcaact gagagtaagt taatcctgca gcaaagcctg cagactgtaa
    2251  tgacttgttg gaccccccat ttccaatttt ctggggcttg ttgtcttgct
    2301  caaggtccac cacagtattt tcccatgctc ttcccacaat cgaggtcctt
    2351  gatgcaatat agggccatcc gtctcctgag agacagatct tgtataggat
    2401  gttctcataa gggttccttt cacccggtgt gtctgataca aacattccca
    2451  gactcttctt aactttcaag attgacttca agataccatc catggtctga
    2501  ggtgaaacct taatagtctc taacatgtta ccaccatcaa gcatgcaggc
    2551  ccctgctttg acagcagcac ccaaactgaa attgtaacca gagatgttga
    2601  gtgaactttt cttagtgtca atcatattca gaattggatg actctgagac
    2651  aatctgtcga gatcagatga gttggggtat ttggctgtgt aaatcagccc
    2701  taaatctgtc aaagcttgaa cggcatcatt caggtccact tgcccctgtt
    2751  tggtcaggca tgctaaagtt aggcttggca ttgttccgaa ctgattgttg
    2801  agtaactctg catttttgac atcccaaact ctcacaatgc catcactggc
    2851  accctttccc cctccactta ctccagtcat gccaatcatg ttcaaaaggg
    2901  ctctcctttg atcaagctgt tgtgaactca aattccccat ataaactcct
    2951  gaactcaaag gcctttctgt tctggtgact tttgatttca gtttttctaa
    3001  atcagcggcc aggattagta gatcgtctga acttaaggtt ccaactctta
    3051  agacactttt ctgctgtgtg gatttgagct caacaagatt gttgactgct
    3101  tgattgagat ctctcagtcg ttttaggtcg ccatcatctc ttttctgctt
    3151  gcgcatcaat ctctgaacat tactgacctc ggagaagtca agaccatgaa
    3201  ggagagcttg agcgtcttta actacctgca actttatgtt ggaacagtag
    3251  ccagatagtt cccttcttag ggattgagtc cacaagaatg acctcacttc
    3301  cttggaagca ctcattgtcg tgatggttgt ctgacccttg agtgggtctt
    3351  gaatgtggtc actccaaagg tttgattagt gcaaagcgca atccaatagc
    3401  ctaggatcca ctgtgcg
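
The record above can be checked mechanically. The following is a minimal Python sketch, not part of the database entry, that joins the numbered sequence rows into one string, counts the bases for comparison with the SQ line, and extracts a feature by its 1-based inclusive coordinates. The function names (parse_sequence_rows, base_composition, feature_slice) are hypothetical, and the sketch assumes every sequence row starts with a position number followed by blocks of bases, as in this entry. Only the first two rows are embedded here to keep the example short.

```python
from collections import Counter

# First two sequence rows of the record, preceded by the GCG header line,
# which the parser is expected to skip.
SAMPLE_ROWS = """\
X52400  Length: 3417  September 30, 2002 12:40  Type: N  Check: 402  ..

       1  gcaccgggga tcctaggcat ttaggattgc gcattttaaa acctcctttt
      51  tggaaagtgt cgcaatcagg atgggacaga ttgtgacatt cttccaagaa
""".splitlines()


def parse_sequence_rows(lines):
    """Join numbered sequence rows into one lowercase string.

    A row is assumed to start with an integer position followed by
    whitespace-separated blocks of bases; all other lines are ignored.
    """
    parts = []
    for line in lines:
        fields = line.split()
        if not fields or not fields[0].isdigit():
            continue  # blank line, header, or annotation line
        parts.append("".join(fields[1:]).lower())
    return "".join(parts)


def base_composition(seq):
    """Count each base, for comparison with the totals on the SQ line."""
    return Counter(seq)


def feature_slice(seq, start, end):
    """Return the bases from start to end, 1-based and inclusive."""
    return seq[start - 1:end]


seq = parse_sequence_rows(SAMPLE_ROWS)
composition = base_composition(seq)

# The first CDS is annotated as starting at position 71, so the three
# bases at 71-73 should form the start codon.
start_codon = feature_slice(seq, 71, 73)
print(len(seq), start_codon)
```

Run over all sequence rows of the entry, the same functions should give a length and base counts that can be compared with the figures on the SQ line. Features annotated with complement(...) would additionally need a reverse complement step, which this sketch does not include.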