Sequence of DPV Allpahuayo virus
Allpahuayo virus strain CLHP-2472 nucleocapsid protein (N) and glycoprotein precursor (GPC) genes, complete cds.
ACC No: AY012687
Dated: 2001-12-11 | Length: 3343 | CRC: 512546216
ID AY012687; SV 1; linear; genomic RNA; STD; VRL; 3343 BP.
XX
AC AY012687;
XX
DT 02-JAN-2001 (Rel. 66, Created)
DT 11-DEC-2001 (Rel. 70, Last updated, Version 2)
XX
DE Allpahuayo virus strain CLHP-2472 nucleocapsid protein (N) and glycoprotein
DE precursor (GPC) genes, complete cds.
XX
KW .
XX
OS Allpahuayo virus
OC Viruses; ssRNA negative-strand viruses; Arenaviridae; Arenavirus;
OC New world arenaviruses.
XX
RN [1]
RP 1-3343
RX DOI; 10.1006/viro.2000.0803.
RX PUBMED; 11384226.
RA Moncayo A.C., Hice C.L., Watts D.M., Travassos de Rosa A.P.A., Guzman H.,
RA Russell K.L., Calampa C., Gozalo A., Popov V.L., Weaver S.C., Tesh R.B.;
RT "Allpahuayo virus: a newly recognized arenavirus (arenaviridae) from
RT arboreal rice rats (oecomys bicolor and oecomys paricola) in northeastern
RT peru";
RL Virology 284(2):277-286(2001).
XX
RN [2]
RP 1-3343
RA Moncayo A.C., Hice C.L., Watts D.M., Travassos de Rosa A.P.A., Guzman H.,
RA Russell K.L., Calampa C., Gozalo A., Popov V.L., Weaver S.C., Tesh R.B.;
RT ;
RL Submitted (07-NOV-2000) to the EMBL/GenBank/DDBJ databases.
RL Pathology, University of Texas Medical Branch, 301 University Blvd.,
RL Galveston, TX 77555-0609, USA
XX
FH Key Location/Qualifiers
FH
FT source 1..3343
FT /organism="Allpahuayo virus"
FT /specific_host="Oecomys bicolor"
FT /strain="CLHP-2472"
FT /mol_type="genomic RNA"
FT /country="Peru: Northeast"
FT /db_xref="taxon:144752"
FT CDS 33..1718
FT /codon_start=1
FT /gene="N"
FT /product="nucleocapsid protein"
FT /db_xref="GOA:Q9DK04"
FT /db_xref="InterPro:IPR000229"
FT /db_xref="UniProtKB/TrEMBL:Q9DK04"
FT /protein_id="AAG42532.1"
FT /translation="MSSENVPSFRWTQSLRRGLSNWTHAVKGDVLADARAIVSALDFHQ
FT VAQVQRMMRKDKRSEADLTRLRDMNKEVDALMMMRSAQKDNILKVGGLSKDELMELASD
FT LDKLRKKVQRTEGGGQPGVYAGNLTSSQLNQRSEILKMMGMGTGPRGPVGGVVKVWDIK
FT DSSLLVNQFGSMPALTIACMTQQGGEQMNDVVQALTSLGLVYTVKYPNLSDLEKLTEKH
FT PCLKLITQEPAQINISGYNLSLSAAVKADACMIDGGNMLETLQVKPSMFSTLIKTILEV
FT KNREGMFVSPSPGQRNPYENILYKVCLSGDGWPYIGSRSQIKGRAWENTTVDLEGKPSV
FT NHPPVRNGGSPDLKQIPKTKEDEVIRAIEQLDPRGTTWVDIEGPPGDPVELALFQPETG
FT NYLHCYRRPHNENAFKDQSKFSHGLLLKDLADTQPGLISCIIRHLPNNMVLTAQGNDDI
FT IKLLEMHGRRDIKVLDVKLSSDQARLMEDVVWERYNMLCVKHTGLVIKKKKKGAAPGSA
FT NPHCALLDCIMFDATVTGYLRDQKPKRLLPLDTLYRDNANLINL"
FT CDS complement(1789..3312)
FT /codon_start=1
FT /gene="GPC"
FT /product="glycoprotein precursor"
FT /db_xref="GOA:Q9DK03"
FT /db_xref="InterPro:IPR001535"
FT /db_xref="UniProtKB/TrEMBL:Q9DK03"
FT /protein_id="AAG42531.1"
FT /translation="MGQVVTFLQSLPEVINEAINIALIAISIICILKGLVNFWKCGVVQ
FT LAIFLCLAGRKCDGLMIDRRHELSHVELNLTRMFDNLPQSCSKNNTHHYYKGPKGTTWG
FT IELTLTNTSLDSYANMSRIRSLAFGNITNCDKTGEAGHTLKWLLNELHFNVLHVTRHVG
FT ARCRVSEGAGLLIQYNLTIGDHGGEVGRHLIASLAQIIGDNKAAWVGKCDSHCTMDGKC
FT NYTNCEGFTHYNYLIIQNTTWENHCSYSPMSTIRMALNKVAYSSVSRQLLGFFTWDISD
FT SSGAHVPGGYCLEQWAIVWAGIKCFDNAVMAKCNKDHNVEFCDTMRLFDFNQNAIKTLQ
FT LNVENSVNLLKRSINGLISDSLVIRNSLKQLAKIPYCNYTKFWYVNDTITGKHSLPQCW
FT LMRNGSYLNETHFKNEWLWESQNLYNEMLLKEYEDRQGKTPIALTDICFWSLVFFTSTV
FT FLQLVGIPTHRHLVGEGCPKPHRITSNSLCACGYYKIPKRPTRWVRKGK"
XX
SQ Sequence 3343 BP; 966 A; 721 C; 770 G; 886 T; 0 other;
ay012687 Length: 3343 11-DEC-2001 Type: N Check: 7061 ..
1 tttaggataa cgctttgcta acgatctcga acatgagttc tgagaatgta
51 ccctcattcc gctggactca atcccttaga aggggtctgt ccaactggac
101 ccatgctgtg aagggagacg ttcttgcgga tgcgagggcc atagtctctg
151 cgcttgactt ccaccaagtt gcacaagtgc aaagaatgat gaggaaggat
201 aaaaggagcg aggcggatct gaccagattg agagacatga ataaagaggt
251 tgacgctctt atgatgatga ggtctgcaca aaaagacaat atcttgaaag
301 tgggtggatt gtcaaaagat gagttgatgg agctggcttc tgacctagac
351 aaactgagga agaaggtcca gaggacagaa gggggtggtc aaccaggtgt
401 atatgcaggg aacctaactt catctcagct gaatcagaga tcagaaatcc
451 tgaagatgat ggggatgggt acaggtccaa gaggcccagt tggaggtgtt
501 gtgaaggttt gggacatcaa ggacagcagc ctccttgtca atcaatttgg
551 ctctatgcca gcactaacca tcgcctgcat gacacagcaa ggaggagaac
601 agatgaatga tgtggttcaa gccctaacat ctctgggact tgtgtacaca
651 gtgaagtatc cgaacttgtc tgacttagag aaattgaccg agaaacatcc
701 ctgtctgaag ctcatcacac aagagcctgc tcagatcaac atttcggggt
751 acaatctaag cttgtctgct gcagtcaagg cggatgcttg tatgattgat
801 ggtggcaaca tgttggaaac tttacaggtt aaaccctcaa tgtttagcac
851 ccttatcaag acaattcttg aagtgaaaaa cagagaaggc atgtttgtga
901 gtccatcccc tgggcagaga aatccctacg aaaacatatt atacaaggtt
951 tgtctgtcag gagatggttg gccgtacatt ggctcaagat cacaaatcaa
1001 aggaagagca tgggagaaca ccactgttga tcttgagggc aaaccatcag
1051 tgaaccaccc tccagtgagg aatggtggtt caccagatct aaaacaaata
1101 cccaagacaa aggaagatga ggtgataaga gccattgaac agcttgaccc
1151 gagaggaaca acctgggtgg atattgaggg tccaccaggt gatcctgttg
1201 aacttgctct atttcaacca gaaactggca attacttgca ttgctacaga
1251 agaccacaca atgaaaacgc attcaaggac cagagcaagt tctctcatgg
1301 cttgttgctc aaagatctgg ctgatacaca gccaggactc atctcgtgca
1351 taataagaca ccttccaaac aacatggtgt tgactgcaca aggtaatgat
1401 gacataatca aactacttga gatgcatgga agaagagaca ttaaggtgtt
1451 ggatgtgaag ttgtcatcag atcaagctcg tctaatggag gatgtggtat
1501 gggaacgtta caacatgctc tgcgtgaaac acactgggct cgtcattaag
1551 aagaagaaga agggtgcagc tccaggatca gccaatcccc attgcgcatt
1601 gttggactgt ataatgtttg atgccactgt cacaggctat cttcgagacc
1651 aaaaaccgaa aagacttctt ccacttgaca ctctatacag agacaatgct
1701 aatctaatta acctttaaaa ccttctaatt gaaggcctcg atgtcacacc
1751 ccctaggggg tgtgacatcg aggcgctggg agagcgaatt acttgccttt
1801 tctgacccat ctagtcggtc tttttggaat tttatagtaa ccgcaggcac
1851 acaatgagtt tgatgtaatc ctatggggtt ttgggcaccc ttcgccgact
1901 aagtgtcgat gtgttgggat gccaacaagc tgcaagaaca ctgtgctggt
1951 gaagaaaacc aaggaccaga agcagatgtc agttaaggca atgggtgtct
2001 ttccctgcct atcctcatat tctttcagca gcatctcatt gtacaaattt
2051 tgactttccc acaaccactc atttttaaag tgtgtctcat ttaagtaaga
2101 gccattgcgc ataagccaac attgaggcaa actatgtttg cctgtgatgg
2151 tgtcattcac ataccagaac ttagtgtaat tacagtaagg aatcttggct
2201 agctgcttca aactattcct aatcaccaaa ctatctgata tcaaaccatt
2251 aatgctcctc ttcagaagat ttacactgtt ctcaacattc agctgaaggg
2301 ttttaatggc gttttggttg aagtcaaaca gtctcattgt gtcacaaaac
2351 tcaacattgt ggtccttgtt gcactttgcc atcacagcat tatcaaaaca
2401 ttttattcct gcccatacta ttgcccattg ttctaaacaa tatccacctg
2451 ggacatgtgc tccgcttgag tccgagatgt cccatgtgaa gaatccaagt
2501 agttgtctcg acacagaact ataagctact ttgttcaaag ccatccgaat
2551 tgttgacatt ggtgagtaag agcaatgatt ctcccaggtt gtattctgaa
2601 tgatgaggta attatagtgt gtgaatccct cacagttggt ataattgcat
2651 ttgccatcca ttgtgcaatg gctgtcacac tttcccaccc aagctgcttt
2701 gttgtcacca atgatctgag caagagaagc aatgagatgt ctgccgactt
2751 cacctccatg atctccaata gtgaggttgt attgtatgag cagcccagca
2801 ccctctgaca ctctgcatct cgctccaaca tgtctggtga catggagcac
2851 attgaagtga agctcattaa gcagccactt gagtgtgtgc cctgcttcac
2901 ctgtcttgtc acagttggtg atgttcccga atgccaaact tctgatcctg
2951 ctcatattag cataactgtc aagagaagtg tttgtaagag tcagttcaat
3001 tccccaggtg gttccttttg ggcctttata atagtggtgt gtgttgtttt
3051 tgctgcaaga ttgagggagg ttgtcaaaca ttctagttag gtttagttcg
3101 acgtgtgaga gctcatgtct tctgtcaatc atcagaccat cacattttct
3151 acctgcaagg caaaggaaaa ttgcgagttg cacaacccca cacttccaaa
3201 agttgaccaa tcctttcaga atacagataa tagaaatggc aatcaaggca
3251 atgttaattg cctcattgat cacctcgggt agagactgta gaaatgtgac
3301 aacttgtccc attttggagt cacacaggcg tgatcaagga aat