Sequence of DPV East African cassava mosaic virus
East African cassava mosaic virus-KE2 segment DNA A, complete sequence, isolate EACMV-KE2[K25]
ACC No: AJ717538
Dated: 2006-09-14 | Length: 2801 | CRC: -167809094
ID AJ717538; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC AJ717538;
XX
DT 09-MAR-2006 (Rel. 87, Created)
DT 14-SEP-2006 (Rel. 89, Last updated, Version 3)
XX
DE East African cassava mosaic virus-KE2 segment DNA A, complete sequence,
DE isolate EACMV-KE2[K25]
XX
KW AC1 gene; AC1 protein; AC2 gene; AC2 protein; AC3 gene; AC3 protein;
KW AC4 gene; AC4 protein; AV1 gene; AV1 protein; AV2 gene; AV2 protein.
XX
OS East African cassava mosaic virus-KE2
OC Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN [1]
RP 1-2801
RA Bull S.E.;
RT ;
RL Submitted (18-MAY-2004) to the EMBL/GenBank/DDBJ databases.
RL Bull S.E., Department of Disease & Stress Biology, John Innes Centre,
RL Colney Lane, Norwich, Norfolk, NR4 7UH, UNITED KINGDOM.
XX
RN [2]
RA Bull S.E.;
RT "Diversity of cassava-infecting geminiviruses in Kenya";
RL Thesis (2005), MSc (Research), University of East Anglia, Norwich, UK.
XX
RN [3]
RA Bull S.E., Briddon R.W., Sserubombwe W.S. ., Ngugi K., Markham P.G.,
RA Stanley J.;
RT "Genetic diversity and phylogeography of cassava mosaic viruses in Kenya";
RL J. Gen. Virol. 87:3053-3065(2006).
XX
RN [4]
RA Bull S.E., Briddon R.W., Sserubombwe W.S., Ngugi K., Markham P.G. .,
RA Stanley J.;
RT "Genetic diversity and phylogeography of cassava mosaic viruses in Kenya";
RL J. Gen. Virol. 87(Pt 10):3053-3065(2006).
XX
FH Key Location/Qualifiers
FH
FT source 1. .2801
FT /organism="East African cassava mosaic virus-KE2"
FT /segment="DNA A"
FT /specific_host="Manihot esculenta"
FT /isolate="EACMV-KE2[K25]"
FT /mol_type="genomic DNA"
FT /country="Kenya:Kwale, Misakwakwani"
FT /clone="K25FA-2[2001]"
FT /virion
FT /db_xref="taxon:374778"
FT CDS 174. .539
FT /codon_start=1
FT /gene="AV2"
FT /product="AV2 protein"
FT /db_xref="InterPro:IPR002511"
FT /db_xref="InterPro:IPR005159"
FT /db_xref="UniProtKB/TrEMBL:Q2A8Y0"
FT /protein_id="CAJ78070.1"
FT /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT EAQDVQNVSKPRCPEGL"
FT CDS 334. .1107
FT /codon_start=1
FT /gene="AV1"
FT /product="AV1 protein"
FT /db_xref="GOA:Q2A8X9"
FT /db_xref="InterPro:IPR000143"
FT /db_xref="InterPro:IPR000263"
FT /db_xref="InterPro:IPR000650"
FT /db_xref="UniProtKB/TrEMBL:Q2A8X9"
FT /protein_id="CAJ78071.1"
FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT CDS complement(1104. .1508)
FT /codon_start=1
FT /gene="AC3"
FT /product="AC3 protein"
FT /db_xref="InterPro:IPR000657"
FT /db_xref="UniProtKB/TrEMBL:Q2A8Z6"
FT /protein_id="CAJ78072.1"
FT /translation="MDSRTGELITAPQATNGVFTWEITNPLYFAITNHDKRPGNMNHDI
FT ITIQIRFNHNIRKALGIHKCFLNFKVWTTLRPQTGLFLRVFRSQVLTYLDMIGVISINT
FT VLQSVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT CDS complement(1249. .1656)
FT /codon_start=1
FT /gene="AC2"
FT /product="AC2 protein"
FT /db_xref="GOA:Q2A8Z5"
FT /db_xref="InterPro:IPR000942"
FT /db_xref="UniProtKB/TrEMBL:Q2A8Z5"
FT /protein_id="CAJ78073.1"
FT /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAIRRRRVDLECGCSFYLHI
FT GCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHKPRQEAREHEPRHHHNPDTVQPQ
FT HPEGVGDSQVFSQLQGLDDLTASDWSFLKSI"
FT CDS complement(1565. .2644)
FT /codon_start=1
FT /gene="AC1"
FT /product="AC1 protein"
FT /db_xref="GOA:Q2A8Z4"
FT /db_xref="InterPro:IPR001191"
FT /db_xref="InterPro:IPR001301"
FT /db_xref="UniProtKB/TrEMBL:Q2A8Z4"
FT /protein_id="CAJ78074.1"
FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKAFSYPTNIKFIR
FT VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT NISANADRIFQAPPQTYVSPFLSSSFTNVPEDLEVWVSENVMGSAARPWRPNSIVLEGD
FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFVTLHEPL
FT FSSAHQSPTPHSEDQGHQT"
FT CDS complement(2254. .2487)
FT /codon_start=1
FT /gene="AC4"
FT /product="AC4 protein"
FT /db_xref="InterPro:IPR002488"
FT /db_xref="UniProtKB/TrEMBL:Q2A8Z3"
FT /protein_id="CAJ78075.1"
FT /translation="MGCLISMFSSNSKASSNVPTPDSSISFPHPDQHISIRTFRALNHR
FT PMSRLTLKREGNFLTMEFSKSMPEVPGGRVSI"
XX
SQ Sequence 2801 BP; 741 A; 560 C; 728 G; 772 T; 0 other;
aj717538 Length: 2801 14-SEP-2006 Type: N Check: 1379 ..
1 accggatggc cgcgcccgaa aaagcaggtg gaccccacga catggccgca
51 ctcgtgaaag aaagtggtcc ccacgcactt gtattggtcg gccagtcata
101 ttcacgcgtg gaagtctaga tatttgtggg ttgacgttat atgcttcgtc
151 gcgaagtagt ggagcgcgtc aacatgtggg atccattgtt gaacgatttt
201 cccgaaaccg ttcacggttt ccgttctatg cttgctgtta aatacctgtt
251 acatctggaa caggaatacg atcgcggtac tgtcggggct gagtatatac
301 gggatctaat aggggttcta cggtgtaaga attatgtcga agcgaccagg
351 agatataata atctcaacac ccgtatccaa ggtgcggagg aggctgaact
401 tcgacagccc atacacgaac cgtgttgttg cccccactgt ccgcgtcacc
451 agaagcaaaa tatgggccaa caggcccatg tatcggaagc ccaagatgta
501 cagaatgtat cgaagcccag atgtcccgaa gggctgtgaa ggcccatgta
551 aggttcagtc ttatgaacag agggatgatg tgaagcatac tggtatggtc
601 cgatgtgtga gtgatgttac tcgtgggtca ggcatcaccc atagagttgg
651 gaagaggttt tgtgtgaagt ccatatatat attgggcaag atctggatgg
701 atgagaatat caagaagcaa aatcatacga accatgttat gttcttcctc
751 gttcgagata gaaggcctta tggtccgagc ccacaagatt ttggacaagt
801 gttcaacatg tttgataatg agcctactac ggcaactgtg aagaatgatc
851 ttagggaccg gtatcaggtg ttacgtaaat tctatgcgac tgtggttggt
901 ggaccctctg ggatgaagga acaagctctg gttaagaggt tttttaggat
951 caataatcat gtagtgtata atcatcaaga acaggccaag tatgagaatc
1001 atactgagaa tgcgttgtta ttgtatatgg catgtacaca tgcctcaaat
1051 cctgtgtatg ctacgctgaa aatacgcatc tatttctatg atgcagtgac
1101 aaattaataa aggttgaatt ttattgcatg ttgctccgta acttggagcg
1151 tgtttagtaa tacatcgtac agaacatgat caacagattg aagtacagtg
1201 ttaatggaaa taacgcctat catatctaaa tacgtgagca cttgagatct
1251 aaatactctt aagaaaagac cagtctgagg ccgtaaggtc gtccagacct
1301 tgaagttgag aaaacacttg tgaatcccca acgccttccg gatgttgtgg
1351 ttgaaccgta tctggattgt gatgatgtcg tggttcatgt tccctggcct
1401 cttgtcgtgg tttgtgattg cgaaatagag gggatttgtt atttcccagg
1451 taaaaacgcc attcgttgct tgaggcgcag tgatgagttc ccctgtgcga
1501 gaatccatgg ttgatgcagc cgatatggag atagaacgag cagccgcatt
1551 cgaggtctac ccgcctacgt ctgatggccc tggtcttcgc tgtgcggtgt
1601 tggactttga tgggcacttg agaacaatgg ctcgtggagg gtgacgaagg
1651 tggcattctt taaagcccag gctttaaggg actggttctt ttcctcatcc
1701 agaaactctt tatatgatga tgttggtcct ggattgcaga ggaagatagt
1751 gggaatgccg cctttaattt gaatcggctt cccgtacttt gtattgcttt
1801 gccagtccct ttgggccccc atgaattctt tgaagtgttt gaggtagtgg
1851 gggtcgacgt catcaatgac gttgtaccag gcgtcgttgc tgtagacctt
1901 tggactgaga tccaggtgtc cacataaata attatgtggt cccaatgacc
1951 tggcccacat ggtcttccct gtacgactat caccttctag aacaatactg
2001 ttgggtctcc aaggccgcgc agcggaaccc atcacgttct cggaaaccca
2051 gacttcaaga tcctcaggaa cgttagtaaa agaggatgat aagaacggac
2101 taacgtaagt ttggggcgga gcctggaaga tgcgatctgc gttagcagat
2151 atgttatgga actgtaaaaa aaaggacttg ggatcttttt ctttgataat
2201 ttgaagagct tctgatttag aagaagcatt caacgcgtct gcatatacct
2251 gagctaaatg ctgaccctcc cccctggcac ttcgggcatc gacttggaaa
2301 attccatcgt caagaaattc ccctcccttt tcaatgtaag ccttgacatc
2351 ggacgatgat ttagcgccct gaatgttcgg atggaaatgt gttgatctgg
2401 atggggaaat gagatcgaag aatctggggt tggtacattg gaacttgcct
2451 tcgaattgga tgagaacatg gagatgaggc accccatcct gatgtagttc
2501 tctgcaaacc ctaatgaatt tgatattcgt cggataagaa aacgctttta
2551 attgggaaag ggcctcttcc tttgttaatg agcatcgggg ataggtgatg
2601 aaataatttt tggcatttat ttgaaaacga ccggctcttg gcatatttgc
2651 tgtcgttttg gatcggggga cactcaaaac tccagggaaa cggtggaatg
2701 gggggcatta tataggatgt cccccaatgg catatgtgta aataggtaga
2751 gttccattca aaaattgaat tgcgaatatt ggcggccatc cgattaatat
2801 t