Sequence of DPV East African cassava mosaic Kenya virus

East African cassava mosaic Kenya virus isolate Comoros:Grande-Comore:GC27AG1:2009 segment DNA-A, complete sequence.

ACC No: JF909101

Dated: 2012-12-05 | Length: 2797 | CRC: 1988960778

                
ID   JF909101; SV 1; circular; genomic DNA; STD; VRL; 2797 BP.
XX
AC   JF909101;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic Kenya virus isolate
DE   Comoros:Grande-Comore:GC27AG1:2009 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic Kenya virus
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2797
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2797
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2797
FT                   /organism="East African cassava mosaic Kenya virus"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC27AG1:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.48 S 43.32 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:393599"
FT   gene            174. .530
FT                   /gene="AV2"
FT   CDS             174. .530
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /protein_id="AEG89940.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCS"
FT   gene            334. .1107
FT                   /gene="AV1"
FT   CDS             334. .1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /protein_id="AEG89939.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVQDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQSLVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104. .1508)
FT                   /gene="AC3"
FT   CDS             complement(1104. .1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /protein_id="AEG89943.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDRRPGNMNHDI
FT                   ITFQIRFNHNIRKALGIHKCFLNFKVWTTLRPPTGLFLKVFRYQVLKYLDMIGVISINT
FT                   VIQAVDHVLYNVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1249. .1656)
FT                   /gene="AC2"
FT   CDS             complement(1249. .1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /protein_id="AEG89942.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQETREHEPRHHHIPDTVQPQ
FT                   HPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1580. .2644)
FT                   /gene="AC1"
FT   CDS             complement(1580. .2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /protein_id="AEG89941.1"
FT                   /translation="MPRAGRFSIKAKNYFLTYPKCSLSKEEALDQLRQLQTPTNKLFIK
FT                   ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISPSRSAHFHPNIQGAKSSSDVKSYL
FT                   DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPLTPYISPFLSSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD
FT                   SRTEKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKHRSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSED"
FT   gene            complement(2197. .2493)
FT                   /gene="AC4"
FT   CDS             complement(2197. .2493)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /protein_id="AEG89944.1"
FT                   /translation="MKMGNLICMPSFSSKASTIVPTNDSSTSYPLPGLPISTQIFRELN
FT                   QAPTSSPIWIRTETPSNGASFRSTDDLLEADNNPPMTLTPRLLTQQISQRLLM"
XX
SQ   Sequence 2797 BP; 726 A; 558 C; 725 G; 788 T; 0 other;

jf909101 Length: 2797  05-DEC-2012  Type: N  Check: 6145  ..

       1  accggatggc cgcgcccgaa aaaagcaggt tgaccccaca agatggccgc
      51  gcccgttaaa gaaagtggtc cccgcgcacc tgtgttggtc ggccagtgat
     101  attcacgcgt gaaagtctag atatttgttg tttgtcttta tagacttcgt
     151  cgcgaagtag tgagcgcgtc aacatgtggg atccattgtt gaacgatttt
     201  cccgaaaccg ttcacggttt ccgttctatg cttgctgtta aatacctgtt
     251  acatctggaa caggaatacg atcgcggtac tgtcggggcg gagtatatac
     301  gtgatttaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg
     351  agatataata atctcaacac ccgtatccaa ggtgcggagg aggctgaact
     401  tcgacagccc atacacgaac cgtgttgttg cccccactgt ccgcgtcacc
     451  agaagcaaaa tatgggccaa caggcccatg tatcggaagc ccaagatgta
     501  cagaatgtat cgaagcccag atgttcctaa gggctgtgaa ggcccatgta
     551  aggttcagtc ctatgaacag agggatgatg tgaagcacac tggtatggtc
     601  cgatgtgtca gtgatgttac tcgtggatca ggcattaccc atagagtcgg
     651  gaagaggttt tgtgtgaagt ccatatatat attgggcaag atttggatgg
     701  atgagaatat caagaagcaa aatcatacga accatgttat gttcttcctt
     751  gtccaagata gaaggcctta tggtcagagt cctcaagatt ttggacaagt
     801  gttcaacatg tttgataatg aacctactac ggcaactgtg aagaatgatc
     851  ttagggaccg atatcaggtg ttacgtaaat tctatacgac tgttgtgggt
     901  ggaccctctg ggatgaagga acaatctctg gttaagaggt tttttaggat
     951  caataatcat gtagtgtata atcatcagga acaggccaag tatgagaatc
    1001  atactgagaa tgcgttgtta ttgtatatgg catgtacaca tgcctcgaat
    1051  ccggtgtacg ctacgctgaa aatacgcatc tatttctatg atgcagtgac
    1101  aaattaataa aggttgaatt ttattgcatg ttgctccgta acttggagtg
    1151  tgttgagtaa tacattgtac agaacatgat caacagcttg aattacagtg
    1201  ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct
    1251  aaatactttt aagaaaagac cagtcggagg ccgtaaggtc gtccagacct
    1301  tgaagttgag aaaacacttg tgaatcccca atgccttccg gatgttgtgg
    1351  ttgaaccgta tctggaatgt gatgatgtcg tggttcatgt tccctggtct
    1401  cctgtcgtgg ttggtgatgt cgaaatagag gggatttgtt atttcccagg
    1451  taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga
    1501  gaatccatga ttgatgcagt cgatatggag atagaacgag cagccgcatt
    1551  cgaggtctac ccgcctacgt ctgacggccc tagtcttcgc tgtgcggtgt
    1601  tggactttga tgggcacttg agaacaatgg ctcgtggagg gtgatgaagg
    1651  tggcattctt taaagcccag gctttaaggg accggtgctt ttcctcgtcc
    1701  agaaactctt tatatgatga tgttggtcct ggattgcata ggaagatagt
    1751  gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt
    1801  gccagtccct ttgggccccc atgaattctt tgaaatgctt gaggtagtgg
    1851  gggtcgacgt catcaatgac gttgtaccat gcgtcgttgc tgtatacctt
    1901  tggactgaga tccaggtgtc cacacaagta gttatgtggt cccaaagagc
    1951  gagcccacat tgtcttctct gtcctactat ctccctcgat gacgatacta
    2001  ctaggtctcc atggccgcgc agcggaaccc atcacgttct cggaaaccca
    2051  ggcttcaagt tcctcaggaa cgttagtgaa agaagaagaa agaaagggag
    2101  aaatataagg agtgagaggc tcttgaaaaa tcctctctaa attgctattt
    2151  aaattatgaa actgtaaaac aaaatctttt ggggctagtt cccgtattac
    2201  attaagagcc tctgacttat ttgctgagtt aagagccttg gcgtaagcgt
    2251  cattggcgga ttgttgtccg cctcgagcag atcgtccgtc gatctgaaac
    2301  tcgccccatt ggatggtgtc tccgtcctta tccagatagg acttgacgtc
    2351  ggagcttgat ttagctccct gaatatttgg gtggaaatgg gcagaccggg
    2401  aaggggatat gaggtcgaag aatcgttggt tggtacaatt gtacttgcct
    2451  tcgaactgaa tgagggcatg cagatgaggt tccccatttt catggagctc
    2501  tctgcagatc ttgatgaaca atttatttgt tggggtttgg agttgtcgga
    2551  gctgatctaa ggcctcttct ttcgatagag aacatttggg atatgtgagg
    2601  aaatagtttt tggctttgat gctaaaacga ccagcccttg gcatttgcgc
    2651  tgtcgtatag caatcggggg gcacgcaaag tctgtagcaa tcgggggaat
    2701  ggggggcaat ttatatgatg ccccccaaat ggcatttatg taatatcctc
    2751  atgaaatttg aatttcgaac gtggaaagcg gccatccgta taatatt