Sequence of DPV East African cassava mosaic virus

East African cassava mosaic virus-Kenya isolate Comoros:Grande-Comore:GC36BI2:2009 segment DNA-A, complete sequence.

ACC No: JF909115

Dated: 2012-12-05 | Length: 2801 | CRC: -97545575

                
ID   JF909115; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909115;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Grande-Comore:GC36BI2:2009 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2801
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC36BI2:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.84 S 43.31 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:1229189"
FT   gene            174..539
FT                   /gene="AV2"
FT   CDS             174..539
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /protein_id="AEG90024.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCSEGL"
FT   gene            334..1107
FT                   /gene="AV1"
FT   CDS             334..1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /protein_id="AEG90023.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104..1508)
FT                   /gene="AC3"
FT   CDS             complement(1104..1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /protein_id="AEG90027.1"
FT                   /translation="MDSRTGELITAPQATNGVFTWEITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGRFLKVFRYQVLKYLDMIGVISINT
FT                   VLQAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1249..1656)
FT                   /gene="AC2"
FT   CDS             complement(1249..1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /protein_id="AEG90026.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHEPRHHHTPDTVQPQ
FT                   PPEGTGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1565..2644)
FT                   /gene="AC1"
FT   CDS             complement(1565..2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /protein_id="AEG90025.1"
FT                   /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPAQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVFDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTILLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2254..2487)
FT                   /gene="AC4"
FT   CDS             complement(2254..2487)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /protein_id="AEG90028.1"
FT                   /translation="MGCLISMFSSSSKGSSNVPTHDSSISFPHPDPHISIRTFRELNHR
FT                   PMSKLILKREGNFLTMEFSRSMPEVQGGRASI"
XX
SQ   Sequence 2801 BP; 733 A; 561 C; 714 G; 793 T; 0 other;

jf909115 Length: 2801  05-DEC-2012  Type: N  Check: 7999  ..

       1  accggatggc cgcgcccgaa aaagcaggtg gaccccacaa gatggccgcg
      51  cccgtgaaag aaagtggtcc ccgcgcactt gtgttggtcg gccagtcata
     101  ttcacgcgtg aaagtctaga tatttgttgt ttgtctttat agacttcgtc
     151  gcgaagtagt ggagcgcgtc aacatgtggg atccattgtt gaacgatttc
     201  cccgaaaccg ttcacggttt ccgttctatg cttgctgtta aatacctgtt
     251  acatctggaa caggaatacg atcgcggtac tgtcggggcg gagtatatac
     301  gtgatttaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg
     351  agatataata atctcaacac ccgtatccaa ggtgcggagg aggctgaact
     401  tcgacagccc atacacgaac cgtgttgttg cccccactgt ccgcgtcacc
     451  agaagcaaaa tatgggccaa caggcccatg tatcggaagc ccaagatgta
     501  cagaatgtat cgaagcccag atgttccgaa gggctgtgaa ggcccatgta
     551  aggttcagtc ctatgaacag agggatgatg tgaagcacac tggtatggtc
     601  cgatgtgtca gtgatgttac tcgtggatca ggcattaccc atagagtcgg
     651  gaagaggttt tgtgtgaagt ccatatatat attgggcaag atttggatgg
     701  atgagaatat caagaagcaa aatcatacga accatgttat gttcttcctt
     751  gttcgagata gaaggcctta tgggcagagt cctcaagatt ttggacaagt
     801  gttcaacatg tttgataatg aacctactac ggcaactgtg aagaatgatc
     851  ttcgggaccg atatcaggtg ttacgtaaat tctatacgac tgttgtaggt
     901  ggaccctctg ggatgaagga acaagctctg gttaagaggt tttttaggat
     951  caataatcat gtagtgtata atcatcagga acaggccaag tatgagaatc
    1001  atactgagaa tgcgttgtta ttgtatatgg catgtacaca tgcctcgaat
    1051  cctgtgtacg ctacgctgaa aatacgcatc tatttctatg atgcagtgac
    1101  aaattaataa aggttgaatt ttattgcatg ttgctccgta acttggagcg
    1151  tgtttagtaa tacatcgtac agaacatgat caacagcctg aagtacagtg
    1201  ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct
    1251  aaatactttt aagaaacgac cagtctgagg ccgtaaggtc gtccagacct
    1301  tgaagttgag aaaacacttg tgaatcccca gtgccttccg gaggttgtgg
    1351  ttgaaccgta tctggagtgt gatgatgtcg tggttcatgt tccctggccg
    1401  cttgtcgtgg ttggtgattt cgaaatagag gggatttgtt atttcccagg
    1451  taaaaacgcc attcgttgct tgaggcgcag tgatgagttc ccctgtgcga
    1501  gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatt
    1551  cgaggtctac ccgcctacgt ctgacggccc tggtcttcgc tgtgcggtgt
    1601  tggactttga tgggcactag agaacaatgg ctcgtggagg gtgatgaagg
    1651  tggcattctt taaagcccag gctttaaggg actggttctt ttcctcttcc
    1701  agaaactctt tatatgatga tgttggtcca ggattgcaga ggaggatagt
    1751  gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt
    1801  gccagtctct ttgggccccc atgaattctt tgaaatgctt taaatagtgc
    1851  gggtctacgt cgtcaaagac gttgtaccat gcgtcgtttg aatatacctt
    1901  tggagacaga tccaggtgtc cacatagata attatggggt cccagtgaac
    1951  gagcccacat ggttttccct gttcggctat caccttcgag aacaatactg
    2001  atcggtctcc atggccgcgc agcgggactg catatatttt ctgagaccca
    2051  tacttctatg tcttcgggga cttgtgtaaa tgatgatgat aagaacggac
    2101  taacataagt ttgggccgga gcctggaaga ttctatccgc gttagcagat
    2151  atgttatgga actgtaaaaa aaaagacttt ggatcttttt ctttaataat
    2201  ctgaagagct tctgttttag aagaagcatt caacgcgtcg gcatatacct
    2251  gagctaaatg ctggccctcc ccccttgcac ttctggcatc gacctggaaa
    2301  attccatcgt caagaaattc ccctcccttt tcaatataag ctttgacatc
    2351  ggacgatgat ttagctccct gaatgttcgg atggaaatgt gtggatctgg
    2401  atgtggaaat gagatcgaag aatcgtgggt tggtacattg gaacttccct
    2451  tcgaactgga tgagaacatg gagatgaggc accccatcct gatgtagttc
    2501  tcggcaaacc ctgatgaatt tgatattcgt cgggtaagaa agggctttta
    2551  attgggaaag tgcctcttcc tttgttaatg agcatcgggg ataggtaatg
    2601  aaataatttc tggcatttat ttgaaaacga ccggctctcg gcatatttgc
    2651  tgtcgttttg tatcggtgga cactcaaaac tccaggggaa cggtggaatg
    2701  gtggacatta tataggatgt cccccaatgg cattcgtgta aataggtaga
    2751  cttccatttc aaatttgaat gacgaatatt ggcggccatc cgattaatat
    2801  t