Sequence of DPV East African cassava mosaic virus

East African cassava mosaic virus-Kenya isolate Comoros:Moheli:MO29AN3:2009 segment DNA-A, complete sequence.

ACC No: JF909149

Dated: 2012-12-05 | Length: 2801 | CRC: -764988289

                
ID   JF909149; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909149;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate Comoros:Moheli:MO29AN3:2009
DE   segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2801
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Moheli:MO29AN3:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Moheli"
FT                   /lat_lon="12.36 S 43.68 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:1229189"
FT   gene            174. .530
FT                   /gene="AV2"
FT   CDS             174. .530
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /protein_id="AEG90228.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQQYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCP"
FT   gene            334. .1107
FT                   /gene="AV1"
FT   CDS             334. .1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /protein_id="AEG90227.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQKDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104. .1508)
FT                   /gene="AC3"
FT   CDS             complement(1104. .1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /protein_id="AEG90231.1"
FT                   /translation="MDSRTGELITAPQATNGVFTWEITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKIWTTLRPQTGRFLKVFKYQVLKYLDMIGVISINT
FT                   VLQAVDHVMYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1249. .1656)
FT                   /gene="AC2"
FT   CDS             complement(1249. .1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /protein_id="AEG90230.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHEPRHHHTPDTVQPQ
FT                   PPEGIGDSQVFSQLQDLDDLTASDWSFLKSI"
FT   gene            complement(1565. .2644)
FT                   /gene="AC1"
FT   CDS             complement(1565. .2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /protein_id="AEG90229.1"
FT                   /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2254. .2487)
FT                   /gene="AC4"
FT   CDS             complement(2254. .2487)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /protein_id="AEG90232.1"
FT                   /translation="MGCLISMFSSSSKGSSNVPTQDSSISFPHPDPHISIRTFRELNHR
FT                   PMSKLILKREGNFLTMEFSRSMPEVQGGRASI"
XX
SQ   Sequence 2801 BP; 730 A; 556 C; 710 G; 805 T; 0 other;

jf909149 Length: 2801  05-DEC-2012  Type: N  Check: 4934  ..

       1  accggatggc cgcgcccgaa aaagcaggtg gaccccacag gatggccgcg
      51  cccgtgagag aaagtggtcc ccgcgcactt gttttggtcg gccagtcata
     101  ttcacgcgtg aaagtctaga tatttgttgt tggtctttat agacttcgtc
     151  gcgaagtagt ggagcgcgtc aacatgtggg atccattgtt gaacgatttc
     201  cccgaaaccg ttcacggttt ccgttctatg cttgctgtta aatacctgtt
     251  acatctggaa cagcaatacg atcgcggtac tgttggggct gagtatatac
     301  gggatctaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg
     351  agatataata atctcaacac ccgtatccaa ggtgcggagg aggctgaact
     401  tcgacagccc atacacgaac cgtgttgttg cccccactgt ccgcgtcacc
     451  agaagcaaaa tatgggccaa caggcccatg tatcggaagc ccaagatgta
     501  cagaatgtat cgaagcccag atgtccctaa gggctgtgaa ggcccatgta
     551  aggttcagtc ctatgaacag aaggatgatg ttaagcacac tggtatggtt
     601  cgatgtgtca gtgatgttac tcgtgggtca ggtattactc atagagtcgg
     651  gaagaggttt tgtgttaagt ccatatatat actgggcaag atctggatgg
     701  atgagaatat caagaagcaa aatcatacga accatgttat gttcttcctt
     751  gtgcgagata gaaggcctta tggtccgagt cctcaagatt ttggacaagt
     801  gttcaacatg tttgataatg aacctactac ggcaactgtg aaaaatgatc
     851  ttagggaccg gtatcaggtg ttacgtaaat tctatgcgac tgttgttggt
     901  ggaccctctg ggatgaagga acaagctctg gttaagaggt tttttaggat
     951  caataatcat gtagtgtata atcatcagga acaggccaag tatgagaatc
    1001  atacggagaa tgcgttgtta ttgtatatgg catgtacaca tgcctcgaat
    1051  cctgtgtatg ctacgctgaa aatacgcatc tatttctatg atgcagtgac
    1101  aaattaataa aggttgaatt ttattgcatg ttgctccgta acttggagcg
    1151  tgttgagtaa tacatcgtac ataacatgat caacagcctg aagtacagtg
    1201  ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatattt
    1251  aaatactttt aagaaacgac cagtctgagg ccgtaaggtc gtccagatct
    1301  tgaagttgag aaaacacttg tgaatcccca atgccttccg gaggttgtgg
    1351  ttgaaccgta tctggagtgt gatgatgtcg tggttcatgt tccctggccg
    1401  cttgtcgtgg ttggtgattt cgaaatagag gggatttgtt atttcccagg
    1451  taaaaacgcc attcgttgct tgaggcgcag tgatgagttc ccctgtgcga
    1501  gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatt
    1551  cgaggtctac ccgcctacgt ctgacggccc tggtcttcgc tgtgcggtgt
    1601  tggactttga tgggcactag agaacaatgg ctcgtggagg gtgatgaagg
    1651  tggcattctt taaagcccag gctttaaggg attggttctt ttcctcttcc
    1701  agaaactctt tatatgatga tgttggtcca ggattacaga ggaagatagt
    1751  gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt
    1801  gccagtctct ttgggccccc atgaattctt tgaaatgctt tagatagtgc
    1851  gggtctacgt cgtcaatgac gttgtaccat gcgtcgtttg aatatacctt
    1901  tggagacaga tccaggtgtc cacatagata attatggggt cccagtgaac
    1951  gagcccacat ggttttccct gttcggctat caccttcgag aacaatactg
    2001  atcggtctcc atggccgcgc agcgggactg catatatttt ctgataccca
    2051  tacttctatg tcttcgggga cttgtgtaaa tgatgatgat aagaacggac
    2101  taacataagt ttggggcgga gcctggaaga ttctatccgc gttagcagat
    2151  atgttatgga actgtaaaaa aaaggacttt ggatcttttt ctttaataat
    2201  ctgaagagct tctgttttag aagaagcatt caacgcgtct gcatatactt
    2251  gagctaaatg ctggccctcc ccccttgcac ttctggcatc gacctggaaa
    2301  attccatcgt caagaaattc ccctcccttt tcaatataag ctttgacatc
    2351  ggacgatgat ttagctccct gaatgttcgg atggaaatgt gtggatctgg
    2401  atgtggaaat gagatcgaag aatcttgggt tggtacattg gaacttccct
    2451  tcgaactgga tgagaacatg gagatgaggc accccatcct gatgtagttc
    2501  tcggcaaacc ctaatgaatt tgatattcgt cgggtaagaa agggctttta
    2551  attgggaaag tgcctcttcc tttgttaatg agcatcgggg ataggtaatg
    2601  aaataatttc tggcatttat ttgaaaacga ccggctctcg gcatatttgc
    2651  tgtcgttttg aatcggtgga cactcaaaac tccaggggaa cggtggaatg
    2701  gtggacatta tataggatgt cccccaatgg cattcgtgta aataggtaga
    2751  cttccatttc aaatttgaat gtcgaatatt ggcggccatc cgattaatat
    2801  t