Sequence of DPV East African cassava mosaic virus

East African cassava mosaic virus-Kenya isolate Comoros:Mayotte:YT23B10:2003 segment DNA-A, complete sequence.

ACC No: JF909173

Dated: 2012-12-05 | Length: 2801 | CRC: 392904848

                
ID   JF909173; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909173;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Mayotte:YT23B10:2003 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2801
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Mayotte:YT23B10:2003"
FT                   /mol_type="genomic DNA"
FT                   /country="Mayotte"
FT                   /lat_lon="12.92 S 45.16 E"
FT                   /collection_date="2003"
FT                   /db_xref="taxon:1229189"
FT   gene            174..530
FT                   /gene="AV2"
FT   CDS             174..530
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /protein_id="AEG90372.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCP"
FT   gene            334..1107
FT                   /gene="AV1"
FT   CDS             334..1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /protein_id="AEG90371.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104..1508)
FT                   /gene="AC3"
FT   CDS             complement(1104..1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /protein_id="AEG90375.1"
FT                   /translation="MDFRTGELITAPQAKNGVFTWEITNPLYFDITNHDTRPGNMNHDI
FT                   ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPQTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VITAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1249..1656)
FT                   /gene="AC2"
FT   CDS             complement(1249..1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /protein_id="AEG90374.1"
FT                   /translation="MPPSSPSMSHCSQVPIKVQHRTAKTRALRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRHEAREHEPRHHHTPDTFQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1565..2644)
FT                   /gene="AC1"
FT   CDS             complement(1565..2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /protein_id="AEG90373.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNQRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTHVPEELEVWVSENICSPAARPWRPVSIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFVTLHEPL
FT                   FSSAHQSPTPHSENQGPPT"
FT   gene            complement(2254..2487)
FT                   /gene="AC4"
FT   CDS             complement(2254..2487)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /protein_id="AEG90376.1"
FT                   /translation="MGCLISMFSSNSKASSNVQTNDSSISFPHPDQHISIRTFRELNRR
FT                   PMSRLTLKREGNFLTMEFSKSMPEVQGGRASI"
XX
SQ   Sequence 2801 BP; 731 A; 557 C; 721 G; 792 T; 0 other;

jf909173 Length: 2801  05-DEC-2012  Type: N  Check: 471  ..

       1  accggatggc cgcgcccgaa aaagctggtg gaccccactg tatgaccgcg
      51  cccgttaaag aaagtggtcc ccgcgcacgt gggttggtcg gccagtcata
     101  ttcacgcgtg aaggtctaga tatttgttgt ttgtctttat agacttcgtc
     151  acgaagtagt cgagcgcgtc aacatgtggg atccattgtt gaatgatttt
     201  cccgaaaccg ttcacggttt ccgttctatg cttgctgtta aatacctgtt
     251  acatctggaa caggaatacg atcgcggtac tgtcggggct gagtatatac
     301  gggatctaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg
     351  agatataata atctcaacac ccgtatccaa ggtgcggagg aggctgaact
     401  tcgacagccc atacacgaac cgtgttgttg cccccactgt ccgcgtcacc
     451  agaagcaaaa tatgggccaa caggcccatg tatcggaagc ccaagatgta
     501  cagaatgtat cgaagcccag atgtccctaa gggctgtgaa ggcccatgta
     551  aggttcagtc gtatgaacag agggatgatg ttaagcacac tggtatggtc
     601  cgatgtgtca gtgatgttac tcgtgggtca ggcatcaccc atagagtcgg
     651  gaagaggttt tgtgtgaagt ccatatatat attgggcaag atctggatgg
     701  atgagaatat caagaagcaa aatcatacga accatgttat gttcttcctc
     751  gttcgagata gaaggcctta tggtccgagc ccgcaagatt tcggacaagt
     801  gttcaacatg tttgataatg aacctactac ggcaacggtg aagaatgatc
     851  tgagggatcg gtatcaggtg ttacgaaaat tctatgcgac cgttgttggt
     901  ggaccctccg ggatgaagga acaagcgctg gttaagaggt tttttaggat
     951  caataatcat gtagtgtata atcatcagga acaggccaag tatgagaatc
    1001  atacggagaa tgcgttgtta ttgtatatgg catgtacaca tgcctcaaat
    1051  ccagtgtacg ctactctgaa aatacgcatc tatttctatg atgcagtgac
    1101  aaattaataa aggttgaatt ttattgcatg ttgctccgta acttggagtg
    1151  tgtttagtaa tacatcgtac agaacatgat caacagcggt aattacagtg
    1201  ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct
    1251  aaatactctt aagaaaagac cagtctgagg ccgtaaggtc gtccagacct
    1301  tgaagttgag aaaacacttg tgaatcgcca atgccttccg gaggttgtgg
    1351  ttgaaacgta tctggagtgt gatgatgtcg tggttcatgt tccctggcct
    1401  cgtgtcgtgg ttggtgatgt cgaaatagag gggatttgtt atttcccagg
    1451  taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga
    1501  aaatccatgg ttgatgcagt cgatatggag atagaacgag cagccacatt
    1551  cgaggtctac ccgcctacgt cggagggccc tggttttcgc tgtgcggtgt
    1601  tggactttga tgggcacttg agaacaatgg ctcatggagg gtgacgaagg
    1651  tggcattctt taaagcccag gctttaaggg actgattctt ttcctcgtcc
    1701  agaaactctt tatatgatga tgttggtcct ggattgcaga ggaagatagt
    1751  gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt
    1801  gccagtccct ttgggccccc atgaattctt tgaagtgttt gagataatgc
    1851  gggtctacgt cgtcaatgac gttgtaccat gcgtcgtttg aatatacctt
    1901  tggagacaga tccaggtgtc cacatagata attatggggt cccagtgaac
    1951  gagcccacat ggttttcccg gttcggctat caccttcgag aacaatactg
    2001  accggtctcc atggccgcgc agcgggactg catatatttt ctgataccca
    2051  tacctctagt tcttcgggaa cgtgtgtaaa tgatgatgat aagaatggac
    2101  taacgtaagt ttgtggcgga gcctggaaga ttctatctgc gttagcagat
    2151  atgttatgga actgtaaaaa aaaggacttt ggatcttttt ctttaataat
    2201  ttgaagagct tctgatttag aagaagcatt caacgcgtct gcatatacct
    2251  gagctaaatg ctggccctcc ccccttgcac ttctggcatc gacttggaaa
    2301  attccatcgt caagaaattc ccctcccttt tcaatgtaag ccttgacatc
    2351  ggacgacgat ttagctccct gaatgttcgg atggaaatgt gttgatctgg
    2401  atggggaaat gagatcgaag aatcgttggt ttgtacattg gaacttgcct
    2451  tcgaattgga tgagaacatg gagatgaggc accccatcct gatgtagttc
    2501  tctgcaaacc ctaatgaatt tgatattcgt cgggtaagaa agggctttta
    2551  attgggaaag ggcctcttcc ttggttaatg agcatcgggg ataggttatg
    2601  aaataatttt tggcatttat ttgaaaacga ccggctcttg gcatatttgc
    2651  tgtcgttttg gatcggggga cactcaaaac tccaggggaa cggtggaatg
    2701  gggggcatta tatatgatgt cccccaatgg catatgtgta aatatgtcga
    2751  cctccattca aattttgaat tgggaatatt ggcggccatc cgattaatat
    2801  t