Sequence of DPV East African cassava mosaic virus

East African cassava mosaic virus-Kenya isolate Comoros:Grande-Comore:GC32BY1:2010 segment DNA-A, complete sequence.

ACC No: JF909111

Dated: 2012-12-05 | Length: 2801 | CRC: 1356234720

                
ID   JF909111; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909111;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Grande-Comore:GC32BY1:2010 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2801
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC32BY1:2010"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.48 S 43.31 E"
FT                   /collection_date="2010"
FT                   /db_xref="taxon:1229189"
FT   gene            174. .530
FT                   /gene="AV2"
FT   CDS             174. .530
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /protein_id="AEG90000.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQQYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCP"
FT   gene            334. .1107
FT                   /gene="AV1"
FT   CDS             334. .1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /protein_id="AEG89999.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104. .1508)
FT                   /gene="AC3"
FT   CDS             complement(1104. .1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /protein_id="AEG90003.1"
FT                   /translation="MDSRTGELITAPQATNGVFTWEITNPLYFEITNHDKRPGNMHHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFRVWTTLRPQTGRFLKVFRYQVLKYLDMIGVISINT
FT                   VLQAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1249. .1656)
FT                   /gene="AC2"
FT   CDS             complement(1249. .1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /protein_id="AEG90002.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHAPRHHHTPDTIQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1565. .2644)
FT                   /gene="AC1"
FT   CDS             complement(1565. .2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /protein_id="AEG90001.1"
FT                   /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNFIFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGDFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYISPFLSSSFTQVPEDIEVRVSENICTPAARPWRPISIVLECD
FT                   SLTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVYPLYLKHFKDFLGAQRDC
FT                   QIYIKYGKPIQIKGGIPSIFFCNPGSTSTYKEFLEEEKNLSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2254. .2487)
FT                   /gene="AC4"
FT   CDS             complement(2254. .2487)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /protein_id="AEG90004.1"
FT                   /translation="MGCLISMFSSSSKGSSNVPTQDSSISFPHPDPHISIRTFRALNHR
FT                   PMSKLILKREGTFLTMEFSRSMPEVQGGRASI"
XX
SQ   Sequence 2801 BP; 742 A; 553 C; 713 G; 793 T; 0 other;

jf909111 Length: 2801  05-DEC-2012  Type: N  Check: 8621  ..

       1  accggatggc cgcgcccgaa aaagcaggtg gaccccacag tatggccgcg
      51  cccgttaaag aaagtggtcc ccgcgcacgt gttttggtcg gccagtcata
     101  ttcacgcgtg aaagtctaga tatttgttgt tggtctttat agacttcgtc
     151  gcgaagtagt ggatcgcgtc aacatgtggg atccattgtt gaacgatttc
     201  cccgaaaccg ttcacggttt ccgttctatg cttgctgtta aatacctgtt
     251  acatctggaa cagcaatacg atcgcggtac tgtcggggct gagtatatac
     301  gggatctaat aggggtacta cggtgtaaga gttatgtcga agcgaccagg
     351  agatataata atctcaacac ccgtatccaa ggtgcggagg aggctgaact
     401  tcgacagccc atacacgaac cgtgttgttg cccccactgt ccgcgtcacc
     451  agaagcaaaa tatgggccaa caggcccatg tatcggaagc ccaagatgta
     501  cagaatgtat cgaagcccag atgtccctaa gggctgtgaa ggcccatgta
     551  aggttcagtc ctacgaacag agggatgatg ttaagcacac gggtatggtt
     601  cgatgtgtca gtgatgttac gcgtgggtca ggtattactc atagagtcgg
     651  gaagaggttt tgtgttaagt ccatatatat attgggcaag atctggatgg
     701  atgagaatat caagaagcaa aatcatacga accatgtgat gttcttcctt
     751  gttcgagata gaaggcctta tggtccgagt cctcaagatt ttggacaagt
     801  gttcaacatg tttgataatg aacctactac tgcaactgtg aaaaatgatc
     851  ttagggaccg gtatcaggtg ttacgtaaat tctatgcgac tgttgttggt
     901  ggaccctctg ggatgaagga acaagctctg gttaagaggt tttttaggat
     951  caataatcat gtagtgtata atcatcagga acaggccaag tatgagaatc
    1001  atactgagaa tgcgttgtta ttgtatatgg catgtacaca tgcctcgaat
    1051  cctgtgtacg ctacgctgaa aatacgcatc tatttctatg atgcagtgac
    1101  aaattaataa aggttgaatt ttattgcatg ttgctccgta acttggagcg
    1151  tgtttagtaa tacatcgtac agaacatgat caacagcctg aagtacagtg
    1201  ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct
    1251  aaatactttt aagaaacgac cagtctgagg ccgtaaggtc gtccagaccc
    1301  tgaagttgag aaaacacttg tgaatcccca atgccttccg gaggttgtgg
    1351  ttgaatcgta tctggagtgt gatgatgtcg tggtgcatgt tccctggccg
    1401  cttgtcgtgg ttggtgattt cgaaatagag gggatttgtt atttcccagg
    1451  taaaaacgcc attcgttgct tgaggcgcag tgatgagttc ccctgtgcga
    1501  gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatt
    1551  cgaggtctac ccgcctacgt ctgacggccc tggtcttcgc tgtgcggtgt
    1601  tggactttga tgggcactag agaacaatgg ctcgtggagg gtgatgaagg
    1651  tggcattctt taaagcccag gctttaaggg acaggttctt ttcctcttcc
    1701  agaaactctt tatatgttga tgttgatcca ggattgcaga agaagataga
    1751  gggaatgccg cctttaattt gaattggctt cccgtacttt atatagattt
    1801  gacagtctct ttgggccccc aggaaatctt tgaaatgctt tagatagagc
    1851  gggtatacgt cgtcaatgac gttgtaccat gcgtcgtttg aatatacttt
    1901  tggagacaga tccaggtgtc cacatagata attatggggt cccagtgaac
    1951  gagcccacat ggtttttcct gttaggctat cacattcgag aacaatactg
    2001  atcggtctcc atggccgcgc agcgggagtg catatatttt ctgatacccg
    2051  tacttctatg tcttcgggga cttgtgtaaa tgatgatgat aagaacggac
    2101  taatataagt ttggggcgga gcctggaaga ttctatccgc gttagcagat
    2151  atgttatgga actgtaaaaa aaaggacttg ggatcttttt ctttaataat
    2201  ctgaagagct tctgttttag aagaagcatt caacgcgtct gcatatacct
    2251  gagctaaatg ctggccctcc ccccttgcac ttctggcatc gacctggaaa
    2301  attccatcgt caagaaagtc ccctcccttt tcaatataag ctttgacatc
    2351  ggacgatgat ttagcgccct gaatgttcgg atggaaatgt gtggatctgg
    2401  atgtggaaat gagatcgaag aatcttgggt tggtacattg gaacttccct
    2451  tcgaactgga tgagaacatg gagatgaggc accccatcct gatgtagttc
    2501  tcggcaaacc ctaatgaata tgaaattcgt cgggtaagaa agggctttta
    2551  attgggaaag tgcctcttcc tttgttaatg agcatcgggg ataggtaatg
    2601  aaataatttc tggcatttat ttgaaaacga ccggctctcg gcatatttgc
    2651  tgtcgttttg tatcggtgga cactcaaaac tccaggggaa cggtggaatg
    2701  gtggacatta tataggatgt cccccaatgg cattcgtgta aataggtaga
    2751  cttccatttc aaatttgaat gtcgaatatt ggcggccatc cgattaatat
    2801  t