Sequence of DPV East African cassava mosaic Zanzibar virus

East African cassava mosaic Zanzibar virus - Kenya [Kilifi] AV2 gene, AV1 gene, AC3 gene, AC2 gene, AC1 gene and AC4 gene

ACC No: AJ516003

Dated: 2005-09-12 | Length: 2784 | CRC: -1533928863

                !!NA_SEQUENCE 1.0
ID   EAF516003  standard; circular genomic DNA; VRL; 2784 BP.
XX
AC   AJ516003;
XX
SV   AJ516003.1
XX
DT   15-NOV-2002 (Rel. 73, Created)
DT   12-SEP-2005 (Rel. 85, Last updated, Version 4)
XX
DE   East African cassava mosaic Zanzibar virus - Kenya [Kilifi] AV2 gene, AV1
DE   gene, AC3 gene, AC2 gene, AC1 gene and AC4 gene
XX
KW   AC1 gene; AC2 gene; AC3 gene; AC4 gene; AV1 gene; AV2 gene; coat protein;
KW   pre coat protein; replication enhancer protein;
KW   replicaton initiation protein; transcriptional activator protein;
KW   transcriptional regulator protein.
XX
OS   East African cassava mosaic Zanzibar virus - Kenya [Kilifi]
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2784
RA   Winter S.;
RT   ;
RL   Submitted (13-NOV-2002) to the EMBL/GenBank/DDBJ databases.
RL   Winter S., Plant Virus Division, DSMZ, Messeweg 11/12, Braunschweig 38104,
RL   GERMANY.
XX
RN   [2]
RA   Were H.K., Winter S., Maiss E.;
RT   "Characterisation of a strain of EACMZV from the coastal region of
RT   KenyaOccurrence and distribution of cassava begomoviruses in Kenya";
RL   Ann. Appl. Biol. 145(10):175-184(2004).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2784
FT                   /country="Kenya:Kilifi"
FT                   /db_xref="taxon:268969"
FT                   /mol_type="genomic DNA"
FT                   /note="collected in May 1999"
FT                   /virion
FT                   /organism="East African cassava mosaic Zanzibar virus -
FT                   Kenya [Kilifi]"
FT                   /isolation_source="cassava plant"
FT   CDS             173. .529
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:Q8B8N2"
FT                   /gene="AV2"
FT                   /product="pre coat protein"
FT                   /protein_id="CAD56699.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLSTRIQGAEEAELRQPIHEPCGCPHCPRHQKQTMGQQAHVP
FT                   EAQDVQNVSKPRCP"
FT   CDS             333. .1106
FT                   /db_xref="GOA:Q8B8N1"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:Q8B8N1"
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /protein_id="CAD56700.1"
FT                   /translation="MSKRPGDIIISAPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKLWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   CDS             complement(1103. .1507)
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:Q8B8N0"
FT                   /gene="AC3"
FT                   /product="replication enhancer protein"
FT                   /protein_id="CAD56701.1"
FT                   /translation="MDSRTGELITAPQATNGVFTWEITNPLYFAITNHDKRPGNMNHDI
FT                   ITIQIRFNHNIRKALGIHKCFLNFKVWTTLRPQTGLFLRVFRSQVLKYLDMIGVISINT
FT                   VLQSVDHVLYDVLLNTLQVTEQHAITFNLY"
FT   CDS             complement(1248. .1655)
FT                   /db_xref="GOA:Q8B8M9"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q8B8M9"
FT                   /gene="AC2"
FT                   /product="transcriptional activator protein"
FT                   /protein_id="CAD56702.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAIRRRRVDLECGCSFYIHI
FT                   NCINHGFSHRGTHHCASSNEWRFYLGNNKSPIFRNHQPRQEAREHEPRHHHNPDTVQPQ
FT                   HPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   CDS             complement(1564. .2643)
FT                   /db_xref="GOA:Q8B8M8"
FT                   /db_xref="HSSP:1L5I"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="UniProtKB/TrEMBL:Q8B8M8"
FT                   /gene="AC1"
FT                   /product="replicaton initiation protein"
FT                   /protein_id="CAD56703.1"
FT                   /translation="MTPPKRFKIQAKNYFLTYPKCSLSKHDALSQILNIPTPTNKKYIK
FT                   VCRELHDDGQPHLHMLIQFEGKFSCTNKRFFDLVSPTRSTHFHPNIQGAKSSSDVKSYI
FT                   DKDGDTTEWGEFQIDARSARGGCHNANDACAEALNSGSKAAALLIIKEKLPKEFIFQYH
FT                   NLSSNLDKIFQEPPAPYVSPFLSYSFTNVPEELEVWVSENVMGSAARPWRPNSIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFVTINEPL
FT                   FSSAHQSPTPHSEDQGHQT"
FT   CDS             complement(2229. .2486)
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:Q776Y2"
FT                   /gene="AC4"
FT                   /product="transcriptional regulator protein"
FT                   /protein_id="CAD56704.1"
FT                   /translation="MGNLISTCLFSSKANSHAQISDSSTWYPQHDQHISIRTFKELNPA
FT                   PTSSPTSTKMGIPLSGANSRSTPDRLEAAATMLMTHVPKH"
XX
SQ   Sequence 2784 BP; 728 A; 546 C; 714 G; 796 T; 0 other;

AJ516003  Length: 2784  September 20, 2005 08:47  Type: N  Check: 8962  ..

       1  accggatggc cgcgcccgaa aaagaaggtg gaccccacgg atggccgcgc
      51  ccatgaaaga aagtggtccc cgcgcacttg tttcggtcag ccagtcatat
     101  tcacgcgtgg aagtctagat atttgttgtt tggctttata gacttcgtcg
     151  cgaagtagtg gagcgcgtca acatgtggga tccattgtta aacgatttcc
     201  ctgaaaccgt tcacggtttc cgttccatgc ttgctgttaa atacctgtta
     251  catcttgaac aggaatacga tcgcggtact gtcggggctg agtatatacg
     301  ggatctaata ggggtgctac ggtgtaagag ttatgtcgaa gcgaccagga
     351  gatataataa tctcagcacc cgtatccaag gtgcggagga ggctgaactt
     401  cgacagccca tacacgaacc gtgtggttgc ccccactgtc cgcgtcacca
     451  gaagcaaact atgggccaac aggcccatgt accggaagcc caagatgtac
     501  agaatgtatc gaagcccaga tgtccctaag ggctgtgaag gcccatgtaa
     551  ggttcagtcg tatgaacaga gggatgatgt taagcacact ggtatggtcc
     601  gatgtgtcag tgatgttact cgtgggtcag gcattaccca tagagtcggg
     651  aagaggttct gtgtgaagtc catatatata ttgggcaaga tttggatgga
     701  tgagaatatc aagaagcaga accatacgaa ccatgttatg tttttcctcg
     751  tgcgagatag aaggccgtat ggtccgagtc cgcaagattt tggacaagtg
     801  ttcaacatgt tcgataatga acctactact gcaactgtga agaatgatct
     851  tagggaccgg tatcaggtgt tacgtaaatt ctatgcgact gttgtgggtg
     901  gaccctctgg gatgaaggaa caagctctgg ttaagaggtt ttttaggatc
     951  aataatcatg tagtgtataa tcatcaggaa caggccaagt atgagaatca
    1001  tactgagaat gcgttgttat tgtatatggc atgtacacat gcctcaaatc
    1051  ctgtgtatgc tactctgaaa atacgcatct atttctatga tgcagtgaca
    1101  aattaataaa ggttgaatgt tattgcatgt tgctccgtaa cttggagcgt
    1151  gtttagtaat acatcgtaca gaacatgatc aacagattga agtacagtgt
    1201  taatggaaat aacgcctatc atatctaaat acttgagcac ttgagatcta
    1251  aatactctta agaaaagacc agtctgaggc cgtaaggtcg tccagacctt
    1301  gaagttgaga aaacacttgt gaatccccaa tgccttccgg atgttgtggt
    1351  tgaaccgtat ctggattgtg atgatgtcgt ggttcatgtt ccctggcctc
    1401  ttgtcgtggt tggtgattgc gaaatatagg ggatttgtta tttcccaggt
    1451  aaaaacgcca ttcgttgctt gaggcgcagt gatgagttcc cctgtgcgag
    1501  aatccatggt tgatgcagtt aatatggata tagaacgagc agccgcattc
    1551  gaggtctacc cgcctacgtc tgatggccct ggtcttcgct gtgcggtgtt
    1601  ggactttgat gggcacttga gaacaatggc tcgttgatgg tgacgaaggt
    1651  ggcattcttt aaagcccagg ctttaaggga ctgattcttt tcctcatcca
    1701  gaaactcttt atatgatgat gttggtcctg gattgcagag gaagatagtg
    1751  ggaatgccgc ctttaatttg aattggcttc ccgtattttg tattgctttg
    1801  ccagtcccgt tgggccccca tgaattcttt gaagtgcttt agatagtggg
    1851  ggtcgacgtc atcaatgacg ttgtaccagg cgtcgttgct gtagaccttt
    1901  ggactgagat ccaggtgtcc acacaaataa ttatgtggtc ccaatgacct
    1951  ggcccacatt gtcttccctg tacgactatc accctcaata acaatactgt
    2001  tgggtctcca aggccgcgca gcggaaccca tcacgttctc ggaaacccag
    2051  acttcaagtt cctcaggaac gttagtaaaa gaataagaca gaaaaggaga
    2101  aacataagga gctggtggct cttgaaaaat cttatctaaa ttactactta
    2151  agttatgata ttgaaaaata aattcttttg ggagtttctc cttaataatt
    2201  agaagtgctg ctgccttgga acctgagttt aatgcttcgg cacatgcgtc
    2251  attagcattg tggcagccgc ctctagccga tctggcgtcg atctggaatt
    2301  cgccccactc agtggtatcc ccatctttgt cgatgtagga cttgacgtcg
    2351  gagctggatt tagctccttg aatgttcgga tggaaatgtg ttgatcgtgt
    2401  tggggatacc aggtcgaaga atcgcttatt tgtgcatgag aatttgcctt
    2451  cgaactgaat aagcatgtgg agatgaggtt gcccatcgtc gtgaagttct
    2501  ctgcacactt tgatgtattt cttgtttgtg ggagttggga tgtttaatat
    2551  ttgggataat gcgtcgtgtt tagatagaga acatttggga tatgtgagaa
    2601  aatagttttt ggcctgtatt ttaaaacgct tggggggagt catttatgcg
    2651  agagcaattg gagacacctc ggtggatgtc tctacggaat tggagacaat
    2701  atatagtgtc tctaaatggc ataatggtaa ttaggaagat ctattttcaa
    2751  aatttgaacc aaaagcggcc atccgtataa tatt