Sequence of DPV Tomato yellow leaf curl Thailand virus

Tomato yellow leaf curl Thailand virus isolate PY2-7 segment DNA-A, complete sequence.

ACC No: GU723746

Dated: 2010-04-01 | Length: 2744 | CRC: 43420770

                
ID   GU723746; SV 1; circular; genomic DNA; STD; VRL; 2744 BP.
XX
AC   GU723746;
XX
DT   01-APR-2010 (Rel. 104, Created)
DT   01-APR-2010 (Rel. 104, Last updated, Version 1)
XX
DE   Tomato yellow leaf curl Thailand virus isolate PY2-7 segment DNA-A,
DE   complete sequence.
XX
KW   .
XX
OS   Tomato yellow leaf curl Thailand virus
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2744
RA   Tsai W.S., Shih S.L., Green S.K., Kenyon L., Jan F.-J.;
RT   "Molecular diversity and pathogenicity of tomato-infecting begomoviruses in
RT   Taiwan";
RL   Unpublished.
XX
RN   [2]
RP   1-2744
RA   Tsai W.S., Shih S.L., Green S.K., Kenyon L., Jan F.-J.;
RT   ;
RL   Submitted (08-FEB-2010) to the EMBL/GenBank/DDBJ databases.
RL   Virology Unit, AVRDC-The World Vegetable Center, PO Box 42, Shanhua,
RL   Tainan, Taiwan 74199, ROC
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2744
FT                   /organism="Tomato yellow leaf curl Thailand virus"
FT                   /segment="DNA-A"
FT                   /host="tomato"
FT                   /isolate="PY2-7"
FT                   /mol_type="genomic DNA"
FT                   /country="Taiwan:Yanpu, Pingtung"
FT                   /collection_date="2007"
FT                   /db_xref="taxon:85752"
FT   CDS             87. .473
FT                   /codon_start=1
FT                   /product="V2 protein"
FT                   /protein_id="ADE09267.1"
FT                   /translation="MVPYLRLSHQVSAKFKMWDPLLNEFPENVHGFRCMLAVKYLQAVE
FT                   KTYSPDTLGFDLIRDLIGVIRAKNYVEASSRYSHFHARLESTSPSELRQPIQQPCCCPH
FT                   CPRHKRADMEEPTCIQKAQVLQNV"
FT   CDS             295. .1065
FT                   /codon_start=1
FT                   /product="V1 protein"
FT                   /note="coat protein"
FT                   /protein_id="ADE09264.1"
FT                   /translation="MSKRPADILISTPVSKVRRRLNFDSPYNSRAAVPTVRVTKGQIWK
FT                   NRPAYRKPRFYRMYRSPDVPKGCEGPCKVQSFDAKNDIGHMGKVICLSDVTRGMGLTHR
FT                   VGKRFCVKSLYFVGKIWMDENIKVKNHTNTVLFWIVRDRRPTGTPNDFQQVFNVYDNEP
FT                   STATVKNDQRDRFQVIRRFQATVTGGQYAAKEQAIIRKFYRVNNYVVYNHQEAGKYENH
FT                   TENALLLYMACTHASNPVYATLKVRSYFYDSVTN"
FT   CDS             complement(1062. .1466)
FT                   /codon_start=1
FT                   /product="C3 protein"
FT                   /protein_id="ADE09266.1"
FT                   /translation="MDLRTGELLTATQLESGVYIWTVKNPLYFKITKHLESPFQRNHDI
FT                   ITLQIQFNHNLRKALGIHKCFLVCKIWTHLHPQTSRFLTVFKYQCIKYLDRLGVISINN
FT                   VIRAMSHVLYNVLEGTIDVIEEHDIKFNIY"
FT   CDS             complement(1207. .1611)
FT                   /codon_start=1
FT                   /product="C2 protein"
FT                   /protein_id="ADE09265.1"
FT                   /translation="MRSSSPSKAHSTQVPIKVQHRIAKRATRRRRVDLPCGCSYFVAIG
FT                   CHNNGFTHRGTTHCNSIREWRVYLDGQKSPIFQDNQAPREPIPEEPRHNHVTNPVQPQP
FT                   EESVGDTQMFSSLQNLDSFTSSDLAFLNSI"
FT   CDS             complement(1514. .2599)
FT                   /codon_start=1
FT                   /product="C1 protein"
FT                   /note="replication-associated protein"
FT                   /protein_id="ADE09263.1"
FT                   /translation="MAPPNKFRINAKNYFLTYPHCSLTKEEALSQIQALETPTNKLFIR
FT                   ICRELHEDGTPHLHVLIQFEGKFQCKNQRFFDLTSPTRSAHFHPNIQGAKSSTDVKTYM
FT                   EKDGDVLDHGIFQIDGRSARGGCQSANDAYAEAINSGSKASALTILREKAPKDFVLQFH
FT                   NLNSNLDRIFTPPMEEYIFPFSSSSFNQVPEELKEWACTNVLSAAARPLRPIGIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYNNNVWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPVQIKGGIPTIFLCNPGPNSSYKEYLEEEKNSALRNWAIRNAIFVTLKSPL
FT                   YSGSNQGATPNSQEGNQTTES"
FT   CDS             complement(2149. .2448)
FT                   /codon_start=1
FT                   /product="C4 protein"
FT                   /protein_id="ADE09268.1"
FT                   /translation="MKMGLLTCMSSSNSKENFNVKIKDSSISHPQPGQHISIRTFRELK
FT                   AQQMLKHTWKKTETCLIMEFSRSMEDRLEEVANLPTTHMPRQSIQGPKLRPSLY"
XX
SQ   Sequence 2744 BP; 734 A; 539 C; 607 G; 864 T; 0 other;

gu723746 Length: 2744  01-APR-2010  Type: N  Check: 4429  ..

       1  accggatggc cgcgaatttt tttgaagtgg tccccttgat gtgatgtttc
      51  atccaattaa aacgctcggc gaaagcttaa ttatttatgg tcccctattt
     101  aagacttagt caccaagttt cggcgaaatt caaaatgtgg gatccactcc
     151  taaacgaatt tccggaaaac gtccacggtt tccgttgtat gttagcggtt
     201  aagtatctgc aagcggtcga gaagacgtat tcacctgata ccctagggtt
     251  tgatctcatc cgtgatctca tcggtgtaat tcgtgcgaag aactatgtcg
     301  aagcgtccag cagatattct catttccacg cccgtctcga aagtacgtcg
     351  ccgtctgaac ttcgacagcc catacaacag ccgtgctgct gtccccactg
     401  tccgcgtcac aaaagggcag atatggaaga accgacctgc atacagaaag
     451  cccaggttct acagaatgta tagaagtcct gatgtcccta agggatgtga
     501  gggtccatgt aaagtgcaat ctttcgatgc gaagaacgac attggtcata
     551  tgggcaaggt aatctgtctg tctgacgtta cccgtggtat ggggcttact
     601  catcgagttg gcaagcgttt ctgtgtcaag tcactttatt ttgtcgggaa
     651  gatctggatg gatgaaaata ttaaggttaa gaatcacact aataccgttt
     701  tattttggat agttagggat cggcgtccta ctggaacgcc taatgatttt
     751  cagcaggtct ttaatgtata tgataatgaa cccagcactg ctactgtaaa
     801  gaacgaccag cgtgatcgtt tccaggttat aaggaggttt caggcaacgg
     851  tgactggtgg acaatatgca gctaaggagc aggcgattat tagaaagttt
     901  tatcgtgtta ataattatgt agtttacaat caccaggaag ctgggaagta
     951  cgagaaccat actgaaaatg ctttgttgtt gtatatggca tgtactcatg
    1001  cctctaatcc tgtgtatgct actttgaaag tcagaagtta tttctatgac
    1051  tcagtgacga attaataaat attaaatttt atatcgtgtt cttcaattac
    1101  atcaattgtt ccttctaata cattgtacag tacatgagac attgccctaa
    1151  ttacattatt tatactaatc acgcctaatc tatctaaata tttaatacat
    1201  tgatatttaa atactgttaa gaaacgcgag gtctgaggat gtaaatgagt
    1251  ccagattttg cagactagaa aacatttgtg tatccccaac gctttcctca
    1301  ggttgtggtt gaactggatt tgtaacgtga ttatgtcgtg gttcctctgg
    1351  aatgggctct ctaggtgctt ggttatcttg aaatataggg gatttttgac
    1401  cgtccagata tacacgccac tctctaattg agttgcagtg agtagttccc
    1451  cggtgcgtaa atccattatt gtgacatcct attgcgacga agtacgaaca
    1501  tccacaaggt agatcaactc tccgtcgtct ggttgccctc ttggctattc
    1551  ggtgttgcac cttgattgga acctgagtag agtgggcttt tgagggtgac
    1601  gaagatcgca tttcttatag cccagtttct aagtgcggag ttcttttctt
    1651  cttccaagta ctctttataa ctggagttgg gtccaggatt gcagagaaag
    1701  atagtgggaa ttccgccttt aatttgaact ggctttccgt actttgtgtt
    1751  tgattgccag tccctttggg cccccatgaa ttctttaaag tgttttagat
    1801  agtgcggatc gacgtcatcg atgacgttgt accacacatt attattgtac
    1851  acttttggac ttaaatctaa atggccacac agataattat gtggtcccaa
    1901  tgacctagcc cacatcgtct tccccgttct gctatcaccc tcaattacta
    1951  ttccaatggg tctcaatggc cgcgcagcgg cactgagaac attagtacaa
    2001  gcccattctt taagttcttc tggaacttga ttaaaagaag aagaagaaaa
    2051  tggaaaaata tattcctcca ttggaggagt aaaaatccta tctaaattag
    2101  aatttaaatt atgaaattgc aaaacaaaat ctttaggggc tttttccctc
    2151  agtatagtga gggccgaagc tttggaccct gaattgattg cctcggcata
    2201  tgcgtcgttg gcagattggc aacctcctct agccgatctt ccatcgatct
    2251  ggaaaattcc atgatcaagc acgtctccgt ctttttccat gtatgtttta
    2301  acatctgttg agcttttagc tccctgaatg ttcggatgga aatgtgctga
    2351  cctggttggg gatgtgagat cgaagaatct ttgattttta cattgaaatt
    2401  ttccttcgaa ttggatgagg acatgcaggt gaggagtccc atcttcatgg
    2451  agttccctgc agattctgat gaataattta ttagttggtg tttctagtgc
    2501  ttgaatttgg gaaagtgctt cctctttagt gagagaacaa tgtgggtatg
    2551  tcaggaaata gttcttggca tttattctga atttattagg aggagccatt
    2601  gactggtcaa tcggtgtctc tcaaacttgg ctatgcaatc ggtgtctggg
    2651  gtcttattta tacctggaca ccaaatggca taattgtaat ttagtgaatg
    2701  tgatttaaaa ttcaaaatcc aaaagcggcc atccgtataa tatt