Sequence of DPV Tomato yellow leaf curl virus

Tomato yellow leaf curl virus - [Tunisia], complete sequence.

ACC No: EF101929

Dated: 2008-01-08 | Length: 2781 | CRC: 1312522104

                ID   EF101929; SV 1; circular; genomic DNA; STD; VRL; 2781 BP.
XX
AC   EF101929;
XX
DT   04-DEC-2006 (Rel. 90, Created)
DT   08-JAN-2008 (Rel. 94, Last updated, Version 2)
XX
DE   Tomato yellow leaf curl virus - [Tunisia], complete sequence.
XX
KW   .
XX
OS   Tomato yellow leaf curl virus - [Tunisia]
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2781
RX   AGRICOLA; IND43899955.
RX   DOI; 10.1111/j.1439-0434.2007.01224.x.
RA   Gharsallah Chouchane S., Gorsane F., Nakhla M.K., Maxwell D.P.,
RA   Marrakchi M., Fakhfakh H.;
RT   "First Report of Tomato Yellow Leaf Curl Virus-Israel Species Infecting
RT   Tomato, Pepper and Bean in Tunisia";
RL   J. Phytopathol. 155(4):236-240(2007).
XX
RN   [2]
RP   1-2781
RA   Gharsallah Chouchane S., Gorsane F., Nakhla M.K., Maxwell D.P.,
RA   Marrakchi M., Fakhfakh H.;
RT   ;
RL   Submitted (26-OCT-2006) to the EMBL/GenBank/DDBJ databases.
RL   Biology, Faculty of Sciences of Tunis, Tunis 2092, Tunisia
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2781
FT                   /organism="Tomato yellow leaf curl virus - [Tunisia]"
FT                   /mol_type="genomic DNA"
FT                   /country="Tunisia:Sahel region"
FT                   /db_xref="taxon:413710"
FT   gene            148. .498
FT                   /gene="V1"
FT   CDS             148. .498
FT                   /codon_start=1
FT                   /gene="V1"
FT                   /product="pre-coat protein"
FT                   /db_xref="GOA:A1E2T1"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:A1E2T1"
FT                   /protein_id="ABK96895.1"
FT                   /translation="MWDPLLNEFPESVHGFRCMLAIKYLQSVEETYEPNTLGHDLIRDL
FT                   ISVVRARDYVEATRRYNHFHARLEGSPKAELRQPIQQPCCCPHCPRHKQATIMDVQAHV
FT                   PKAQNIQNVSKP"
FT   gene            308. .1084
FT                   /gene="CP"
FT   CDS             308. .1084
FT                   /codon_start=1
FT                   /gene="CP"
FT                   /product="coat protein"
FT                   /db_xref="GOA:A1E2T2"
FT                   /db_xref="InterPro:IPR000143"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:A1E2T2"
FT                   /protein_id="ABK96896.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYSSRAAVPIVQGTNKRRSW
FT                   TYRPMYRKPRIYRMYRSPDVPRGCEGPCKVQSYEQRDDIKHTGIVRCVSDVTRGSGITH
FT                   RVGKRFCVKSIYFLGKVWMDENIKKQNHTNQVMFFLVRDRRPYGNSPMDFGQVFNMFDN
FT                   EPSTATVKNDLRDRFQVMRKFHATVIGGPSGMKEQALVKRFFKINSHVTYNHQEAAKYE
FT                   NHTENALLLYMACTHASNPVYATMKIRIYFYDSISN"
FT   gene            complement(1081. .1485)
FT                   /gene="C3"
FT   CDS             complement(1081. .1485)
FT                   /codon_start=1
FT                   /gene="C3"
FT                   /product="C3 protein"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:A1E2T3"
FT                   /protein_id="ABK96897.1"
FT                   /translation="MDSRTGELITAPQAENGVFIWEINNPLYFKITEHSQRPFLMNHDI
FT                   ISIQIRFNHNIRKVLGIHKCFLNFRIWTTLQPQTGRFLRVFRYEVLKYLDSLGVISINN
FT                   VIRAVDHVLYDVLENTINVTEAHDIKYKCY"
FT   gene            complement(1226. .1633)
FT                   /gene="C2"
FT   CDS             complement(1226. .1633)
FT                   /codon_start=1
FT                   /gene="C2"
FT                   /product="C2 protein"
FT                   /db_xref="GOA:A1E2T4"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:A1E2T4"
FT                   /protein_id="ABK96898.1"
FT                   /translation="MQPSSPSTSHCSQVSIKVQHKIAKKKPIRRKRVDLDCGCSYYLHL
FT                   NCNNHGFTHRGTHHCSSGREWRFYLGDKQSPLFQDNRTQPAAISNEPRHHFHSDKIQPQ
FT                   HQEGIGDSQMFSQLPNLDDITASDWSFLKSI"
FT   gene            complement(1542. .2615)
FT                   /gene="Rep"
FT   CDS             complement(1542. .2615)
FT                   /codon_start=1
FT                   /gene="Rep"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:A1E2T5"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="UniProtKB/TrEMBL:A1E2T5"
FT                   /protein_id="ABK96899.1"
FT                   /translation="MPRLFKIYAKNYFLTYPNCYLSKEEALSQLKNLETPTNKKYIKVC
FT                   REFHENGEPHLHVLIQFEGKYQCKNQRFFDLVSPNRSAHFHPNIQAAKSSTDVKTYVEK
FT                   DGEFIDFGVFQIDGRSARGGQQSANDAYAEALNSGSKSEALNILKEKAPKDYILQFHNL
FT                   SSNLDRIFSPPLEVYVSPFLSSSFNQVPDELEEWVAENVVSSAARPWRPISIVIEGDSR
FT                   TGKTMWARSLGPHNYLCGHLDLSPKVYNNDVWYNVIDDVDPHYLKHFKEFMGAQRDWQS
FT                   NTKYGKPIQIKGGIPTIFLCNPGPTSSYREYLDEEKNISLKNWSLKNATFVTLYEPLFA
FT                   SINQGPTQDSQEETNKA"
FT   gene            complement(2171. .2464)
FT                   /gene="C4"
FT   CDS             complement(2171. .2464)
FT                   /codon_start=1
FT                   /gene="C4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:A1E2T6"
FT                   /protein_id="ABK96900.1"
FT                   /translation="MGNHISMCLSNSKANTNVKTNGSSTWYPQTGQHISIRTFRQLRAQ
FT                   QMSRPTWRKTENSLILEFSKSMADQLEEVSNLPTTHMPKHSIQAVNPRPSIY"
XX
SQ   Sequence 2781 BP; 756 A; 541 C; 606 G; 878 T; 0 other;

ef101929 Length: 2781  08-JAN-2008  Type: N  Check: 2615  ..

       1  accggatggc cgcgcctttt ccttttatgt ggtccccacg agggttacac
      51  agacgtcact gtcaaccaat caaattgcat cctcaaacgt tagataagtg
     101  ttcatttgtc tttatatact tggtccccaa gtattttgtc ttgcaatatg
     151  tgggaccctc ttctaaatga atttcctgaa tctgttcacg gatttcgttg
     201  tatgttagct attaaatatt tgcagtccgt tgaggaaact tacgagccca
     251  atacattggg ccacgattta attagggatc ttatatctgt tgtaagggcc
     301  cgtgactatg tcgaagcgac ccggcgatat aatcatttcc acgcccgtct
     351  cgaaggttcg ccgaaggctg aacttcgaca gcccatacag cagccgtgct
     401  gctgtcccca ttgtccaagg cacaaacaag cgacgatcat ggacgtacag
     451  gcccatgtac cgaaagccca gaatatacag aatgtatcga agccctgatg
     501  ttccccgtgg atgtgaaggc ccatgtaaag tgcagtctta tgagcaacgg
     551  gatgatatta agcataccgg tattgttcgt tgtgtcagtg atgttactcg
     601  tggatctgga attactcaca gagtgggtaa gaggttctgt gttaaatcga
     651  tatatttttt aggtaaagtc tggatggatg aaaacatcaa gaagcagaat
     701  cacactaatc aggtcatgtt cttcttggtc cgtgatagga ggccctatgg
     751  aaacagccca atggattttg gacaggtttt taatatgttc gataatgagc
     801  ccagtaccgc aaccgtgaag aatgatttgc gtgataggtt tcaagtgatg
     851  aggaaatttc atgctacagt tattggtggg ccctctggaa tgaaggaaca
     901  ggcattagtt aagagatttt ttaaaattaa cagtcatgta acttataatc
     951  atcaggaggc agccaagtat gagaaccata ctgaaaacgc cttgttattg
    1001  tatatggcat gtacgcatgc ctctaatcct gtgtatgcaa ctatgaaaat
    1051  acgcatctat ttctatgatt caatatcaaa ttaataacat ttatatttta
    1101  tatcatgagc ttctgttaca tttattgtgt tttcaagtac atcatacaat
    1151  acatgatcaa ctgctctgat tacattgtta atggaaatta caccaagact
    1201  atctaaatac ttaagaactt catatctaaa tactcttaag aaacgaccag
    1251  tctgaggctg taatgtcgtc caaattcgga agttgagaaa acatttgtga
    1301  atccccaata ccttcctgat gttgtggttg aatcttatct gaatggaaat
    1351  gatgtcgtgg ttcattagaa atggccgctg gctgtgttct gttatcttga
    1401  aatagagggg attgtttatc tcccagataa aaacgccatt ctctgcctga
    1451  ggagcagtga tgagttcccc tgtgcgtgaa tccatgattg ttgcagttga
    1501  ggtggaggta gtatgagcag ccacagtcta ggtctacacg cttacgcctt
    1551  attggtttct tcttggctat cttgtgttgg accttgattg atacttgcga
    1601  acagtggctc gtagagggtg acgaaggttg cattcttgag agaccaattt
    1651  ttcaaggata tgtttttttc ttcatctaga tattccctat atgaggaggt
    1701  aggtcctgga ttgcagagga agatagtggg aattccccct ttaatttgaa
    1751  tgggcttccc gtactttgtg ttgctttgcc agtccctctg ggcccccatg
    1801  aattccttga agtgctttaa ataatgcggg tctacgtcat caatgacgtt
    1851  gtaccacaca tcgttattgt acacctttgg gcttaggtct agatgtccac
    1901  ataaataatt atgtgggcct agagacctgg cccacattgt cttgcccgtt
    1951  ctgctatcac cctcaatgac aatacttatg ggtctccatg gccgcgcagc
    2001  ggaagatacg acgttctcgg cgacccactc ttcaagttca tctggaactt
    2051  gattaaaaga agaagaaaga aatggagaaa cataaacttc taaaggagga
    2101  ctaaaaatcc tatctaaatt tgaacttaaa ttatgaaatt gtaaaatata
    2151  gtcctttggg gccttctctt ttaatatatt gagggcctcg gatttactgc
    2201  ctgaattgag tgcttcggca tatgcgtcgt tggcagattg ctgacctcct
    2251  ctagctgatc tgccatcgat ttggaaaact ccaaaatcaa tgaattctcc
    2301  gtctttctcc acgtaggtct tgacatctgt tgagctctta gctgcctgaa
    2351  tgttcggatg gaaatgtgct gacctgtttg gggataccag gtcgaagaac
    2401  cgttggtttt tacattggta tttgccttcg aattggataa gcacatggag
    2451  atgtggttcc ccattctcgt ggaattctct gcaaactttg atgtattttt
    2501  tatttgttgg ggtttctagg ttttttaatt gggaaagtgc ttcctcttta
    2551  gagagataac aattgggata tgtcaggaaa taatttttgg catatatttt
    2601  aaataaacga ggcatgttga aatgaatcgg tgtccctcaa agctctatgg
    2651  caatcggtgt attggtgtct tacttatacc tggacaccta atggctatat
    2701  ggtaatttca taaatgttta ttgcaattca aaattcaaac ttcaaaaatc
    2751  aaatcattaa agcggccatc cgtataatat t