Sequence of DPV Cotton leaf curl Gezira virus

Cotton leaf curl Gezira virus DNA-A, complete sequence, clone NT31

ACC No: FR751145

Dated: 2011-06-22 | Length: 2764 | CRC: 1247719698

                
ID   FR751145; SV 1; circular; genomic DNA; STD; VRL; 2764 BP.
XX
AC   FR751145;
XX
DT   22-JUN-2011 (Rel. 109, Created)
DT   22-JUN-2011 (Rel. 109, Last updated, Version 1)
XX
DE   Cotton leaf curl Gezira virus DNA-A, complete sequence, clone NT31
XX
KW   complete genome.
XX
OS   Cotton leaf curl Gezira virus
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2764
RA   Tahir M.N.;
RT   ;
RL   Submitted (25-DEC-2010) to the INSDC.
RL   Tahir M.N., Agricultural Biotechnology Division, National Institute for
RL   Biotechnology & Genetic Engineering, NIBGE, Jhang Road, Faisalabad, Punjab,
RL   38000, PAKISTAN.
XX
RN   [2]
RX   DOI; 10.1371/journal.pone.0020366.
RX   PUBMED; 21637815.
RA   Tahir M.N., Amin I., Briddon R.W., Mansoor S.;
RT   "The merging of two dynasties-identification of an african cotton leaf curl
RT   disease-associated begomovirus with cotton in pakistan";
RL   PLoS One 6(5):e20366-e20366(2011).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2764
FT                   /organism="Cotton leaf curl Gezira virus"
FT                   /segment="DNA-A"
FT                   /host="Gossypium hirsutum"
FT                   /mol_type="genomic DNA"
FT                   /country="Pakistan:Sindh, Hala"
FT                   /collection_date="2005"
FT                   /clone="NT31"
FT                   /tissue_type="Leaf"
FT                   /db_xref="taxon:222459"
FT   CDS             162. .530
FT                   /gene="V2"
FT                   /product="precoat protein"
FT                   /protein_id="CBY85327.1"
FT                   /translation="MWDPLLNDFPESVHGFRCMLAVKYLQAVRESYDPSTLGYDLLSDL
FT                   IGVVRRTNYVEATSRYHHFHSRLESASPSELRQSRVILCTCPHCPRHKQTSGVVQQTQL
FT                   QEAQNLQTVSEPRCPKGV"
FT   CDS             322. .1098
FT                   /gene="V1"
FT                   /product="coat protein"
FT                   /protein_id="CBY85328.1"
FT                   /translation="MSKRPADIIISTPASKVRRRLNFDSPGLSSARAPTVLVTNKRRAW
FT                   SNRPNYRKPRIYRLYRSPDVPKGCEGPCKVQSYEQRDDVKHTGIVRCVSDVTKGTGLTH
FT                   RTGKRFTIKSIYILGKVWMDENIKKQNHTNNVMFFLVRDRRPYGNSPMDFGQVFNMFDN
FT                   EPSTATVKNDYRDRFQVMRKFSATVTGGPSGMKEQALVRRFFKINSQIVYNHQEAAKYE
FT                   NHTENALLLYMACTHASNPVYATLKIRIYFYDSVSN"
FT   CDS             complement(550. .1032)
FT                   /gene="C5"
FT                   /product="hypothetical protein"
FT                   /protein_id="CBY85329.1"
FT                   /translation="MSTCHIQQQSILSMVLILRSFLMVINNLTVNLKKPTNKSLFLHPR
FT                   RTTSYSSTKLTHHLETISIIILYSSSTWLIIKHVKNLTKIHRGIPIRSSITNKEKHDIV
FT                   GVILLLNVLIHPYLTKNINGFDCETLSSTMSKPSSLGHIRNTTNNTSMLHIITLFI"
FT   CDS             complement(1095. .1496)
FT                   /gene="C3"
FT                   /product="replication enhancer protein"
FT                   /protein_id="CBY85330.1"
FT                   /translation="MDSRTGELITAHQTENGVLIWTINNPLYFKTIKEIPLTHGNQTMV
FT                   EMQIRFNYNLRKELGIHKCFMNFRVWTISRPPTGLFLNVFRKQIMKYLYRIGVISINNV
FT                   IRAVNHVLYDVLQTTVESEFTHNIQIKLY"
FT   CDS             complement(1240. .1644)
FT                   /gene="C2"
FT                   /product="transcriptional activator protein"
FT                   /protein_id="CBY85331.1"
FT                   /translation="MRPSSPSQIRCTQVPIKVQHREAKKRAIRRRRIDIPCGCTVYVAF
FT                   TCRDNGFTHRGTHHCASDREWRTYLDNQQSPVFQNHKGDSSNARESNNGRDADKIQLQP
FT                   QEGIGDSQMFHELQGLDDLTPSDWSFLKRI"
FT   CDS             complement(1544. .2095)
FT                   /gene="C1"
FT                   /product="Rep protein"
FT                   /note="Disrupted replication associated protein"
FT                   /protein_id="CBY85332.1"
FT                   /translation="MFLLFLSSSFDQVPEELEEWAAENVVEAAARPSRPISIVIEGESR
FT                   TGKTVWARSLGPHNYLCGHLDLSPKVFSNDAWYNVIDDVDPHYLKHFKEFMGAQKDWQS
FT                   NTKYGKPVQIKGGIPTIFLCNPGPNSSYKEYLDEDKNAHLKSWALKNATFITLSNPLYS
FT                   GTNQSSASGGQEESNQETQD"
FT   CDS             complement(2043. .2633)
FT                   /gene="C1"
FT                   /product="Rep protein"
FT                   /note="Disrupted replication associated protein"
FT                   /protein_id="CBY85333.1"
FT                   /translation="MAPPHRFQIYAKNYFLTFPKCSLTKEEALEQIQQISTASNKKYIK
FT                   ICRELHEDGQPHLHVLLQFEGKFKCQNQRLFDLVSPNRSTHFHPNIQGAKSSSDVKSYI
FT                   DKDGDTLEWGEFQIDGRSARGGQQTANDAYAAALNAGSKAEALRVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPPAPYVSPFFVFFVRPSSRRT"
FT   exon            complement(2043. .2633)
FT                   /gene="C1"
FT                   /number=1
FT   mRNA            complement(2183. .2476)
FT                   /gene="C4"
FT   CDS             complement(2183. .2476)
FT                   /gene="C4"
FT                   /product="C4 protein"
FT                   /protein_id="CBY85334.1"
FT                   /translation="MANLIYMCFSSSKGSSSAKIRDSSTWSPQIGQHISIQTFRELNPA
FT                   PTSSPTSTRMETHWNGENSRSTVDLQEEDNRPPMTLTPQRLTQEVRQRLLGL"
XX
SQ   Sequence 2764 BP; 689 A; 599 C; 619 G; 857 T; 0 other;

fr751145 Length: 2764  22-JUN-2011  Type: N  Check: 4676  ..

       1  accggtgggc gcgaaaaaaa aagtggtccc cgccccacgt gaacatgtcg
      51  cgcgagtgct gtacatgtcg cgcgatgctg tccaatcaga actcgcgctc
     101  tacgcattat aatttgaaat ttgaaatata aacttgctcc ctaagtttgt
     151  taggcataac tatgtgggat ccgttattga acgacttccc tgaatccgtt
     201  cacggtttcc gttgtatgct agccgtgaaa tatttgcagg ctgttcgaga
     251  gtcgtatgat ccttccactc ttgggtacga tcttcttagc gatctaatcg
     301  gagttgttcg ccgtaccaac tatgtcgaag cgaccagcag atatcatcat
     351  ttccactccc gcctcgaaag tgcgtcgccg tctgaacttc gacagtcccg
     401  ggttatcctc tgcacgtgcc cccactgtcc tcgtcacaaa caaacgtcgg
     451  gcgtggtcca acagacccaa ttacaggaag cccagaattt acagactgta
     501  tcggagccca gatgtcccaa aggggtgtga aggtccatgt aaggtccagt
     551  catatgaaca gcgtgatgat gtgaagcata ctggtattgt tcgttgtgtt
     601  tctgatgtga ccaagggaac tgggcttact catcgtactg gaaagcgttt
     651  cacaatcaaa tccatttata ttcttggtaa ggtatggatg gatgagaaca
     701  ttaagaagca gaatcacacc aacaatgtca tgtttttcct tgttcgtgat
     751  agaagaccgt atgggaattc cccgatggat tttggtcaag tttttaacat
     801  gtttgataat gagccaagta ctgctactgt aaagaatgat tatcgagatc
     851  gtttccaggt gatgcgtaag tttagtgcta ctgtaactgg tggtccttct
     901  gggatgaagg aacaggctct tgttcgtagg ttttttaaga ttaacagtca
     951  gattgtttat aaccatcagg aagctgcgaa gtatgagaac catactgaga
    1001  atgctttgtt gttgtatatg gcatgtactc atgcttctaa tcctgtgtat
    1051  gctacgttaa agatacggat ctacttctac gattcggtat ctaattaata
    1101  taattttatt tgaatattgt gggtgaattc actttcgacg gttgtttgca
    1151  atacatcgta caaaacatga ttgactgccc taattacatt attaattgaa
    1201  attacgccta ttctatacaa atatttcata atctgcttcc taaatacgtt
    1251  taagaaaaga ccagtcggag ggcgtgagat cgtccagacc ctgaagttca
    1301  tgaaacattt gtgaatcccc aattccttcc tgaggttgta gttgaatctt
    1351  atctgcatct ctaccattgt ttgattcccg tgcgttagag gaatctcctt
    1401  tatggttttg aaatacaggg gattgttgat tgtccagata agtacgccat
    1451  tctctgtctg atgcgcagtg atgagttccc ctgtgcgtga atccattgtc
    1501  tctgcacgtg aaggctacgt atactgtgca accacaaggg atgtcaatcc
    1551  tgcgtctcct gattgctctc ttcttggcct cccgatgctg aactttgatt
    1601  ggtacctgag tacagcggat ttgagagggt gatgaaggtc gcattcttta
    1651  atgcccagga tttgagatgc gcgttcttgt cctcgtctaa atactcttta
    1701  tacgaggaat tgggacctgg attgcagagg aagattgttg ggatgccccc
    1751  tttaatttga actggtttcc cgtattttgt gttgctttgc cagtcctttt
    1801  gtgcacccat gaactcctta aagtgtttca gataatgcgg gtcaacatca
    1851  tctatgacgt tataccatgc gtcattactg aacacttttg ggcttagatc
    1901  aagatggccg cacagataat tgtgaggccc aagacttctg gcccatacgg
    1951  tctttcctgt tctgctttcc ccttcaatca ctatactaat cggtctactt
    2001  ggccgcgcag cggcctcaac gacgttctcc gccgcccatt cttcaagttc
    2051  ttctggaact tggtcgaacg aagaagacaa aaaaaggaga aacataagga
    2101  gctggaggct cctgaaaaat cctttctaaa ttactattta aattatgaaa
    2151  ttgtaaaaca aaatctttgg gagctaactc ccttataacc ctaagagcct
    2201  ctgccttact tcctgcgtta agcgctgcgg cgtaagcgtc attggcggtc
    2251  tgttgtcctc ctcttgcaga tctaccgtcg atctggaatt ctccccattc
    2301  cagtgtgtct ccatccttgt cgatgtagga cttgacgtcg gagctggatt
    2351  tagctccctg aatgtttgga tggaaatgtg ttgacctatt tggggagacc
    2401  aggtcgaaga gtctctgatt ttggcacttg aacttccctt cgaactggag
    2451  aagcacatgt agatgaggtt ggccatcctc gtgaagctct ctgcagatct
    2501  tgatatattt cttgtttgaa gctgtgctta tttgctgaat ttgctctagg
    2551  gcttcttctt tggttagaga acattttggg aaagttagga aataattctt
    2601  ggcatatatt tggaatctgt ggggaggagc cattgacttc gtcaatcggt
    2651  actcagatgc ttctctccaa tatatcggta ctcaatatat agtgagtacc
    2701  aaatggcatt ttggtaaaaa tcccaataat tttgaacccc catagcgccc
    2751  accgttctaa tatt