Sequence of DPV Bean golden yellow mosaic virus

Bean golden yellow mosaic virus-[Puerto Rico] DNA1, complete sequence.

ACC No: M10070

Dated: 2005-04-17 | Length: 2646 | CRC: -1018423052

                !!NA_SEQUENCE 1.0
ID   GEBGMV1    standard; circular genomic DNA; VRL; 2646 BP.
XX
AC   M10070;
XX
SV   M10070.1
XX
DT   02-JUL-1986 (Rel. 09, Created)
DT   17-APR-2005 (Rel. 83, Last updated, Version 9)
XX
DE   Bean golden yellow mosaic virus-[Puerto Rico] DNA1, complete sequence.
XX
KW   .
XX
OS   Bean golden yellow mosaic virus-[Puerto Rico]
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2646
RA   Howarth A.J., Caton J., Bossert M., Goodman R.M.;
RT   "Nucleotide sequence of bean golden mosaic virus and a model for gene
RT   regulation in geminiviruses";
RL   Proc. Natl. Acad. Sci. U.S.A. 82:3572-3576(1985).
XX
CC   Draft entry and sequence in computer readable form kindly provided
CC   by A.J.Howarth (26-SEP-1985).
CC   Bean golden mosaic virus consists of two circular ss-DNA molecules,
CC   DNA 1 and DNA 2.  The sense of the strand below is identical to
CC   that of the viral ss-DNA.
CC   The 'common regions', positions 1-205, are identical in DNA 1 and
CC   DNA 2.  There is no sequence homology in the regions flanking these
CC   areas.  A repeat is located at nucleotides 18-37 and 40-59 and an
CC   inverted repeat at 149-184.  This inverted repeat may form a stable
CC   stem-loop structure with a 12 bp stem and a 12 bp loop.  'at' rich
CC   regions are found at positions 220-240 and 320-340. REFERENCE 1
CC   suggests that late genes are expressed in a clockwise and early
CC   genes in a counterclockwise direction.  Four ORFs (see FEATURES) on
CC   DNA 1 compare favorably with the ORFs of tomato golden mosaic virus
CC   and cassava latent virus with respect to length, position and
CC   partial homology. REFERENCE 1 noted additional ORFs also.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2646
FT                   /db_xref="taxon:222448"
FT                   /mol_type="genomic DNA"
FT                   /note="ICTV7 type organism"
FT                   /organism="Bean golden yellow mosaic virus-[Puerto Rico]"
FT                   /segment="DNA1"
FT   CDS             join(complement(1591. .2646),complement(1. .6))
FT                   /codon_start=1
FT                   /db_xref="GOA:P05175"
FT                   /db_xref="HSSP:1L5I"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="UniProt/Swiss-Prot:P05175"
FT                   /product="40.2 kDa protein"
FT                   /protein_id="AAA46318.1"
FT                   /translation="MPPPQRFRVQSKNYFLTYPRCTIPKEEALSQLQKIHTTTNKKFIK
FT                   VCEERHDNGEPHLHALIQFEGKFICTNKRLFDLVSTTRSAHFHPNIQGAKSSSDVKEYI
FT                   DKDGVTIEWGQFQVDGRSARGGQQSANDSYAKALNADSIESALTILKEEQPKDYVLQNH
FT                   NIRSNLERIFFKVPEPWVPPFPLSSFVNIPVVMQDWVDDYFGRGSAARPERPISIIVEG
FT                   DSRTGKTMWARALGPHNYLSGHLDFNSLVYSNSVEYNVIDDITPNYLKLKDWKELIGEQ
FT                   KDWQSNCKYGKPVQIKGGIPSIVLCNPGEGSSYKDFLNKEEKPALHNWTIHNAIFVTLT
FT                   APLYQSTAQDCQT"
FT   repeat_region   1. .205
FT                   /note="common region repeat"
FT   CDS             399. .1124
FT                   /codon_start=1
FT                   /db_xref="GOA:P05152"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProt/Swiss-Prot:P05152"
FT                   /note="27.7 kDa protein"
FT                   /product="putative coat protein"
FT                   /protein_id="AAA46319.1"
FT                   /translation="MAGTSKVSRSGNYSPSGGMGSKSNKANAWVNRPMYRKPRIYRMYK
FT                   SPDVPKGCEGPCKVQSYEQRHDISHVGKVMCISDITRGNGITHRVGKRFCVKSVYILGK
FT                   IWMDENIMLKNHTNSVIFWLVRDRRPYGTPMDFGQVFNMFDNEPSTATVKNDFRDRYQV
FT                   MHRFNAKVSGGQYASNDQALVRRFWKVNNHVVYNHQEAGKYENHTENALLLYMACTHAS
FT                   NPVYATLKIRIYVYDSITN"
FT   CDS             complement(1121. .1519)
FT                   /codon_start=1
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="InterPro:IPR008973"
FT                   /db_xref="UniProt/Swiss-Prot:P05173"
FT                   /product="15.6 kDa protein"
FT                   /protein_id="AAA46320.1"
FT                   /translation="MDSRTGENITAHQAENSVFIWEVPNPLYFKIMRVEDPAYTRTRIY
FT                   HIQIRFNHNLRKALDLHKAFLNFQVWTTSIQASGTTYLNRFRLLVLLYLHRLGVIGINN
FT                   VIRAVQFATNKSYVNTVLENHDIKYKFY"
FT   CDS             complement(1266. .1784)
FT                   /codon_start=1
FT                   /db_xref="GOA:P05174"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="InterPro:IPR009072"
FT                   /db_xref="UniProt/Swiss-Prot:P05174"
FT                   /product="19.6 kDa protein"
FT                   /protein_id="AAA46321.1"
FT                   /translation="MESRFKLKEEYHQSCCAIQVRVPVIKTSSTKRKNQLYTTGPFIMR
FT                   SSSPSQPPSIKAQHRIAKHKAIRRRRIDLNCGCSIFYHIKCADHGFTHRGEHHCASGRE
FT                   FRFYLGGTKSPLFQDHAGGRSSIHTDKDIPHPNQVQSQPQESTGSPQSIPELPSLDDID
FT                   SSFWDDIFK"
XX
SQ   Sequence 2646 BP; 709 A; 517 C; 587 G; 833 T; 0 other;

   M10070  Length: 2646  May 10, 2005 10:09  Type: N  Check: 9501  ..

       1  tggcatattt gtaaatatgc gagtgtctcc aaatgagttt gcgagtgtct
      51  ccaattgagg ctcctcaaac tctcgctatg caattggaga ctggagtaca
     101  atatatacta gaaccctcaa tctcttgaat tatcacatcc atacacgtgg
     151  cggccatccg atataatatt accggatggc cgcccgcgcc cctttatatc
     201  cgtactgcta cacgtggtgc tttaatttaa attaaagatg tctatttttg
     251  actgaccaat gcttttgcat gtgagaagct tagatatttg tgtaaaactt
     301  ggcgactaag ttttaccttc gtttataaat ttaaattaaa tgtatgccca
     351  ttccacgtgt aagtccagaa tgcctaagcg tgatgcgccg tggctcatat
     401  ggcgggaacc tccaaggttt cccgttctgg caattattct ccaagtggtg
     451  gaatgggctc aaaatccaac aaggccaatg catgggtcaa caggcccatg
     501  tatagaaagc caaggatata tcggatgtac aaaagcccag atgtgccaaa
     551  gggatgtgaa ggaccttgca aggtccaatc atatgaacaa cgccatgata
     601  tatctcatgt tggtaaggtt atgtgtatat ccgatatcac acgtggtaat
     651  ggtattactc atcgtgttgg taaacgtttt tgtgtgaagt ctgtgtatat
     701  tttaggtaag atatggatgg atgaaaacat catgcttaag aaccatacca
     751  atagtgtcat tttttggttg gttcgtgacc gtagaccata tggaacccct
     801  atggattttg gtcaagtttt taacatgttt gacaatgaac ctagtactgc
     851  tacggtcaag aacgattttc gtgatcgtta tcaagttatg cataggttca
     901  atgcaaaggt ttctggtggt caatatgcaa gcaacgatca agccttggta
     951  aggcgttttt ggaaggtgaa caaccatgtc gtctataacc accaggaagc
    1001  aggaaaatac gagaatcata cggagaatgc gttattgttg tatatggcat
    1051  gtacacatgc ctctaatcct gtatatgcga cattgaaaat tcggatctat
    1101  gtctatgatt cgataaccaa ttaataaaat ttatatttta tatcatgatt
    1151  ctcaagtaca gtatttacat atgatttgtt tgttgcgaac tgaacagctc
    1201  taatgacatt gtttattcct attacgccta acctatgtaa atacaataaa
    1251  actaagagtc taaatctatt taaatatgtc gtcccagaag cttgaatcga
    1301  tgtcgtccag acttggaagt tcaggaatgc tttgtggaga tccagtgctt
    1351  tcctgaggtt gtgattgaac ctgatttgga tgtggtatat ccttgtccgt
    1401  gtgtatgctg gatcttccac ccgcatgatc ttgaaataaa ggggatttgg
    1451  tacctcccaa ataaaaacgg aattctctgc ctgatgcgca gtgatgttct
    1501  cccctgtgcg tgaatccatg atctgcgcac ttgatatggt aaaatatgga
    1551  acagccgcag ttcaagtcaa tgcgtcgtcg acgaatggct ttatgtttgg
    1601  caatcctgtg ctgtgctttg atagaggggg gctgtgaggg tgacgaagat
    1651  cgcattatga atggtccagt tgtgtaaagc tggtttttcc tctttgttga
    1701  ggaagtcttt ataactggaa ccctcacctg gattgcacag cacgattgat
    1751  ggtattcctc ctttaatttg aaccggcttt ccatatttac agttggattg
    1801  ccagtccttt tgttccccaa ttagctcttt ccagtccttt aacttcaaat
    1851  aattcggggt tatgtcatca atgacgttgt attccactga gttcgaatag
    1901  acaagtgaat taaagtccaa atgaccgctc aaataattat gtgggcctaa
    1951  tgcacgagcc cacattgtct ttccagttcg tgaatcacct tcgacgatga
    2001  tactaatagg tctttctggc cgcgcagcgg aaccccttcc gaaatagtcg
    2051  tcaacccagt cttgcataac aaccggaata ttgacgaatg atgacaacgg
    2101  aaatggagga acccatggtt ccggcacttt gaagaagatc cgttcgagat
    2151  tagaacggat gttgtgattt tgaaggacgt aatctttcgg ttgttcttcc
    2201  ttcaatattg tcaaggcaga ttcaattgaa tctgcgttta atgcctttgc
    2251  gtatgagtcg ttggcagact gctgacctcc tcttgcagat ctgccgtcga
    2301  cttggaattg tccccattcg attgtgactc catctttgtc gatgtattct
    2351  ttgacgtcgg aacttgattt agctccctga atgttcggat ggaaatgtgc
    2401  tgacctggtt gtggatacca ggtcgaacaa tcttttattt gtgcagatga
    2451  atttaccttc gaactgaata agcgcatgaa gatggggttc accattatcg
    2501  tgacgttcct cacagacctt gatgaatttc ttattcgtcg ttgtatgaat
    2551  cttttgaagt tgcgaaagag cttcttcttt cggtatagtg caacgaggat
    2601  aagtgaggaa atagtttttg gactgaactc taaatctttg aggtgg