Sequence of DPV Bean golden yellow mosaic virus
Bean golden yellow mosaic virus-[Puerto Rico] DNA1, complete sequence.
ACC No: M10070
Dated: 2005-04-17 | Length: 2646 | CRC: -1018423052
!!NA_SEQUENCE 1.0
ID GEBGMV1 standard; circular genomic DNA; VRL; 2646 BP.
XX
AC M10070;
XX
SV M10070.1
XX
DT 02-JUL-1986 (Rel. 09, Created)
DT 17-APR-2005 (Rel. 83, Last updated, Version 9)
XX
DE Bean golden yellow mosaic virus-[Puerto Rico] DNA1, complete sequence.
XX
KW .
XX
OS Bean golden yellow mosaic virus-[Puerto Rico]
OC Viruses; ssDNA viruses; Geminiviridae; Begomovirus.
XX
RN [1]
RP 1-2646
RA Howarth A.J., Caton J., Bossert M., Goodman R.M.;
RT "Nucleotide sequence of bean golden mosaic virus and a model for gene
RT regulation in geminiviruses";
RL Proc. Natl. Acad. Sci. U.S.A. 82:3572-3576(1985).
XX
CC Draft entry and sequence in computer readable form kindly provided
CC by A.J.Howarth (26-SEP-1985).
CC Bean golden mosaic virus consists of two circular ss-DNA molecules,
CC DNA 1 and DNA 2. The sense of the strand below is identical to
CC that of the viral ss-DNA.
CC The 'common regions', positions 1-205, are identical in DNA 1 and
CC DNA 2. There is no sequence homology in the regions flanking these
CC areas. A repeat is located at nucleotides 18-37 and 40-59 and an
CC inverted repeat at 149-184. This inverted repeat may form a stable
CC stem-loop structure with a 12 bp stem and a 12 bp loop. AT-rich
CC regions are found at positions 220-240 and 320-340. REFERENCE 1
CC suggests that late genes are expressed in a clockwise and early
CC genes in a counterclockwise direction. Four ORFs (see FEATURES) on
CC DNA 1 compare favorably with the ORFs of tomato golden mosaic virus
CC and cassava latent virus with respect to length, position and
CC partial homology. REFERENCE 1 also noted additional ORFs.
XX
FH Key Location/Qualifiers
FH
FT source 1..2646
FT /db_xref="taxon:222448"
FT /mol_type="genomic DNA"
FT /note="ICTV7 type organism"
FT /organism="Bean golden yellow mosaic virus-[Puerto Rico]"
FT /segment="DNA1"
FT CDS join(complement(1591..2646),complement(1..6))
FT /codon_start=1
FT /db_xref="GOA:P05175"
FT /db_xref="HSSP:1L5I"
FT /db_xref="InterPro:IPR001191"
FT /db_xref="InterPro:IPR001301"
FT /db_xref="UniProt/Swiss-Prot:P05175"
FT /product="40.2 kDa protein"
FT /protein_id="AAA46318.1"
FT /translation="MPPPQRFRVQSKNYFLTYPRCTIPKEEALSQLQKIHTTTNKKFIK
FT VCEERHDNGEPHLHALIQFEGKFICTNKRLFDLVSTTRSAHFHPNIQGAKSSSDVKEYI
FT DKDGVTIEWGQFQVDGRSARGGQQSANDSYAKALNADSIESALTILKEEQPKDYVLQNH
FT NIRSNLERIFFKVPEPWVPPFPLSSFVNIPVVMQDWVDDYFGRGSAARPERPISIIVEG
FT DSRTGKTMWARALGPHNYLSGHLDFNSLVYSNSVEYNVIDDITPNYLKLKDWKELIGEQ
FT KDWQSNCKYGKPVQIKGGIPSIVLCNPGEGSSYKDFLNKEEKPALHNWTIHNAIFVTLT
FT APLYQSTAQDCQT"
FT repeat_region 1..205
FT /note="common region repeat"
FT CDS 399..1124
FT /codon_start=1
FT /db_xref="GOA:P05152"
FT /db_xref="InterPro:IPR000263"
FT /db_xref="InterPro:IPR000650"
FT /db_xref="UniProt/Swiss-Prot:P05152"
FT /note="27.7 kDa protein"
FT /product="putative coat protein"
FT /protein_id="AAA46319.1"
FT /translation="MAGTSKVSRSGNYSPSGGMGSKSNKANAWVNRPMYRKPRIYRMYK
FT SPDVPKGCEGPCKVQSYEQRHDISHVGKVMCISDITRGNGITHRVGKRFCVKSVYILGK
FT IWMDENIMLKNHTNSVIFWLVRDRRPYGTPMDFGQVFNMFDNEPSTATVKNDFRDRYQV
FT MHRFNAKVSGGQYASNDQALVRRFWKVNNHVVYNHQEAGKYENHTENALLLYMACTHAS
FT NPVYATLKIRIYVYDSITN"
FT CDS complement(1121..1519)
FT /codon_start=1
FT /db_xref="InterPro:IPR000657"
FT /db_xref="InterPro:IPR008973"
FT /db_xref="UniProt/Swiss-Prot:P05173"
FT /product="15.6 kDa protein"
FT /protein_id="AAA46320.1"
FT /translation="MDSRTGENITAHQAENSVFIWEVPNPLYFKIMRVEDPAYTRTRIY
FT HIQIRFNHNLRKALDLHKAFLNFQVWTTSIQASGTTYLNRFRLLVLLYLHRLGVIGINN
FT VIRAVQFATNKSYVNTVLENHDIKYKFY"
FT CDS complement(1266..1784)
FT /codon_start=1
FT /db_xref="GOA:P05174"
FT /db_xref="InterPro:IPR000942"
FT /db_xref="InterPro:IPR009072"
FT /db_xref="UniProt/Swiss-Prot:P05174"
FT /product="19.6 kDa protein"
FT /protein_id="AAA46321.1"
FT /translation="MESRFKLKEEYHQSCCAIQVRVPVIKTSSTKRKNQLYTTGPFIMR
FT SSSPSQPPSIKAQHRIAKHKAIRRRRIDLNCGCSIFYHIKCADHGFTHRGEHHCASGRE
FT FRFYLGGTKSPLFQDHAGGRSSIHTDKDIPHPNQVQSQPQESTGSPQSIPELPSLDDID
FT SSFWDDIFK"
XX
SQ Sequence 2646 BP; 709 A; 517 C; 587 G; 833 T; 0 other;
M10070 Length: 2646 May 10, 2005 10:09 Type: N Check: 9501 ..
1 tggcatattt gtaaatatgc gagtgtctcc aaatgagttt gcgagtgtct
51 ccaattgagg ctcctcaaac tctcgctatg caattggaga ctggagtaca
101 atatatacta gaaccctcaa tctcttgaat tatcacatcc atacacgtgg
151 cggccatccg atataatatt accggatggc cgcccgcgcc cctttatatc
201 cgtactgcta cacgtggtgc tttaatttaa attaaagatg tctatttttg
251 actgaccaat gcttttgcat gtgagaagct tagatatttg tgtaaaactt
301 ggcgactaag ttttaccttc gtttataaat ttaaattaaa tgtatgccca
351 ttccacgtgt aagtccagaa tgcctaagcg tgatgcgccg tggctcatat
401 ggcgggaacc tccaaggttt cccgttctgg caattattct ccaagtggtg
451 gaatgggctc aaaatccaac aaggccaatg catgggtcaa caggcccatg
501 tatagaaagc caaggatata tcggatgtac aaaagcccag atgtgccaaa
551 gggatgtgaa ggaccttgca aggtccaatc atatgaacaa cgccatgata
601 tatctcatgt tggtaaggtt atgtgtatat ccgatatcac acgtggtaat
651 ggtattactc atcgtgttgg taaacgtttt tgtgtgaagt ctgtgtatat
701 tttaggtaag atatggatgg atgaaaacat catgcttaag aaccatacca
751 atagtgtcat tttttggttg gttcgtgacc gtagaccata tggaacccct
801 atggattttg gtcaagtttt taacatgttt gacaatgaac ctagtactgc
851 tacggtcaag aacgattttc gtgatcgtta tcaagttatg cataggttca
901 atgcaaaggt ttctggtggt caatatgcaa gcaacgatca agccttggta
951 aggcgttttt ggaaggtgaa caaccatgtc gtctataacc accaggaagc
1001 aggaaaatac gagaatcata cggagaatgc gttattgttg tatatggcat
1051 gtacacatgc ctctaatcct gtatatgcga cattgaaaat tcggatctat
1101 gtctatgatt cgataaccaa ttaataaaat ttatatttta tatcatgatt
1151 ctcaagtaca gtatttacat atgatttgtt tgttgcgaac tgaacagctc
1201 taatgacatt gtttattcct attacgccta acctatgtaa atacaataaa
1251 actaagagtc taaatctatt taaatatgtc gtcccagaag cttgaatcga
1301 tgtcgtccag acttggaagt tcaggaatgc tttgtggaga tccagtgctt
1351 tcctgaggtt gtgattgaac ctgatttgga tgtggtatat ccttgtccgt
1401 gtgtatgctg gatcttccac ccgcatgatc ttgaaataaa ggggatttgg
1451 tacctcccaa ataaaaacgg aattctctgc ctgatgcgca gtgatgttct
1501 cccctgtgcg tgaatccatg atctgcgcac ttgatatggt aaaatatgga
1551 acagccgcag ttcaagtcaa tgcgtcgtcg acgaatggct ttatgtttgg
1601 caatcctgtg ctgtgctttg atagaggggg gctgtgaggg tgacgaagat
1651 cgcattatga atggtccagt tgtgtaaagc tggtttttcc tctttgttga
1701 ggaagtcttt ataactggaa ccctcacctg gattgcacag cacgattgat
1751 ggtattcctc ctttaatttg aaccggcttt ccatatttac agttggattg
1801 ccagtccttt tgttccccaa ttagctcttt ccagtccttt aacttcaaat
1851 aattcggggt tatgtcatca atgacgttgt attccactga gttcgaatag
1901 acaagtgaat taaagtccaa atgaccgctc aaataattat gtgggcctaa
1951 tgcacgagcc cacattgtct ttccagttcg tgaatcacct tcgacgatga
2001 tactaatagg tctttctggc cgcgcagcgg aaccccttcc gaaatagtcg
2051 tcaacccagt cttgcataac aaccggaata ttgacgaatg atgacaacgg
2101 aaatggagga acccatggtt ccggcacttt gaagaagatc cgttcgagat
2151 tagaacggat gttgtgattt tgaaggacgt aatctttcgg ttgttcttcc
2201 ttcaatattg tcaaggcaga ttcaattgaa tctgcgttta atgcctttgc
2251 gtatgagtcg ttggcagact gctgacctcc tcttgcagat ctgccgtcga
2301 cttggaattg tccccattcg attgtgactc catctttgtc gatgtattct
2351 ttgacgtcgg aacttgattt agctccctga atgttcggat ggaaatgtgc
2401 tgacctggtt gtggatacca ggtcgaacaa tcttttattt gtgcagatga
2451 atttaccttc gaactgaata agcgcatgaa gatggggttc accattatcg
2501 tgacgttcct cacagacctt gatgaatttc ttattcgtcg ttgtatgaat
2551 cttttgaagt tgcgaaagag cttcttcttt cggtatagtg caacgaggat
2601 aagtgaggaa atagtttttg gactgaactc taaatctttg aggtgg
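
The comment block of this record describes a direct repeat at nucleotides 18-37 and 40-59 and an inverted repeat at 149-184 that may fold into a stem-loop with a 12 bp stem and a 12 bp loop. The following is a minimal Python sketch, not part of the database entry, that checks those coordinates against the first 200 nucleotides of the sequence listing. The helper names (region, reverse_complement, mismatches) are illustrative assumptions, and the even 12/12/12 split of the inverted repeat is inferred from the comment text.

```python
# First four sequence lines of the record (positions 1-200), copied as listed.
RAW = """
tggcatattt gtaaatatgc gagtgtctcc aaatgagttt gcgagtgtct
ccaattgagg ctcctcaaac tctcgctatg caattggaga ctggagtaca
atatatacta gaaccctcaa tctcttgaat tatcacatcc atacacgtgg
cggccatccg atataatatt accggatggc cgcccgcgcc cctttatatc
"""
SEQ = "".join(RAW.split())  # drop the spaces and newlines used for layout


def region(seq, start, end):
    """Return the 1-based, inclusive span start..end, as used in the record."""
    return seq[start - 1:end]


def reverse_complement(dna):
    """Complement each base (a<->t, c<->g), then reverse the string."""
    return dna.translate(str.maketrans("acgt", "tgca"))[::-1]


def mismatches(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


# Direct repeat: two 20-nt copies named in the comment block.
repeat_a = region(SEQ, 18, 37)
repeat_b = region(SEQ, 40, 59)

# Inverted repeat: 36 nt, split here as 12-nt arm + 12-nt loop + 12-nt arm
# (an assumption based on the "12 bp stem and a 12 bp loop" wording).
inverted = region(SEQ, 149, 184)
stem_5, loop, stem_3 = inverted[:12], inverted[12:24], inverted[24:]

print("direct repeat copies:", repeat_a, repeat_b)
print("mismatches between copies:", mismatches(repeat_a, repeat_b))
print("arms pair perfectly:", stem_5 == reverse_complement(stem_3))
print("loop:", loop)
```

With the sequence as listed, the two direct-repeat copies differ at a single position, the two 12-nt arms are exact reverse complements of each other, and the intervening loop is atataatattac.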