Sequence of DPV Cauliflower mosaic virus
Cauliflower mosaic virus (altered virulence isolate D/H), complete genome.
ACC No: M10376
Dated: 2000-03-04 | Length: 8016 | CRC: -172464704
!!NA_SEQUENCE 1.0
ID MCACGDH standard; circular DNA; VRL; 8016 BP.
XX
AC M10376; J02047;
XX
SV M10376.1
XX
DT 09-MAR-1987 (Rel. 11, Created)
DT 04-MAR-2000 (Rel. 63, Last updated, Version 5)
XX
DE Cauliflower mosaic virus (altered virulence isolate D/H), complete genome.
XX
KW coat protein; complete genome.
XX
OS Cauliflower mosaic virus
OC Viruses; Retroid viruses; Caulimovirus.
XX
RN [1]
RP 1-8016
RX MEDLINE; 83106468.
RA Balazs E., Guilly H., Jonard G., Richards K.;
RT "Nucleotide sequence of DNA from an altered-virulence isolate D/H of the
RT cauliflower mosaic virus";
RL Gene 19:239-249(1982).
XX
DR EPD; EP07015; CAMV_35MJ.
DR SPTREMBL; Q83163; Q83163.
DR SPTREMBL; Q83164; Q83164.
DR SWISS-PROT; P03544; COAT_CAMVD.
DR SWISS-PROT; P03547; VMP_CAMVD.
DR SWISS-PROT; P03550; VAT_CAMVD.
DR SWISS-PROT; P03553; VDBP_CAMVD.
DR SWISS-PROT; P03556; POL_CAMVD.
DR SWISS-PROT; P03557; IBMP_CAMVD.
XX
CC The beta-strand is shown below.
XX
FH Key Location/Qualifiers
FH
FT source 1. .8016
FT /db_xref="taxon:10641"
FT /organism="Cauliflower mosaic virus"
FT /isolate="Cabb-D/H"
FT CDS 13. .303
FT /codon_start=1
FT /db_xref="SPTREMBL:Q83163"
FT /note="ORF7; putative"
FT /protein_id="AAA46344.1"
FT /translation="MNRSMTKTQEDKTSPKYQRVLNSKNKRSFKIKNSSLTPVTDRFTT
FT VRFQNNIECVYANFDSQLKSSYDGRSKKIKTLSLKNLRCYETFLRKYLLEQ"
FT CDS 365. .1348
FT /codon_start=1
FT /db_xref="SWISS-PROT:P03547"
FT /note="ORF1; putative"
FT /protein_id="AAA46345.1"
FT /translation="MDLYPEENTQSEQSQNSENNMQIFKSETSDGFSSDLKISNDQLKN
FT ISKTQLTLEKEKIFKMPNVLSQVMKKAFSRKNEILYCVSTKELSVDIHDATGKVYLPLI
FT TKEEINKRLSSLKPEVRRTMSMVHLGAVKILLKAQFRNGIDTPIKIALIDDRINSRRDC
FT LLGAAKGNLAYGKFMFTVYPKFGISLNTQRLNQTLSLIHDFENKNLMNKGDKVMTITYI
FT VGYALTNSHHSIDYQSNATIELEDVFQEIGNIQQSEFCTIQNDECNWAIDIAQNKALLG
FT AKTKTQIGNSLQIGNIASSSSTENELARVSQNIDLLKNKLKEICGE"
FT CDS 1345. .1824
FT /codon_start=1
FT /db_xref="SWISS-PROT:P03550"
FT /note="ORF2; putative"
FT /protein_id="AAA46346.1"
FT /translation="MSITGQPHVYKKDTIIRLKPLSLNSNNRSYVFSSSKGNIQNIINH
FT LNNLNKIVGRSLLGIWKINSYFGLSKDPSESKSKNPSVFNTAKTIFKSGGVDYSSQPKE
FT IKSLLEAQNTRIKSLEKAIQSLDEKIEPEPLTKEEVKELKESINSIKEGLKNIIG"
FT CDS 1826. .2215
FT /codon_start=1
FT /db_xref="SWISS-PROT:P03553"
FT /note="ORF3; putative"
FT /protein_id="AAA46347.1"
FT /translation="MANLNQIQKEVSEILSDQKSMKADIKAILELLGSQNPIKESLETV
FT AAKIVNDLTKLINDCPCNKEILEALGNQPKEQLIGQPKEKGKGLNLGKYSYPNYGVGNE
FT ELGSSGNPKALTWPFKAPAGWPNQY"
FT CDS 2197. .3669
FT /codon_start=1
FT /db_xref="SWISS-PROT:P03544"
FT /note="coat protein (gene IV)"
FT /protein_id="AAA46348.1"
FT /translation="MAESILDRTINRFWYKLGDDCLSESQFDLMIRLMEESLDGDQIID
FT LTSLPSDNLQVEQVMTTTEDSISEEESEFLLAIGETSEEESDSGEEPEFEQVRMDRTGG
FT TEIPKEEDGGEPSRYNERKRKTTEDRYFPTQPKTIPGQKQTTMGMLNIDCQANRRTLID
FT DWAAEIGLIVKTNREDYLDPETILLLMEHKTSGIAKELIRNTRWNRTTGDIIEQVIDAM
FT YTMFLGLNYSDNKVAEKIEEQEKAKIRMTKLQLCDICYLEEFTCDYEKNMYKTELADFP
FT GYINQYLSKIPIIGEKALTRFRHEANGTSIYSLGFAAKIVKEELSKICDLTKKQKKLKK
FT FNKKCCSIGEASVEYGCKKTSKKKYHKRYKKKYKAYKPYKKKKKFRSGKYFKPKEKKGS
FT KQKYCPKGKKDCRCWICNIEGHYANECPNRQSSEKAHILQQAEKLGLQPIEEPYEGVQE
FT VFILEYKEEEEETSTEEDDGSSTSEDSDSESD"
FT CDS 3260. .3583
FT /codon_start=1
FT /db_xref="SPTREMBL:Q83164"
FT /note="ORF8; putative"
FT /protein_id="AAA46349.1"
FT /translation="MDARRHPRRSIIKDTRKNIRLINLIRRRRNSGQENTSSPKKRRAL
FT SKSIAQRARKTADVGSAISKAITPTNVLIDKAQRRLTSFNKQRNWVSSPSKNPTKEFKK
FT YSS"
FT CDS 3623. .5650
FT /codon_start=1
FT /db_xref="SWISS-PROT:P03556"
FT /note="ORF5; putative"
FT /protein_id="AAA46350.1"
FT /translation="MMDHLLQKTQIQNQTEQVMNITNPNSIYIKGRLYFKGYKKIELHC
FT FVDTGASLCIASKFVIPEEHWINAERPIMVKIADGSSITINKVCRDIDLIIAGEIFHIP
FT TVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKDRTYPVHIAKLTRAVRVGTEGFLE
FT SMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLD
FT PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAP
FT AFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFHCNSG
FT FWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCCVYVD
FT DILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGH
FT ILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT
FT LYMQKVKKNLQAFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS
FT GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL
FT GRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREFNRVNS"
FT misc_feature 5651. .5753
FT /note="small intergenic region"
FT mRNA 5679. .>8016
FT /note="major viral mRNA"
FT CDS 5754. .7322
FT /codon_start=1
FT /db_xref="SWISS-PROT:P03557"
FT /note="major virus-specific in vitro translation product;
FT gene VI)"
FT /product="inclusion body protein"
FT /protein_id="AAA46351.1"
FT /translation="MENIEKLLMQEKILMLELDLVRAKISLARANGSSQQGELSLHRET
FT PEKEVAVHSALVTFTPTQVKAIPEQTAPGKESTNPLMASILPKDMNPVQTGTRLAVPSD
FT FLRPHQGIPIPQKSELSSTVVPLRAESGIQHPHINYYVVYNGPHAGIYDDWGCTKAATN
FT GVPGVAHKKFATITEARAAADAYTTRQQTDRLNFIPKGEAQLKPKSFAEALTSPPKQKA
FT HWLTLGTKKPSSDPAPKEISFAPEITMDDFLYLYDLVRKFDGEGDDTMFTTDNEKISLF
FT NFRKNANPQMVREAYAAGLIKTIYPSNNLQEIKYLPKKVKDAVKRFRTNCIKNTEKDIF
FT LKIRSTIPVWTIQGLLHKPRQVIEIGVSKKVIPTESKAMESRIQIEDLTELAVKTGEQF
FT IQSLLRLNDKKKIFVNMVEHDTLVYSKNIKETDSEDQRAIETFQQRVISGNLLGFHCPA
FT ICHFIMKTVEKEGGAYKCHHCDKGKAIVQDASADEGTTDKSGPPPTRSIVEKEDVPNTS
FT SKQVD"
FT misc_feature join(7323. .8016,1. .364)
FT /note="long intergenic region"
FT mRNA 7419. .>8016
FT /note="major viral mRNA"
XX
SQ Sequence 8016 BP; 2939 A; 1653 C; 1562 G; 1862 T; 0 other;
M10376 Length: 8016 December 21, 2001 15:47 Type: N Check: 644 ..
1 ggtatcagag ccatgaatag gtctatgacc aaaactcaag aggataaaac
51 ctcaccaaaa taccaaagag ttcttaactc taaaaataaa agatctttca
101 agatcaaaaa tagttccctc acaccggtga ccgacaggtt taccaccgta
151 aggtttcaga acaacatcga atgcgtttac gccaacttcg actctcagct
201 caagtcgtcg tacgatggta gatctaaaaa gatcaagact ctaagcctta
251 aaaatcttag atgttacgaa accttcctca ggaagtacct tttggaacaa
301 taaaatctct ctgagaatag tactctattg agtatccaca gaaaaaataa
351 tcttctgtgt tgagatggat ttgtatccag aagaaaacac ccaaagcgag
401 caatcgcaaa attctgaaaa taatatgcaa atatttaaat cagaaacttc
451 ggatggattc tcctccgatt taaagatctc aaacgatcaa ttaaaaaata
501 tctcaaaaac ccaattaact ttggaaaaag aaaagatatt taagatgcct
551 aacgttttat ctcaagttat gaaaaaagcg tttagcagga aaaacgagat
601 tctctactgc gtctcgacaa aagaattatc ggtggacatt catgatgcca
651 caggtaaggt atatcttcct ttaatcacta aagaggaaat taataaaaga
701 ctttccagct taaaacctga agtcagaaga accatgtcca tggtccattt
751 gggcgcggtc aaaatattgc ttaaagctca atttagaaat gggattgata
801 ccccaatcaa aattgcttta atcgatgata gaatcaattc tagaagagat
851 tgtcttcttg gtgcagccaa aggtaatctc gcatacggta agtttatgtt
901 tactgtatac cccaagtttg gaataagcct taatacccaa agacttaacc
951 aaaccttaag ccttattcat gattttgaga ataaaaatct tatgaataaa
1001 ggtgataaag ttatgaccat aacctatatc gtaggatatg cattaacaaa
1051 tagtcatcat agcatagatt atcaatcgaa tgctacaatt gaactagaag
1101 acgtatttca agaaattgga aatatccagc aatctgagtt ctgtacaata
1151 cagaatgatg aatgcaattg ggccattgat atagcccaaa acaaagcctt
1201 attaggagct aaaaccaaaa cccaaattgg taatagtctt caaataggaa
1251 atattgcatc atcctctagt actgaaaatg aattagctag ggtgagccaa
1301 aacatagatc ttttaaaaaa taaattaaaa gaaatctgtg gagaatgagc
1351 ataacgggtc aaccgcatgt ttataaaaaa gatactatta ttagactaaa
1401 accattgtct cttaatagta ataatagaag ttatgttttt agttcctcaa
1451 aagggaacat tcaaaatata attaatcatc ttaacaacct caataagatt
1501 gtaggaagaa gcttactcgg aatatggaag atcaactcat acttcggact
1551 aagcaaagac ccttcggagt ccaaatcgaa aaacccgtca gtttttaata
1601 ctgcaaaaac catttttaag agtggggggg ttgattactc gagccaacca
1651 aaggaaataa aatccctttt agaagctcaa aatactagaa ttaaaagtct
1701 agaaaaagca attcaatcct tagatgaaaa gattgaacca gagcccttaa
1751 ctaaagaaga agttaaagag cttaaagaat cgattaactc gatcaaagaa
1801 ggattaaaga atattattgg ctgaaatggc taatcttaat caaatccaaa
1851 aagaagtctc tgaaatcctc agtgaccaaa aatccatgaa agcggatata
1901 aaagctatct tagaattatt aggatcccaa aatcctatta aagaaagctt
1951 agaaaccgtt gcagcgaaaa tcgttaatga cttaaccaag ctcatcaatg
2001 attgtccttg taacaaagag atattagaag ccttaggcaa ccaacctaaa
2051 gagcaactaa taggacaacc taaagaaaaa ggcaaaggcc ttaatcttgg
2101 aaaatactct taccccaatt acggagtagg aaatgaagaa ttaggatcct
2151 ctggaaaccc taaagcttta acctggccct tcaaagctcc agcaggatgg
2201 ccgaatcaat attagaccga actattaata ggttctggta taaactggga
2251 gatgattgtc tctcagaaag tcaatttgac cttatgataa ggttaatgga
2301 agagtccctt gacggggacc aaattattga tctaacctct ctacctagtg
2351 acaatttgca ggttgaacag gttatgacaa caaccgaaga ctcgatctcg
2401 gaagaagaat cagaattcct tctagcaata ggagaaacgt ctgaagaaga
2451 aagcgattca ggagaagaac ctgaattcga acaagttcga atggatcgaa
2501 caggaggaac ggagattccc aaagaagaag atggcggaga accatctaga
2551 tataatgaga gaaagagaaa gaccactgaa gatcggtact ttccaactca
2601 accaaagacc attccaggcc aaaagcaaac gaccatggga atgctcaaca
2651 ttgactgcca agccaatcgg agaactctaa tcgacgattg ggcagcagaa
2701 atcggattga tagtcaagac caatagagaa gactatcttg atccagaaac
2751 aatcctactt ctgatggaac ataaaacatc aggaatagcc aaggagttaa
2801 tccgaaacac aagatggaac cgcactaccg gcgacatcat agaacaggtg
2851 atcgatgcaa tgtacaccat gttcctagga cttaactact ccgacaacaa
2901 ggtcgccgag aagatcgaag agcaagagaa ggccaaaatc agaatgacca
2951 agcttcagct ctgcgacatc tgctaccttg aagaatttac atgtgattat
3001 gagaagaaca tgtacaagac agaactggcg gatttcccag gatatatcaa
3051 ccagtacctg tcaaaaatcc ccatcattgg agaaaaagcg ttaacacgct
3101 ttaggcatga agccaacgga accagcatct acagtttagg tttcgcggca
3151 aagatagtaa aagaagaact atctaaaatc tgcgacttga ccaagaagca
3201 gaagaagttg aagaaattca acaagaagtg ctgtagcatc ggagaagctt
3251 cagtagaata tggatgcaag aagacatcca agaagaagta tcataaaaga
3301 tacaagaaaa aatataaggc ttataaacct tataagaaga agaagaaatt
3351 ccggtcagga aaatacttca agcccaaaga aaagaagggc tctaagcaaa
3401 agtattgccc aaagggcaag aaagactgca gatgttggat ctgcaatatc
3451 gaaggccatt acgccaacga atgtcctaat cgacaaagct cagagaaggc
3501 tcacatcctt caacaagcag agaaactggg tctccagccc atcgaagaac
3551 cctacgaagg agttcaagaa gtattcatcc tagaatacaa agaagaggaa
3601 gaagaaacct ctacagaaga agatgatgga tcatctactt cagaagactc
3651 agattcagaa tcagactgag caggtgatga acatcaccaa tcccaattcg
3701 atctacatca agggaagact ctacttcaag ggatacaaga agatagagct
3751 tcactgtttt gtagacacgg gagcaagttt atgcatagca tccaagttcg
3801 tcataccaga agaacattgg atcaatgcag aaagaccaat catggtcaaa
3851 attgcagatg gaagttcgat caccatcaac aaagtctgca gagacattga
3901 cctgatcata gccggagaaa tattccatat tcccaccgtc tatcaacagg
3951 aaagtggaat cgatttcatc atcggcaaca acttctgtca gttgtatgaa
4001 cctttcatac aatttacaga tagagttatc ttcacaaagg acagaacata
4051 ccctgttcat attgcgaagc taacaagagc agtgcgagta ggcacagaag
4101 gattcctaga atccatgaag aaacgttcaa agactcagca accggagcct
4151 gtgaacattt caacaaacaa aattgctatt ctttcagagg ggaggaggtt
4201 atcagaagaa aaacttttca tcactcagca aagaatgcaa aaaatcgaag
4251 aactacttga gaaagtatgt tcagaaaatc cattagatcc taacaagact
4301 aagcaatgga tgaaagcttc aatcaagctc agcgacccaa gcaaagctat
4351 caaggttaaa cccatgaagt atagcccaat ggatcgtgaa gaatttgata
4401 agcaaatcaa agaattactg gatctaaaag tcatcaagcc cagtaaaagc
4451 cctcacatgg caccagcctt cttggtcaac aatgaagccg agaagcgaag
4501 aggaaagaaa cgtatggtag tcaactacaa agctatgaac aaagccactg
4551 taggagacgc ttacaatcct cccaacaaag acgagttact tacactcatt
4601 cgaggaaaga agatcttttc ttccttccac tgtaactcag gattctggca
4651 ggttctgcta gatcaagaat caagacctct aacggcattc acatgtcccc
4701 aaggtcacta tgaatggaat gtggtacctt tcggcttaaa gcaagctcca
4751 tccatattcc aaagacacat ggacgaagct ttccgtgtgt tcagaaagtt
4801 ctgttgcgtt tatgtcgacg acattctcgt attcagtaac aatgaagaag
4851 atcacctact tcacgtagca atgatcttac aaaagtgcaa tcaacatgga
4901 attatccttt ccaagaagaa agcacaactc ttcaagaaga agataaactt
4951 ccttggtcta gaaatagatg aaggaacaca caagcctcaa ggacacatct
5001 tggaacatat caacaaattc ccagataccc ttgaagataa gaagcaactt
5051 cagagattct taggcatact cacatatgcc tcagattata ttccgaagct
5101 agcgcaaatc agaaagcctc tgcaagccaa gcttaaggag aacgttccat
5151 ggaaatggac aaaagaggac accctctaca tgcaaaaggt gaagaaaaat
5201 ctgcaagcat ttcctccact acatcatccc ttaccagaag agaagttgat
5251 tatcgagacc gacgcatcag atgactactg gggaggtatg ttaaaagcta
5301 tcaaaattaa cgaaggtact aatactgagt taatttgcag atacgcatct
5351 ggaagcttta aagctgcaga aaagaattac cacagcaatg acaaagagac
5401 actggcggta ataaatacta taaagaaatt tagtatttat ctaactcctg
5451 ttcattttct gatcagaaca gataatactc atttcaagag ttttgttaat
5501 ctcaattaca aaggagattc gaaacttgga agaaacatca gatggcaagc
5551 atggcttagc cattattcat ttgatgttga acacattaaa ggaaccgaca
5601 accactttgc ggacttcctt tcaagagaat tcaatagggt taattcctaa
5651 ttgaaatccg aagataagat tcccacacac ttgtggctga tatcaaaagg
5701 ctactgccta tataaacaca tctctggaga ctgagaaaat cagacctcca
5751 agcatggaga acatagaaaa actcctcatg caagagaaaa tactaatgct
5801 agagctcgat ctagtaagag caaaaataag cttagcaaga gctaacggct
5851 cttcgcaaca aggagaactc tctctccacc gtgaaacacc ggaaaaagaa
5901 gtagcagttc attctgcact ggtcactttt acgccaactc aagtaaaggc
5951 tattccagag caaacggctc ctggtaaaga atcaacaaat ccgttgatgg
6001 ctagtatctt gccaaaagat atgaacccag ttcagactgg gacaaggcta
6051 gcagtgccat cggacttttt acgtcctcat cagggaattc caatcccaca
6101 aaaatctgag cttagcagca cagttgttcc tctcagagca gaatcgggta
6151 ttcaacaccc tcatatcaac tactacgttg tgtataacgg tccacatgcc
6201 ggtatatacg atgactgggg ttgtacaaag gcagcaacaa acggcgtccc
6251 cggagttgcg cataagaagt ttgccactat tacagaggca agagcagcag
6301 ctgacgcgta tacaacaaga cagcaaacag ataggttgaa ctttatcccc
6351 aaaggagaag ctcaactcaa gcccaagagc tttgctgagg ccttaacaag
6401 cccaccaaag caaaaagccc actggctcac gctaggaacc aaaaagccca
6451 gcagtgatcc agccccaaaa gagatctcct ttgccccgga gatcacaatg
6501 gacgacttcc tctatctcta tgatctagtc aggaagttcg acggagaagg
6551 tgacgatacc atgttcacca ctgacaatga gaagattagc ctcttcaatt
6601 tcagaaagaa cgctaaccca cagatggtta gagaggccta cgcagcagga
6651 ctcattaaga cgatctaccc gagcaataat ctccaggaga tcaaatacct
6701 tcccaagaag gttaaagatg cagtcaaaag attcaggact aactgcatca
6751 agaacacaga gaaagatata tttctcaaga tcagaagtac tattccagta
6801 tggacgattc aaggcttgct tcacaaacca aggcaagtaa tagagattgg
6851 agtctctaaa aaggtaattc ctacagaatc aaaggccatg gagtcaagga
6901 ttcaaattga ggatctaaca gaactcgccg tgaagactgg cgaacagttc
6951 atacagagtc tcttacgact caatgacaag aagaaaatct tcgtcaacat
7001 ggtggagcac gacactctcg tctactccaa gaatatcaag gaaacagact
7051 cagaagacca aagggcaatt gagactttcc aacaaagggt aatttcggga
7101 aacctcctcg gattccattg cccagctatc tgtcacttca tcatgaagac
7151 agtagaaaag gaaggtggcg cctacaaatg tcaccattgc gataaaggaa
7201 aggctatcgt tcaagatgcc tctgccgacg aagggaccac agacaaaagt
7251 ggacctccac ccacgaggag catcgtagaa aaagaagacg ttcccaacac
7301 gtcttcaaag caagtggatt gatgtgatat ctccactgac gtaagggatg
7351 acgcacaatc ccactatcct tcgcaagacc cttcctctat ataaggaagt
7401 tcatttcatt tggagaggac acgctgaaat caccagtctc tctctacaac
7451 tctctctctc tctacatttc cataataatg tgtgagtagt tcccagataa
7501 gggaattagg gttcttatag ggtttcgctc atgtgttgag catataagaa
7551 acccttagta tgtatttgta tttgtaaaat acttctatca ataaaatttc
7601 taattcctaa aaccaaaatc cagtactaaa atccagatct cctaaagtcc
7651 ctatagatct ttgtggtgaa tataaaccag acacgagacg actaaacctg
7701 gagcccagac gccgtttgaa gctagaagta ccgcttaggc aggaggccgt
7751 tagggaaaag atgctaaggc agggttggtt acgttgactc ccccgtaggt
7801 ttggtttaaa tatcatgaag tggacggaag gaaggaggaa gacaaggaag
7851 gataaggttg caggccctgt gtaaggtaag acgatggaaa tttgatagag
7901 gtacgctact atacttatac tatatgctaa gggaatgctt gtatttaccc
7951 tatataccct aataacccct tatcgattta aagaaataat ccgcataagc
8001 ccccgcttaa aaaatt
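
The record above can be sanity-checked with a few lines of code. The following is a minimal sketch, not part of the database entry: it assumes the CDS locations and the SQ base counts are copied by hand from the FT and SQ lines, and the helper names (parse_span, span_length, check_record) are hypothetical. It accepts both the GCG-style "start. .end" separator used in this listing and the usual "start..end" form, and it handles only simple spans, not join(...) or partial (">") locations.

```python
import re

# CDS locations copied from the FT lines of this record
# (GCG-style ". ." separator, as printed in the listing).
CDS_LOCATIONS = [
    "13. .303", "365. .1348", "1345. .1824", "1826. .2215",
    "2197. .3669", "3260. .3583", "3623. .5650", "5754. .7322",
]

# Base counts and total length copied from the SQ line.
SQ_COUNTS = {"A": 2939, "C": 1653, "G": 1562, "T": 1862, "other": 0}
GENOME_LENGTH = 8016


def parse_span(location):
    """Return (start, end) for a simple 'start. .end' or 'start..end' location."""
    match = re.fullmatch(r"(\d+)\.\s*\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(location):
    """Length in bases of a simple span, counting both end points."""
    start, end = parse_span(location)
    return end - start + 1


def check_record():
    """Return a list of human-readable problems; empty if all checks pass."""
    problems = []
    # The per-base counts on the SQ line should add up to the stated length.
    if sum(SQ_COUNTS.values()) != GENOME_LENGTH:
        problems.append("base counts do not sum to the genome length")
    for location in CDS_LOCATIONS:
        start, end = parse_span(location)
        # Every CDS should lie inside the genome coordinates.
        if not 1 <= start <= end <= GENOME_LENGTH:
            problems.append(f"{location}: outside 1..{GENOME_LENGTH}")
        # A complete CDS with codon_start=1 should span whole codons.
        if span_length(location) % 3 != 0:
            problems.append(f"{location}: length is not a multiple of 3")
    return problems


if __name__ == "__main__":
    for location in CDS_LOCATIONS:
        print(location, span_length(location))
    print(check_record() or "all checks passed")
```

These checks use only the coordinates and counts printed in the record. Comparing each /translation against the nucleotide sequence itself would need a translation table and is left out of this sketch.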