Sequence of DPV Cauliflower mosaic virus
Cauliflower mosaic virus isolate NY8153, complete genome.
ACC No: M90541
Dated: 2010-02-05 | Length: 8030 | CRC: -1386710975
ID M90541; SV 1; circular; genomic DNA; STD; VRL; 8030 BP.
XX
AC M90541;
XX
DT 18-JUN-1992 (Rel. 32, Created)
DT 05-FEB-2010 (Rel. 103, Last updated, Version 10)
XX
DE Cauliflower mosaic virus isolate NY8153, complete genome.
XX
KW aphid transmission protein; capsid protein; DNA-binding protein;
KW inclusion body matrix protein; movement protein; reverse transcriptase.
XX
OS Cauliflower mosaic virus
OC Viruses; Retro-transcribing viruses; Caulimoviridae; Caulimovirus.
XX
RN [1]
RX DOI; 10.1016/0042-6822(90)90538-3.
RX PUBMED; 2371775.
RA Vaden V.R., Melcher U.;
RT "Recombination sites in cauliflower mosaic virus DNAs: implications for
RT mechanisms of recombination";
RL Virology 177(2):717-726(1990).
XX
RN [2]
RP 1-8030
RX PUBMED; 16653000.
RA Chenault K.D., Steffens D.L., Melcher U.;
RT "Nucleotide Sequence of Cauliflower Mosaic Virus Isolate NY8153";
RL Plant Physiol. 100(1):542-545(1992).
XX
DR EPD; EP07015; CAMV_35MJ.
XX
CC Original source text: Cauliflower mosaic virus (individual_isolate
CC NY8153) DNA.
CC Virology 177, 717-726 (1990) reports basepairs 1-96, 624-697,
CC 1474-1705, and 7950-8030.
XX
FH Key Location/Qualifiers
FH
FT source 1. .8030
FT /organism="Cauliflower mosaic virus"
FT /isolate="NY8153"
FT /mol_type="genomic DNA"
FT /db_xref="taxon:10641"
FT CDS 364. .1347
FT /codon_start=1
FT /product="movement protein"
FT /note="ORF 1"
FT /db_xref="GOA:Q00966"
FT /db_xref="InterPro:IPR001022"
FT /db_xref="UniProtKB/Swiss-Prot:Q00966"
FT /protein_id="AAA46354.1"
FT /translation="MDLYPEEKTQSKQSHNSENNMQIFKSENSDGFSSDLMISNDQLKN
FT ISKTQLTLEKEKIFKMPNVLSQVMKKAFSRKNEILYCVSTKELSVDIHDATGKVYLPLI
FT TKEEINKRLSSLKPEVRKTMSMVHLGAVKILLKAQFRNGIDTPIKIALIDDRINSRRDC
FT LLGAAKGNLAYGKFMFTVYPKFGISLNTQRLNQTLSLIHDFENKNLMNKGDKVMTITYI
FT VGYALTNSHHSIDYQSNATIELEDVFQEIGNVQQCDFCTIQNDECNWAIDIAQNKALLG
FT AKTQSQIGNSLQIGNSASSSNTENELARVSQNIDLLKNKLKEICGE"
FT CDS 1349. .1828
FT /codon_start=1
FT /product="aphid transmission protein"
FT /note="ORF 2"
FT /db_xref="GOA:Q00965"
FT /db_xref="InterPro:IPR004917"
FT /db_xref="UniProtKB/Swiss-Prot:Q00965"
FT /protein_id="AAA46355.1"
FT /translation="MSITGQPHVYKKDTIIRLKPLSLNSNNRSYVFSSSKGNIQNIINH
FT LNNLNEIVGRSLLGIWKINSYFGLSKDPSESKSKNPSVFNTAKTIFKSGGVDYSSQLKE
FT IKSLLEAQNTRIKSLENAIQSLDNKIEPEPLTKEEVKELKESINSIKEGLKNIIG"
FT CDS 1830. .2219
FT /codon_start=1
FT /product="DNA-binding protein"
FT /note="ORF 3"
FT /db_xref="GOA:Q00967"
FT /db_xref="InterPro:IPR004986"
FT /db_xref="UniProtKB/Swiss-Prot:Q00967"
FT /protein_id="AAA46356.1"
FT /translation="MANLNQIQKEVSEILSDQKSMKSDIKAILEMLGSQNPIKESLEAV
FT AAKIVNDLTKLINDCPCNKEILEALGNQPKEQLIEQPKEKGKGLNLGKYSYPNYGVGNE
FT ELGSSGNPKALTWPFKAPAGWPNQF"
FT CDS 2201. .3667
FT /codon_start=1
FT /product="capsid protein"
FT /note="ORF 4"
FT /db_xref="GOA:Q00956"
FT /db_xref="InterPro:IPR001878"
FT /db_xref="InterPro:IPR001988"
FT /db_xref="UniProtKB/Swiss-Prot:Q00956"
FT /protein_id="AAA46357.1"
FT /translation="MAESILDRTINRFWYNLGEDCLSESQFDLMIRLMEESLSGDQIID
FT LTSLPSDNLQVEQVMTTTEDSISEESEFLLAIGETSEDESDSGEEPEFEQVRMDRTGGT
FT EIPKEEDGEPSRYNERKRKTTEDRYFPTQPKTIPRQKQTSMGMLNIDCQTNRRTLIDDW
FT AAEIGLIVKTNREDYLNPETILLLMEHKTSGIAKELIRNTRWNRTTGDIIEQVIDRMYT
FT MFLGLNYSDNKVAEKIDEQEKAKIRMTKLQLCDICYLEEFTCDYEKNMYKTELADFPGY
FT INQYLSKIPIIGEKALTRFRHEANGTSIYSLGFERKICKEELSKIRDLSKNEKKLKKFN
FT KKCCSIEEASAEYGCKKTSTKKYHKKRYKKKYKAYKPYKKKKKFRSGKYFKPKEKKGSK
FT QKYCPKGKKDCRCWICNIEGHYANECPNRQSSEKAHILQQAEKVGLQPIEAPYEGVQEV
FT FILEYKEEEEETSTEESDDESSTSEDSDSD"
FT CDS 3627. .5669
FT /codon_start=1
FT /product="reverse transcriptase"
FT /note="ORF 5"
FT /db_xref="GOA:Q00962"
FT /db_xref="InterPro:IPR000477"
FT /db_xref="InterPro:IPR000588"
FT /db_xref="UniProtKB/Swiss-Prot:Q00962"
FT /protein_id="AAA46358.1"
FT /translation="MMNHLLLKTQTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFV
FT DTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVIFKIPTV
FT YQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESM
FT KKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKTEELLEKVCS
FT ENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS
FT PHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSF
FT DCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFC
FT CVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTH
FT KPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKW
FT TKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELI
FT CRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYK
FT GDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREFNKVNS"
FT CDS 5773. .7335
FT /codon_start=1
FT /product="inclusion body matrix protein"
FT /note="ORF 6"
FT /db_xref="GOA:Q00957"
FT /db_xref="InterPro:IPR002609"
FT /db_xref="InterPro:IPR009027"
FT /db_xref="InterPro:IPR011320"
FT /db_xref="UniProtKB/Swiss-Prot:Q00957"
FT /protein_id="AAA46359.1"
FT /translation="MENIEKLLMQEKILMLELDLVRAKISLARANGSSQQGDLPLHRET
FT PEKEEAVHSALATFTPTQVKAIPEQTAPGKESTNPLMASILPKDMNPVQTGIRLAVPGD
FT FLRPHQGIPIPQKSELSSTVAPLRAESGIQHPHINYYVVYNGPHAGIYDDWGCTKAATN
FT GVPGVAHKKFATITEARAAADAYTTSTQTDRLNFIPKGEAQLKPKSFAEALTSPPKQKA
FT HWLTLGTKRPSSDPAPKEISFAPEITMDDFLYLYHLGRKFDGEGDDTIFTTDNEKISLF
FT NFRKNADPQMVREAYAAGLIKTIYPSNNLQEIKYLPKKVKDAVKRFRTNCIKNTEKDIF
FT LKIRSTIPVWTIQGLVHKPRQVIEIGVSKKVVPTESKAMESKIQIEDLTELAVKTGGQF
FT IQSLLRLNDKKKIFVNMVEHDTLVYSKNIKDTVSEDQRAIETFQQRVISGNLLGFHCPS
FT ICHFMERTVEKEGGSYKVHHCDKGKAIVQDASADSGPKDGPPPTRSIVEKEDVPTTSSK
FT QVD"
XX
SQ Sequence 8030 BP; 2934 A; 1653 C; 1567 G; 1876 T; 0 other;
m90541 Length: 8030 05-FEB-2010 Type: N Check: 5888 ..
1 ggtatcagag ccatgaatcg gtttaaagac caaactcaag agggtaaaac
51 ctcatcaaaa tacgaaagag ttcttaactc taaagataaa agatctttca
101 agattaaaac tagttccctc acaccggtga ccgacaggtt taccaccgta
151 aggtttcaga acaacatcga atgcgtttac gccaacttcg actctcagct
201 caagtcgtcg tacgatggta gatctaaaaa gatcaagaat ctaagcctta
251 aaaatcttag atgtcacgaa gccttcctca ggaagtacct tctggaacaa
301 taaatctctc tgagaatagt actctattga gtatccacag ataaaataat
351 cttctgtgtt gagatggatt tgtatccaga agaaaagacc caaagcaagc
401 aatcgcataa ttctgaaaat aatatgcaaa tatttaaatc agaaaattcg
451 gatggattct cctccgatct aatgatctca aacgatcaat taaaaaatat
501 ctctaaaacc caattaactt tggaaaaaga aaagatattt aaaatgccta
551 acgttttatc tcaagttatg aaaaaagcgt ttagcaggaa aaacgagatt
601 ctctactgcg tctcgacaaa agaattatca gtggacattc acgatgccac
651 aggtaaggta tatcttcctt taatcactaa agaggagata aataaaagac
701 tttccagttt aaaacctgaa gtcagaaaga ccatgtccat ggttcatctt
751 ggagcggtca aaatattgct taaagctcaa tttcgaaatg ggattgatac
801 cccaatcaaa attgctttaa tcgatgatag aattaattct agaagagatt
851 gccttctcgg tgcagccaaa ggtaatctag catacggtaa gtttatgttt
901 actgtatacc ccaagtttgg aataagcctt aatacccaaa gacttaacca
951 aaccctaagc cttattcatg attttgaaaa taaaaatctt atgaataaag
1001 gtgataaagt tatgaccata acctatatcg taggatatgc attaactaat
1051 agtcatcata gcatagatta tcaatcgaat gctacaattg aactagaaga
1101 cgtatttcaa gaaattggaa atgtccagca atgtgatttc tgtacaatac
1151 agaatgacga atgtaattgg gccattgata tagcccaaaa caaagcctta
1201 ttaggagcta aaacccaatc ccaaattggt aatagtcttc aaataggaaa
1251 cagtgcttca tcctctaata ctgaaaatga attagctagg gtaagccaaa
1301 acatagatct tttaaagaat aaattaaaag aaatctgtgg agaataaaat
1351 gagcattacg ggtcaaccgc atgtttataa aaaggatact attattagac
1401 taaaaccatt gtctcttaat agtaataata gaagttatgt ttttagttcc
1451 tcaaaaggga acattcaaaa tataattaat catcttaaca acctcaatga
1501 gattgtagga agaagcttac tcggaatatg gaagatcaac tcatacttcg
1551 gactaagcaa agacccttcg gagtccaaat caaaaaaccc gtcagttttt
1601 aatactgcaa aaaccatttt taagagtggg ggggttgatt actcgagcca
1651 attaaaggaa ataaaatccc ttttagaagc tcaaaacact agaattaaaa
1701 gtctagaaaa tgcaattcaa tccttagata ataagattga accagagccc
1751 ttaactaaag aagaagttaa agagctaaaa gaatcgatta actcgatcaa
1801 agaaggatta aagaatatta ttggctgaaa tggctaatct taatcaaatc
1851 caaaaagaag tctctgaaat cctcagtgac caaaaatcca tgaaatcgga
1901 tataaaagct atcttagaaa tgctaggatc ccaaaatcct attaaagaaa
1951 gcttagaagc cgttgcagcg aaaatcgtta atgacttaac caagctcatc
2001 aatgattgtc cttgtaacaa agaaatatta gaagccttag gcaatcagcc
2051 taaagagcaa ctaatagaac aacctaaaga aaaaggcaaa ggtcttaatc
2101 taggaaaata ctcttacccc aattacggtg taggaaatga agaattagga
2151 tcctctggaa accctaaagc tttaacctgg cccttcaaag ctccagcagg
2201 atggccgaat caattttaga cagaaccatt aataggtttt ggtataatct
2251 gggagaagat tgtctctcag aaagtcaatt tgaccttatg ataaggttaa
2301 tggaagagtc cttgagcggg gaccaaatta ttgatctaac ctctctacct
2351 agtgataatt tgcaggtcga acaggttatg acaactaccg aagactcgat
2401 ctcggaagaa tcagaattcc ttctagcaat aggagaaaca tctgaagacg
2451 aaagcgattc aggagaagaa cctgaattcg aacaagttcg aatggatcga
2501 acaggaggaa cggagattcc caaagaagaa gatggtgaac catctagata
2551 caatgagaga aagagaaaga ccacggagga ccggtacttt ccaactcaac
2601 caaagaccat tccaagacaa aagcaaacgt ctatgggaat gctcaacatt
2651 gactgccaaa ccaatcgaag aaccttaatc gatgattggg cagcagaaat
2701 cggactgata gtcaagacca atagagaaga ctatctgaat ccagaaacaa
2751 tactactctt gatggaacac aaaacatcag gaatagccaa ggagttaatc
2801 cgaaatacaa gatggaaccg tactaccggc gatatcatag aacaggtgat
2851 cgatcggatg tacaccatgt tcttaggact taactactcc gacaacaagg
2901 ttgctgaaaa gatagacgag caagagaagg ccaagatcag aatgaccaaa
2951 ctccagctct gcgacatctg ctaccttgaa gaatttacat gtgattatga
3001 aaagaacatg tacaagacgg aactggcgga tttcccagga tatatcaacc
3051 agtacctgtc aaaaatcccc atcataggag aaaaagcgct aacacgcttt
3101 aggcatgaag ccaacggaac cagcatctac agcttaggtt tcgagcgaaa
3151 gatatgcaaa gaagaactat ctaaaattcg cgacttatcc aagaacgaga
3201 agaagttgaa gaaattcaac aagaagtgct gcagcatcga agaagcttca
3251 gcagaatatg gatgtaagaa gacatctacc aaaaagtatc acaagaagcg
3301 atacaagaaa aaatataagg cttataaacc ttataagaag aagaagaaat
3351 tccgatccgg aaaatacttc aagcccaaag agaagaaggg ctcaaagcaa
3401 aagtattgcc caaaaggcaa gaaagactgc aggtgttgga tctgcaatat
3451 cgaaggtcat tacgccaacg aatgtcctaa tcgacaaagc tcggaaaagg
3501 ctcacatcct tcaacaagca gaaaaagttg gcctccagcc cattgaagct
3551 ccctatgaag gagttcaaga agtattcatc ttagaataca aagaagagga
3601 agaagaaacc tctacagaag aaagcgatga tgaatcatct acttctgaag
3651 actcagactc agactgagca ggtgatgaac gtcaccaatc ccaattcgat
3701 ctacatcaag ggcagactct acttcaaggg atacaagaag atagagcttc
3751 actgttttgt agacacggga gcaagcttat gcatagcatc caagttcgtc
3801 attccagaag aacattgggt caatgcagaa agaccaataa tggtcaaaat
3851 agcagatgga agctcaatca ccatcagcaa agtctgcaaa gacatagact
3901 tgatcatagt cggcgtgata ttcaaaattc ccaccgtcta tcagcaagaa
3951 agtggcatcg atttcataat cggcaacaac ttctgtcagc tatatgaacc
4001 attcatacag tttacggata gagttatctt cacaaagaac aagtcttatc
4051 ctgttcatat tgcgaagcta accagagcag tgcgagtagg caccgaagga
4101 tttcttgaat caatgaagaa acgttcaaag actcaacaac ctgagccggt
4151 gaacatttcg acaaacaaga tagaaaatcc gctagaagaa attgctattc
4201 tttcagaggg gaggaggtta tcagaagaaa aactcttcat cactcaacaa
4251 agaatgcaaa aaaccgaaga actacttgag aaagtatgtt cagaaaatcc
4301 attagatcct aacaagacta agcaatggat gaaagcttca atcaagctca
4351 gcgacccaag caaagctatc aaggttaaac ccatgaagta tagcccaatg
4401 gatcgtgaag aatttgacaa gcaaatcaaa gagttactgg accttaaagt
4451 cattaagccc agtaaaagcc ctcacatggc accagccttc ttggtcaaca
4501 atgaagccga gaacggaaga ggaaacaaac gtatggtagt gaactacaaa
4551 gctatgaata aagccaccgt aggagacgca tacaatcttc ccaacaaaga
4601 cgagttactt acactcattc gaggaaagaa gatcttttct tccttcgact
4651 gtaagtcagg attctggcaa gttctgcttg atcaagaatc aagacctcta
4701 acggcgttca catgtccaca aggtcactac gaatggaatg tggtcccttt
4751 cggcctaaag caggcaccat ccatattcca gagacacatg gacgaagcat
4801 ttcgtgtgtt cagaaagttc tgttgcgttt atgtcgacga cattgtcgta
4851 ttcagtaaca acgaagaaga tcatctactt cacgtagcaa tgatcttaca
4901 aaagtgcaat cagcatggaa ttatcctttc caagaagaaa gcacaactct
4951 tcaagaagaa gataaacttc cttggtctag aaatagatga aggaacacat
5001 aagcctcaag gacatatttt ggaacatatc aacaagttcc cagataccct
5051 tgaagacaag aagcaacttc agagattctt aggcatccta acatatgcct
5101 ctgattatat cccgaatcta gctcaaatga gacagcctct gcaagccaag
5151 cttaaagaaa atgttccatg gaaatggaca aaagaggaca ccctctacat
5201 gcaaaaggtg aagaaaaatc tgcaaggatt tcctccacta catcatccct
5251 taccagaaga gaagctgatc atcgaaaccg atgcatcaga cgactactgg
5301 ggaggtatgt taaaagctat caaaattaac gaaggtacta atactgagtt
5351 aatttgcaga taccgatctg gaagctttaa ggctgcagaa aggaattacc
5401 acagcaatga caaagagaca ttggcggtaa taaatactat aaagaaattc
5451 agtatttatc taactcctgt tcattttctg atcaggacag ataatactca
5501 tttcaagagt tttgttaatc tcaattacaa aggtgattca aaacttggaa
5551 gaaacatcag atggcaagca tggcttagcc actattcatt tgatgttgaa
5601 catattaaag gaaccgacaa ccactttgcg gacttccttt caagagaatt
5651 caataaggtt aattcctaat tgaaatccga agataagatt cccacacact
5701 tgtggctgat atcaaaaggc tactgcctat ataaacacat ctctggagac
5751 tgagaaaatc agacctccaa gcatggagaa catagaaaaa ctcctcatgc
5801 aagagaaaat actaatgcta gagctcgatc tagtaagagc aaaaataagc
5851 ttagcaagag ctaacggctc ttcgcaacaa ggagacctcc ctctccaccg
5901 tgaaacaccg gaaaaagaag aagcagttca ttctgcactg gccactttta
5951 cgccaactca agtaaaagct attccagagc aaacggctcc tggtaaagaa
6001 tcaacaaatc cgttgatggc tagtatcttg ccaaaagata tgaacccagt
6051 tcaaactggg ataaggcttg cagtgccagg ggacttttta cgtcctcatc
6101 agggaattcc aatcccacaa aaatctgagc ttagcagcac agttgctcct
6151 ctcagagcag aatcgggtat tcaacaccct catatcaact actacgttgt
6201 gtataacggt ccacacgccg gtatatacga tgactggggt tgtacaaagg
6251 cggcaacaaa cggcgttccc ggagttgcac acaagaagtt tgccactatt
6301 acagaggcaa gagcagcagc tgacgcgtac acaacaagta cgcaaacaga
6351 caggttgaac ttcatcccca aaggagaagc tcaactcaag cccaagagct
6401 ttgcagaggc cttaaccagc ccaccaaagc aaaaagccca ctggctcacg
6451 ctaggaacca aaaggcccag cagtgatcca gccccaaaag agatctcctt
6501 tgccccggag atcaccatgg acgatttcct ctatctctac catctaggaa
6551 gaaagttcga cggagaaggt gacgatacca tcttcaccac tgataatgag
6601 aagattagcc tcttcaattt cagaaagaat gctgacccac agatggttag
6651 agaggcctac gcagcaggtc tcatcaagac gatctacccg agtaataatc
6701 tccaggagat caaatacctt cccaagaagg ttaaagatgc agtcaaaaga
6751 ttcaggacta actgcatcaa gaacacagag aaagatatat ttctcaagat
6801 cagaagtact atcccagtat ggacgattca aggcctcgtt cataaaccaa
6851 ggcaagtaat agagattgga gtctctaaga aagtagttcc tactgaatca
6901 aaggccatgg agtcaaaaat tcagatcgag gatctaacag aactcgccgt
6951 gaagactggc ggacagttca tacagagtct tttacgactc aatgacaaga
7001 agaaaatctt cgtcaacatg gtggagcacg acactctcgt ctactccaag
7051 aatatcaaag atacagtctc agaagaccaa agggctattg agacttttca
7101 acaaagggta atatcaggaa acctcctcgg attccattgc ccatctatct
7151 gtcacttcat ggaaaggaca gtagaaaagg aaggtggctc ctacaaagtc
7201 catcattgcg ataaaggaaa ggctatcgtt caagatgcct ctgccgacag
7251 tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag
7301 acgttccaac cacgtcttca aagcaagtgg attgatgtga tatctccact
7351 gacgtaaggg atgacgcaca atcccactat ccttcgcaag actcttcctc
7401 tatataagga agttcatttc atttggagag gacacgctga aatcaccagt
7451 ctctctctac aaatctatct ctctctattt tctccataat aatgtgtgag
7501 tagttcccag ataagggaat taggattctt atagggtttc gctgatgtgt
7551 tgagcatata agaaaccctt agtatgtatt agtattagta agatacttct
7601 atcaataaaa tttctaattc ctaaaaccaa aatccagtac taaaatccag
7651 atctcctaaa gtccctatag atctatgtcg agaatataaa ccagacacga
7701 gacgactaaa cctggagccc agacgccgat tgaagctaga agtaccgctt
7751 aggcaggagg ccgttaggga aaagatgcta aggcagggtt ggttacgttg
7801 actcccccgt aggtttggtt taaatatgat gaagtggacg gaaggaagga
7851 ggaagacaag gaaggataag gttgcaggcc ctgtgcaagg taagaagatg
7901 gaaatttgat agaggtacgt tactatacct atactataag ctaagggaat
7951 gcttgtattt accctatata ccctaataac cccttatcga tttaaagaaa
8001 taatccgcat aagcccccgc ttaaaaaatt