Sequence of DPV Cauliflower mosaic virus

Cauliflower mosaic virus isolate NY8153, complete genome.

ACC No: M90541

Dated: 2010-02-05 | Length: 8030 | CRC: -1386710975

                
ID   M90541; SV 1; circular; genomic DNA; STD; VRL; 8030 BP.
XX
AC   M90541;
XX
DT   18-JUN-1992 (Rel. 32, Created)
DT   05-FEB-2010 (Rel. 103, Last updated, Version 10)
XX
DE   Cauliflower mosaic virus isolate NY8153, complete genome.
XX
KW   aphid transmission protein; capsid protein; DNA-binding protein;
KW   inclusion body matrix protein; movement protein; reverse transcriptase.
XX
OS   Cauliflower mosaic virus
OC   Viruses; Retro-transcribing viruses; Caulimoviridae; Caulimovirus.
XX
RN   [1]
RX   DOI; 10.1016/0042-6822(90)90538-3.
RX   PUBMED; 2371775.
RA   Vaden V.R., Melcher U.;
RT   "Recombination sites in cauliflower mosaic virus DNAs: implications for
RT   mechanisms of recombination";
RL   Virology 177(2):717-726(1990).
XX
RN   [2]
RP   1-8030
RX   PUBMED; 16653000.
RA   Chenault K.D., Steffens D.L., Melcher U.;
RT   "Nucleotide Sequence of Cauliflower Mosaic Virus Isolate NY8153";
RL   Plant Physiol. 100(1):542-545(1992).
XX
DR   EPD; EP07015; CAMV_35MJ.
XX
CC   Original source text: Cauliflower mosaic virus (individual_isolate
CC   NY8153) DNA.
CC   Virology 177, 717-726 (1990) reports basepairs 1-96, 624-697,
CC   1474-1705, and 7950-8030.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .8030
FT                   /organism="Cauliflower mosaic virus"
FT                   /isolate="NY8153"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:10641"
FT   CDS             364. .1347
FT                   /codon_start=1
FT                   /product="movement protein"
FT                   /note="ORF 1"
FT                   /db_xref="GOA:Q00966"
FT                   /db_xref="InterPro:IPR001022"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q00966"
FT                   /protein_id="AAA46354.1"
FT                   /translation="MDLYPEEKTQSKQSHNSENNMQIFKSENSDGFSSDLMISNDQLKN
FT                   ISKTQLTLEKEKIFKMPNVLSQVMKKAFSRKNEILYCVSTKELSVDIHDATGKVYLPLI
FT                   TKEEINKRLSSLKPEVRKTMSMVHLGAVKILLKAQFRNGIDTPIKIALIDDRINSRRDC
FT                   LLGAAKGNLAYGKFMFTVYPKFGISLNTQRLNQTLSLIHDFENKNLMNKGDKVMTITYI
FT                   VGYALTNSHHSIDYQSNATIELEDVFQEIGNVQQCDFCTIQNDECNWAIDIAQNKALLG
FT                   AKTQSQIGNSLQIGNSASSSNTENELARVSQNIDLLKNKLKEICGE"
FT   CDS             1349. .1828
FT                   /codon_start=1
FT                   /product="aphid transmission protein"
FT                   /note="ORF 2"
FT                   /db_xref="GOA:Q00965"
FT                   /db_xref="InterPro:IPR004917"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q00965"
FT                   /protein_id="AAA46355.1"
FT                   /translation="MSITGQPHVYKKDTIIRLKPLSLNSNNRSYVFSSSKGNIQNIINH
FT                   LNNLNEIVGRSLLGIWKINSYFGLSKDPSESKSKNPSVFNTAKTIFKSGGVDYSSQLKE
FT                   IKSLLEAQNTRIKSLENAIQSLDNKIEPEPLTKEEVKELKESINSIKEGLKNIIG"
FT   CDS             1830. .2219
FT                   /codon_start=1
FT                   /product="DNA-binding protein"
FT                   /note="ORF 3"
FT                   /db_xref="GOA:Q00967"
FT                   /db_xref="InterPro:IPR004986"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q00967"
FT                   /protein_id="AAA46356.1"
FT                   /translation="MANLNQIQKEVSEILSDQKSMKSDIKAILEMLGSQNPIKESLEAV
FT                   AAKIVNDLTKLINDCPCNKEILEALGNQPKEQLIEQPKEKGKGLNLGKYSYPNYGVGNE
FT                   ELGSSGNPKALTWPFKAPAGWPNQF"
FT   CDS             2201. .3667
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /note="ORF 4"
FT                   /db_xref="GOA:Q00956"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="InterPro:IPR001988"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q00956"
FT                   /protein_id="AAA46357.1"
FT                   /translation="MAESILDRTINRFWYNLGEDCLSESQFDLMIRLMEESLSGDQIID
FT                   LTSLPSDNLQVEQVMTTTEDSISEESEFLLAIGETSEDESDSGEEPEFEQVRMDRTGGT
FT                   EIPKEEDGEPSRYNERKRKTTEDRYFPTQPKTIPRQKQTSMGMLNIDCQTNRRTLIDDW
FT                   AAEIGLIVKTNREDYLNPETILLLMEHKTSGIAKELIRNTRWNRTTGDIIEQVIDRMYT
FT                   MFLGLNYSDNKVAEKIDEQEKAKIRMTKLQLCDICYLEEFTCDYEKNMYKTELADFPGY
FT                   INQYLSKIPIIGEKALTRFRHEANGTSIYSLGFERKICKEELSKIRDLSKNEKKLKKFN
FT                   KKCCSIEEASAEYGCKKTSTKKYHKKRYKKKYKAYKPYKKKKKFRSGKYFKPKEKKGSK
FT                   QKYCPKGKKDCRCWICNIEGHYANECPNRQSSEKAHILQQAEKVGLQPIEAPYEGVQEV
FT                   FILEYKEEEEETSTEESDDESSTSEDSDSD"
FT   CDS             3627. .5669
FT                   /codon_start=1
FT                   /product="reverse transcriptase"
FT                   /note="ORF 5"
FT                   /db_xref="GOA:Q00962"
FT                   /db_xref="InterPro:IPR000477"
FT                   /db_xref="InterPro:IPR000588"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q00962"
FT                   /protein_id="AAA46358.1"
FT                   /translation="MMNHLLLKTQTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFV
FT                   DTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVIFKIPTV
FT                   YQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESM
FT                   KKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKTEELLEKVCS
FT                   ENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS
FT                   PHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSF
FT                   DCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFC
FT                   CVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTH
FT                   KPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKW
FT                   TKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELI
FT                   CRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYK
FT                   GDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREFNKVNS"
FT   CDS             5773. .7335
FT                   /codon_start=1
FT                   /product="inclusion body matrix protein"
FT                   /note="ORF 6"
FT                   /db_xref="GOA:Q00957"
FT                   /db_xref="InterPro:IPR002609"
FT                   /db_xref="InterPro:IPR009027"
FT                   /db_xref="InterPro:IPR011320"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q00957"
FT                   /protein_id="AAA46359.1"
FT                   /translation="MENIEKLLMQEKILMLELDLVRAKISLARANGSSQQGDLPLHRET
FT                   PEKEEAVHSALATFTPTQVKAIPEQTAPGKESTNPLMASILPKDMNPVQTGIRLAVPGD
FT                   FLRPHQGIPIPQKSELSSTVAPLRAESGIQHPHINYYVVYNGPHAGIYDDWGCTKAATN
FT                   GVPGVAHKKFATITEARAAADAYTTSTQTDRLNFIPKGEAQLKPKSFAEALTSPPKQKA
FT                   HWLTLGTKRPSSDPAPKEISFAPEITMDDFLYLYHLGRKFDGEGDDTIFTTDNEKISLF
FT                   NFRKNADPQMVREAYAAGLIKTIYPSNNLQEIKYLPKKVKDAVKRFRTNCIKNTEKDIF
FT                   LKIRSTIPVWTIQGLVHKPRQVIEIGVSKKVVPTESKAMESKIQIEDLTELAVKTGGQF
FT                   IQSLLRLNDKKKIFVNMVEHDTLVYSKNIKDTVSEDQRAIETFQQRVISGNLLGFHCPS
FT                   ICHFMERTVEKEGGSYKVHHCDKGKAIVQDASADSGPKDGPPPTRSIVEKEDVPTTSSK
FT                   QVD"
XX
SQ   Sequence 8030 BP; 2934 A; 1653 C; 1567 G; 1876 T; 0 other;

m90541 Length: 8030  05-FEB-2010  Type: N  Check: 5888  ..

       1  ggtatcagag ccatgaatcg gtttaaagac caaactcaag agggtaaaac
      51  ctcatcaaaa tacgaaagag ttcttaactc taaagataaa agatctttca
     101  agattaaaac tagttccctc acaccggtga ccgacaggtt taccaccgta
     151  aggtttcaga acaacatcga atgcgtttac gccaacttcg actctcagct
     201  caagtcgtcg tacgatggta gatctaaaaa gatcaagaat ctaagcctta
     251  aaaatcttag atgtcacgaa gccttcctca ggaagtacct tctggaacaa
     301  taaatctctc tgagaatagt actctattga gtatccacag ataaaataat
     351  cttctgtgtt gagatggatt tgtatccaga agaaaagacc caaagcaagc
     401  aatcgcataa ttctgaaaat aatatgcaaa tatttaaatc agaaaattcg
     451  gatggattct cctccgatct aatgatctca aacgatcaat taaaaaatat
     501  ctctaaaacc caattaactt tggaaaaaga aaagatattt aaaatgccta
     551  acgttttatc tcaagttatg aaaaaagcgt ttagcaggaa aaacgagatt
     601  ctctactgcg tctcgacaaa agaattatca gtggacattc acgatgccac
     651  aggtaaggta tatcttcctt taatcactaa agaggagata aataaaagac
     701  tttccagttt aaaacctgaa gtcagaaaga ccatgtccat ggttcatctt
     751  ggagcggtca aaatattgct taaagctcaa tttcgaaatg ggattgatac
     801  cccaatcaaa attgctttaa tcgatgatag aattaattct agaagagatt
     851  gccttctcgg tgcagccaaa ggtaatctag catacggtaa gtttatgttt
     901  actgtatacc ccaagtttgg aataagcctt aatacccaaa gacttaacca
     951  aaccctaagc cttattcatg attttgaaaa taaaaatctt atgaataaag
    1001  gtgataaagt tatgaccata acctatatcg taggatatgc attaactaat
    1051  agtcatcata gcatagatta tcaatcgaat gctacaattg aactagaaga
    1101  cgtatttcaa gaaattggaa atgtccagca atgtgatttc tgtacaatac
    1151  agaatgacga atgtaattgg gccattgata tagcccaaaa caaagcctta
    1201  ttaggagcta aaacccaatc ccaaattggt aatagtcttc aaataggaaa
    1251  cagtgcttca tcctctaata ctgaaaatga attagctagg gtaagccaaa
    1301  acatagatct tttaaagaat aaattaaaag aaatctgtgg agaataaaat
    1351  gagcattacg ggtcaaccgc atgtttataa aaaggatact attattagac
    1401  taaaaccatt gtctcttaat agtaataata gaagttatgt ttttagttcc
    1451  tcaaaaggga acattcaaaa tataattaat catcttaaca acctcaatga
    1501  gattgtagga agaagcttac tcggaatatg gaagatcaac tcatacttcg
    1551  gactaagcaa agacccttcg gagtccaaat caaaaaaccc gtcagttttt
    1601  aatactgcaa aaaccatttt taagagtggg ggggttgatt actcgagcca
    1651  attaaaggaa ataaaatccc ttttagaagc tcaaaacact agaattaaaa
    1701  gtctagaaaa tgcaattcaa tccttagata ataagattga accagagccc
    1751  ttaactaaag aagaagttaa agagctaaaa gaatcgatta actcgatcaa
    1801  agaaggatta aagaatatta ttggctgaaa tggctaatct taatcaaatc
    1851  caaaaagaag tctctgaaat cctcagtgac caaaaatcca tgaaatcgga
    1901  tataaaagct atcttagaaa tgctaggatc ccaaaatcct attaaagaaa
    1951  gcttagaagc cgttgcagcg aaaatcgtta atgacttaac caagctcatc
    2001  aatgattgtc cttgtaacaa agaaatatta gaagccttag gcaatcagcc
    2051  taaagagcaa ctaatagaac aacctaaaga aaaaggcaaa ggtcttaatc
    2101  taggaaaata ctcttacccc aattacggtg taggaaatga agaattagga
    2151  tcctctggaa accctaaagc tttaacctgg cccttcaaag ctccagcagg
    2201  atggccgaat caattttaga cagaaccatt aataggtttt ggtataatct
    2251  gggagaagat tgtctctcag aaagtcaatt tgaccttatg ataaggttaa
    2301  tggaagagtc cttgagcggg gaccaaatta ttgatctaac ctctctacct
    2351  agtgataatt tgcaggtcga acaggttatg acaactaccg aagactcgat
    2401  ctcggaagaa tcagaattcc ttctagcaat aggagaaaca tctgaagacg
    2451  aaagcgattc aggagaagaa cctgaattcg aacaagttcg aatggatcga
    2501  acaggaggaa cggagattcc caaagaagaa gatggtgaac catctagata
    2551  caatgagaga aagagaaaga ccacggagga ccggtacttt ccaactcaac
    2601  caaagaccat tccaagacaa aagcaaacgt ctatgggaat gctcaacatt
    2651  gactgccaaa ccaatcgaag aaccttaatc gatgattggg cagcagaaat
    2701  cggactgata gtcaagacca atagagaaga ctatctgaat ccagaaacaa
    2751  tactactctt gatggaacac aaaacatcag gaatagccaa ggagttaatc
    2801  cgaaatacaa gatggaaccg tactaccggc gatatcatag aacaggtgat
    2851  cgatcggatg tacaccatgt tcttaggact taactactcc gacaacaagg
    2901  ttgctgaaaa gatagacgag caagagaagg ccaagatcag aatgaccaaa
    2951  ctccagctct gcgacatctg ctaccttgaa gaatttacat gtgattatga
    3001  aaagaacatg tacaagacgg aactggcgga tttcccagga tatatcaacc
    3051  agtacctgtc aaaaatcccc atcataggag aaaaagcgct aacacgcttt
    3101  aggcatgaag ccaacggaac cagcatctac agcttaggtt tcgagcgaaa
    3151  gatatgcaaa gaagaactat ctaaaattcg cgacttatcc aagaacgaga
    3201  agaagttgaa gaaattcaac aagaagtgct gcagcatcga agaagcttca
    3251  gcagaatatg gatgtaagaa gacatctacc aaaaagtatc acaagaagcg
    3301  atacaagaaa aaatataagg cttataaacc ttataagaag aagaagaaat
    3351  tccgatccgg aaaatacttc aagcccaaag agaagaaggg ctcaaagcaa
    3401  aagtattgcc caaaaggcaa gaaagactgc aggtgttgga tctgcaatat
    3451  cgaaggtcat tacgccaacg aatgtcctaa tcgacaaagc tcggaaaagg
    3501  ctcacatcct tcaacaagca gaaaaagttg gcctccagcc cattgaagct
    3551  ccctatgaag gagttcaaga agtattcatc ttagaataca aagaagagga
    3601  agaagaaacc tctacagaag aaagcgatga tgaatcatct acttctgaag
    3651  actcagactc agactgagca ggtgatgaac gtcaccaatc ccaattcgat
    3701  ctacatcaag ggcagactct acttcaaggg atacaagaag atagagcttc
    3751  actgttttgt agacacggga gcaagcttat gcatagcatc caagttcgtc
    3801  attccagaag aacattgggt caatgcagaa agaccaataa tggtcaaaat
    3851  agcagatgga agctcaatca ccatcagcaa agtctgcaaa gacatagact
    3901  tgatcatagt cggcgtgata ttcaaaattc ccaccgtcta tcagcaagaa
    3951  agtggcatcg atttcataat cggcaacaac ttctgtcagc tatatgaacc
    4001  attcatacag tttacggata gagttatctt cacaaagaac aagtcttatc
    4051  ctgttcatat tgcgaagcta accagagcag tgcgagtagg caccgaagga
    4101  tttcttgaat caatgaagaa acgttcaaag actcaacaac ctgagccggt
    4151  gaacatttcg acaaacaaga tagaaaatcc gctagaagaa attgctattc
    4201  tttcagaggg gaggaggtta tcagaagaaa aactcttcat cactcaacaa
    4251  agaatgcaaa aaaccgaaga actacttgag aaagtatgtt cagaaaatcc
    4301  attagatcct aacaagacta agcaatggat gaaagcttca atcaagctca
    4351  gcgacccaag caaagctatc aaggttaaac ccatgaagta tagcccaatg
    4401  gatcgtgaag aatttgacaa gcaaatcaaa gagttactgg accttaaagt
    4451  cattaagccc agtaaaagcc ctcacatggc accagccttc ttggtcaaca
    4501  atgaagccga gaacggaaga ggaaacaaac gtatggtagt gaactacaaa
    4551  gctatgaata aagccaccgt aggagacgca tacaatcttc ccaacaaaga
    4601  cgagttactt acactcattc gaggaaagaa gatcttttct tccttcgact
    4651  gtaagtcagg attctggcaa gttctgcttg atcaagaatc aagacctcta
    4701  acggcgttca catgtccaca aggtcactac gaatggaatg tggtcccttt
    4751  cggcctaaag caggcaccat ccatattcca gagacacatg gacgaagcat
    4801  ttcgtgtgtt cagaaagttc tgttgcgttt atgtcgacga cattgtcgta
    4851  ttcagtaaca acgaagaaga tcatctactt cacgtagcaa tgatcttaca
    4901  aaagtgcaat cagcatggaa ttatcctttc caagaagaaa gcacaactct
    4951  tcaagaagaa gataaacttc cttggtctag aaatagatga aggaacacat
    5001  aagcctcaag gacatatttt ggaacatatc aacaagttcc cagataccct
    5051  tgaagacaag aagcaacttc agagattctt aggcatccta acatatgcct
    5101  ctgattatat cccgaatcta gctcaaatga gacagcctct gcaagccaag
    5151  cttaaagaaa atgttccatg gaaatggaca aaagaggaca ccctctacat
    5201  gcaaaaggtg aagaaaaatc tgcaaggatt tcctccacta catcatccct
    5251  taccagaaga gaagctgatc atcgaaaccg atgcatcaga cgactactgg
    5301  ggaggtatgt taaaagctat caaaattaac gaaggtacta atactgagtt
    5351  aatttgcaga taccgatctg gaagctttaa ggctgcagaa aggaattacc
    5401  acagcaatga caaagagaca ttggcggtaa taaatactat aaagaaattc
    5451  agtatttatc taactcctgt tcattttctg atcaggacag ataatactca
    5501  tttcaagagt tttgttaatc tcaattacaa aggtgattca aaacttggaa
    5551  gaaacatcag atggcaagca tggcttagcc actattcatt tgatgttgaa
    5601  catattaaag gaaccgacaa ccactttgcg gacttccttt caagagaatt
    5651  caataaggtt aattcctaat tgaaatccga agataagatt cccacacact
    5701  tgtggctgat atcaaaaggc tactgcctat ataaacacat ctctggagac
    5751  tgagaaaatc agacctccaa gcatggagaa catagaaaaa ctcctcatgc
    5801  aagagaaaat actaatgcta gagctcgatc tagtaagagc aaaaataagc
    5851  ttagcaagag ctaacggctc ttcgcaacaa ggagacctcc ctctccaccg
    5901  tgaaacaccg gaaaaagaag aagcagttca ttctgcactg gccactttta
    5951  cgccaactca agtaaaagct attccagagc aaacggctcc tggtaaagaa
    6001  tcaacaaatc cgttgatggc tagtatcttg ccaaaagata tgaacccagt
    6051  tcaaactggg ataaggcttg cagtgccagg ggacttttta cgtcctcatc
    6101  agggaattcc aatcccacaa aaatctgagc ttagcagcac agttgctcct
    6151  ctcagagcag aatcgggtat tcaacaccct catatcaact actacgttgt
    6201  gtataacggt ccacacgccg gtatatacga tgactggggt tgtacaaagg
    6251  cggcaacaaa cggcgttccc ggagttgcac acaagaagtt tgccactatt
    6301  acagaggcaa gagcagcagc tgacgcgtac acaacaagta cgcaaacaga
    6351  caggttgaac ttcatcccca aaggagaagc tcaactcaag cccaagagct
    6401  ttgcagaggc cttaaccagc ccaccaaagc aaaaagccca ctggctcacg
    6451  ctaggaacca aaaggcccag cagtgatcca gccccaaaag agatctcctt
    6501  tgccccggag atcaccatgg acgatttcct ctatctctac catctaggaa
    6551  gaaagttcga cggagaaggt gacgatacca tcttcaccac tgataatgag
    6601  aagattagcc tcttcaattt cagaaagaat gctgacccac agatggttag
    6651  agaggcctac gcagcaggtc tcatcaagac gatctacccg agtaataatc
    6701  tccaggagat caaatacctt cccaagaagg ttaaagatgc agtcaaaaga
    6751  ttcaggacta actgcatcaa gaacacagag aaagatatat ttctcaagat
    6801  cagaagtact atcccagtat ggacgattca aggcctcgtt cataaaccaa
    6851  ggcaagtaat agagattgga gtctctaaga aagtagttcc tactgaatca
    6901  aaggccatgg agtcaaaaat tcagatcgag gatctaacag aactcgccgt
    6951  gaagactggc ggacagttca tacagagtct tttacgactc aatgacaaga
    7001  agaaaatctt cgtcaacatg gtggagcacg acactctcgt ctactccaag
    7051  aatatcaaag atacagtctc agaagaccaa agggctattg agacttttca
    7101  acaaagggta atatcaggaa acctcctcgg attccattgc ccatctatct
    7151  gtcacttcat ggaaaggaca gtagaaaagg aaggtggctc ctacaaagtc
    7201  catcattgcg ataaaggaaa ggctatcgtt caagatgcct ctgccgacag
    7251  tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag
    7301  acgttccaac cacgtcttca aagcaagtgg attgatgtga tatctccact
    7351  gacgtaaggg atgacgcaca atcccactat ccttcgcaag actcttcctc
    7401  tatataagga agttcatttc atttggagag gacacgctga aatcaccagt
    7451  ctctctctac aaatctatct ctctctattt tctccataat aatgtgtgag
    7501  tagttcccag ataagggaat taggattctt atagggtttc gctgatgtgt
    7551  tgagcatata agaaaccctt agtatgtatt agtattagta agatacttct
    7601  atcaataaaa tttctaattc ctaaaaccaa aatccagtac taaaatccag
    7651  atctcctaaa gtccctatag atctatgtcg agaatataaa ccagacacga
    7701  gacgactaaa cctggagccc agacgccgat tgaagctaga agtaccgctt
    7751  aggcaggagg ccgttaggga aaagatgcta aggcagggtt ggttacgttg
    7801  actcccccgt aggtttggtt taaatatgat gaagtggacg gaaggaagga
    7851  ggaagacaag gaaggataag gttgcaggcc ctgtgcaagg taagaagatg
    7901  gaaatttgat agaggtacgt tactatacct atactataag ctaagggaat
    7951  gcttgtattt accctatata ccctaataac cccttatcga tttaaagaaa
    8001  taatccgcat aagcccccgc ttaaaaaatt