Sequence of DPV Sweet potato pakakuy virus
Sweetpotato badnavirus A, complete genome.
ACC No: FJ560943
Dated: 2011-06-21 | Length: 8082 | CRC: -1377429777
ID FJ560943; SV 2; linear; genomic DNA; STD; VRL; 8082 BP.
XX
AC FJ560943;
XX
DT 25-MAY-2009 (Rel. 100, Created)
DT 21-JUN-2011 (Rel. 109, Last updated, Version 2)
XX
DE Sweetpotato badnavirus A, complete genome.
XX
KW .
XX
OS Sweetpotato badnavirus A
OC Viruses; Retro-transcribing viruses; Caulimoviridae; Badnavirus.
XX
RN [1]
RP 1-8082
RX DOI; 10.1016/j.virol.2009.03.024.
RX PUBMED; 19394993.
RA Kreuze J.F., Perez A., Untiveros M., Quispe D., Fuentes S., Barker I.,
RA Simon R.;
RT "Complete viral genome sequence and discovery of novel viruses by deep
RT sequencing of small RNAs: a generic method for diagnosis, discovery and
RT sequencing of viruses";
RL Virology 388(1):1-7(2009).
XX
RN [2]
RP 1-8082
RA Kreuze J.F., Perez A.;
RT ;
RL Submitted (17-DEC-2008) to the INSDC.
RL Germplasm Enhancement and Crop Improvement, International Potato Center
RL (CIP), Avenida la Molina 1895, Lima Lima 12, Peru
XX
RN [3]
RC Sequence update by submitter
RP 1-8082
RA Kreuze J.F., Perez A.;
RT ;
RL Submitted (14-JUN-2011) to the INSDC.
RL Germplasm Enhancement and Crop Improvement, International Potato Center
RL (CIP), Avenida la Molina 1895, Lima Lima 12, Peru
XX
CC On Jun 14, 2011 this sequence version replaced gi:237846572.
XX
FH Key Location/Qualifiers
FH
FT source 1..8082
FT /organism="Sweetpotato badnavirus A"
FT /isolate="Huachano1"
FT /mol_type="genomic DNA"
FT /country="Peru"
FT /isolation_source="sweet potato cv. Huachano"
FT /db_xref="taxon:646442"
FT misc_feature 1..18
FT /note="similar to tRNA-Met sequence"
FT stem_loop 76..92
FT misc_feature 138..656
FT /note="putative leader sequence"
FT CDS 657..1085
FT /codon_start=1
FT /product="hypothetical protein"
FT /note="ORF1"
FT /db_xref="InterPro:IPR010746"
FT /db_xref="UniProtKB/TrEMBL:C4NFM2"
FT /protein_id="ACR22895.2"
FT /translation="MSERWEKSLQNWYDSRRSHLEYLDLESVSKPTLSQLAHNLSIVRD
FT NNNLHTKVLLKRCYTLEEKLEEQSLLIKKLEKGLEALTEEFLSSRPLTAKQVKELVVEI
FT AEQPKLVEQEALKLTEELKGKLDKVEGLIRDLKEFITG"
FT CDS 1082..1540
FT /codon_start=1
FT /product="hypothetical protein"
FT /note="ORF2"
FT /db_xref="UniProtKB/TrEMBL:C4NFM3"
FT /protein_id="ACN56744.1"
FT /translation="MNYREVQDLPGFKRALTGTASLGAEGFNQPSGKTQSSLDTLIRQN
FT NTLLFLAGVSDDRIQGIEEELVEIRKAISKRETPDLSGVVGQLQQLTIKGKAPEPRGVL
FT RVYQDPYLQTRRQINEETRGRSSHRSRSSRGGSEPPRRTPDPPRGSDS"
FT CDS 1437..5006
FT /codon_start=1
FT /product="polyprotein"
FT /note="ORF3a; contains coat protein & movement protein"
FT /db_xref="GOA:C4NFM4"
FT /db_xref="InterPro:IPR001022"
FT /db_xref="InterPro:IPR001878"
FT /db_xref="InterPro:IPR013084"
FT /db_xref="UniProtKB/TrEMBL:C4NFM4"
FT /protein_id="ACN56745.1"
FT /translation="MRRPGAAAVTEVGLQGEGRNPQGERQILHEDQIRDYRRMAEARYQ
FT LQRQVARVLGRPYRRTLERLMNPDQNLEDSLSRRARIVPAEVLYSSTEGTENQRVYIHR
FT SEEEITCLDNQQVDLPLITPQSHAQLLRQNYRFIHIGAIQVRVQALHRTHAGTMVLVLN
FT TDRRWNGDLSLFGGIEGDLTEGAFMTYIIPNVTMTVEDFCQNIMVEFQTRGYSEWVHGS
FT NLLITRGMVGRLSNTPNVGFNYNVSAVTDYLVSKGVRALPGRRYSTADIQGLRWNVRRP
FT REIIPRRPTEMISRNLLGGGFSLSFREYQPITAEQRRAAQHPEEDALEELEHEVLGVLD
FT VEVADWDNLDFPEPDPTMVNIEIPEPEPQQPALEPVEHLTDEVLGFRREASSWNYLGSD
FT SEEEALLEFLYYRNTPDDGPYWNMEYSMAYQQLAAALEEDVSPNDVESLDKNTVEGGTS
FT TPFFGGEEAVPPKNSENSCIMCNKKGIPGDKILCQNCMDITDDSDDEREVQRQERRKRM
FT QRQKEKRVTPSTSDQTHHEVLGAINDEMPNTAEEFEEMVERLYAQVSQQAPEASNPAAP
FT STSRGQSAXSPSXPPEDISMGQPSYAPARPGTTTEISGPPTFRTDSRFLKRGTNNENWS
FT LPPAQQQGGVLLTLPEQMGLLNDVFMRWETTTLNHMSLMNIQDTQEKVDYMENLLGETA
FT KLAWIQWRTVYEDEYKTIVGQAEGRMGTQNVISQVRRILTLSDPVQGSTAVQDQAYRDL
FT ERLQCNDVKDMVKFLNDYMRLATKTGRLYMGAELSEKLWIKMPGDLGTKIKEEFNKAHP
FT GAQIAVIPRIFFAHKYLEDRCKEAAFARSLKSVSFCKDIPIQGYYGNDKPKYTPRKART
FT YKGKPHETHVRIDRRKNLDRNSHCKCFICEQPGHYARDCPNQKRNINRVMMFNQVNIPD
FT NYDIVSVSENAEDSDAIYSLTEGDDAEETNFGLVHESVHMITHQVIGSWRAYIEPSETQ
FT KVCRHQWQDHQEIELPGEDTCLWCKHHINIRTRSHCPACLLTVCNICSLRYLGREVPPK
FT AQERVLPFPDQSALIQQQQAYMNWADQDRARLKQEVNDERRRGQLLFEEERRRAERLGD
FT EIAQLKLRMESMEEEQKLKNDLYTQTEKDLKNRIRVLKDKKMELKEXLKKAKXXRYEAL
FT EKERKSGRKVLKMRGVAPSLKRC"
FT CDS 5116..7500
FT /codon_start=1
FT /product="RNaseH/reverse transcriptase"
FT /note="replication protein; ORF3b; polyprotein; contains
FT aspartic protease domain and RNaseH/reverse transcriptase
FT domain"
FT /db_xref="GOA:C4NFM5"
FT /db_xref="InterPro:IPR000477"
FT /db_xref="InterPro:IPR002156"
FT /db_xref="InterPro:IPR018061"
FT /db_xref="InterPro:IPR021109"
FT /db_xref="UniProtKB/TrEMBL:C4NFM5"
FT /protein_id="ACN56746.1"
FT /translation="MIQGAVSPQIVSSASGELNNRLYNMKVCIRIRGCPEFSVNAILDT
FT GATVCCIEEERVPKEGLEESKMTAQFTGLNSTQQTRKKLKEGYMLIGEHMFPLPFVYAL
FT NPMRIGRGIQFIIGCNFIRRMKGGLRIEGPTVTFYRNVSTIETQEKSTVAATIGSINEG
FT RTLIFPRFRKEVAALIKEGFIGNNPLLHWTKNRVYCKLQIKNTDLIIQDPPLKHVTPAA
FT REFFKSQISDLLKAKLIRPSKSKHRTTAFMVESGTIVDPKTGKEIRGKQRMVYNYKRLN
FT DNTEKDQYSLPGINTIVSRISGKKIFSKFDLKAGFHQIRMEEKSKPWTAFWTPEGLYEF
FT EVMPFGLMNAPADFQRKMDNAFRGTDAFIAVYIDDILVFSENEEEHEDHLLNLAQIVRR
FT EGLILSPTKMKIGVKEVDFLGIKIQGNKIQLQEHILKKIGDFKEKDLLTKKGLRSWLGI
FT LNYARQYIPNLGKLLGPLYGKTSPTGEIRFNAQDWKLVREIKRKVQQLPPLEIPPKDCC
FT IVLEADGCMEGWGAVCKWKQSAYDPRSKERIAAYASGKFQPIKSTIDAEIFAIMNAMEA
FT FKIYYLDKKEMIIRTDCEAIVSFFNKSASNKPSRARWISFTDYITGTGIKIRIEHIDGK
FT DNILADYLSRLVFSLIIAEWKTQEKSIAPLQAPRITLTKXSCSKQQEPLLLRELSMKRP
FT LEDKEDQGPWSILLLTQPIEHLLKGSRNGQDRSKPITETGSMTSLKIALMILDLSPEVI
FT ISTYGLKEGSLEKACQEMWVHNSKQADRPYCMRRPRRPGEA"
FT CDS 7077..7658
FT /codon_start=1
FT /product="hypothetical protein"
FT /note="ORF4"
FT /db_xref="UniProtKB/TrEMBL:C4NFM6"
FT /protein_id="ACN56747.1"
FT /translation="MENAREINSPPSSPQNNSNQRVMQQAARAFALERAIYETTARRQR
FT GPRTMEHPAVNPANRTPAQRFEEWSRQEQTYHRDWINDQFENSLNDLGPLARGYNFNLW
FT LERRLLREGLPGDVGAQLQASRQALLHEAAQEARRGVMTLQRMIRNRAQHNRRYITRDN
FT AAGDLQGSYSEAQAQVEAAAIILMALIEDL"
XX
SQ Sequence 8082 BP; 2698 A; 1574 C; 1923 G; 1870 T; 17 other;
fj560943 Length: 8082 21-JUN-2011 Type: N Check: 7259 ..
1 tggtatcaga gcgagtatca gtcaatctgt aagtacttgc tctacgttgt
51 attttggttt ttggtatttt agtatggcag gctaagccta ccgtcgggga
101 aagaggtccc tgacagtaga gtttcacgcc agtccttgtt ttcaatgttt
151 gtattataac gttataagca tttttaatga agaatatttt gatatacgct
201 tgaatgttta tctttcagct tagtatgatt atttgttttg ctagtaaata
251 aattatactt ctccgagtaa agtttagatt tgagaaattc ttcgctagta
301 ctgagtggtt cgctagtaaa aacttgattc aataccgaag ggatgccttg
351 agtaaactaa ttccaataat aaggcaaaag acgtaaatca cagtaagtcc
401 cgcgggaggg ccgtaagatg atgaaaaggt gagcctaata tttggaataa
451 aatttacttt gagacgatga tgtaagtatt gaattaagag ttgaaaggtc
501 cattcgggaa aggttttatt tcaaaaatct ctatgatgac ttctgtcttt
551 tggaggaata taatttgttg aaagatttca aattttcaga aaagagagct
601 tatatcaacg tattcagatt tgtccttgct tatgactttt ataatctttt
651 gaaaacatgt ctgaaagatg ggagaaatcc cttcagaact ggtatgattc
701 tcgaaggtca cacctcgagt accttgattt agagtctgtt agtaaaccta
751 cgcttagcca gttagcccat aatctaagta ttgttagaga taacaataac
801 ctccacacaa aggtcctttt aaaaaggtgt tataccttag aagaaaaatt
851 agaagagcag tctcttctga ttaaaaaatt agaaaaaggc ttagaagcct
901 taactgagga gttcctttcc tcaagacctc ttaccgccaa acaagtaaaa
951 gaattggttg tggaaatagc agagcagcca aagttggtgg aacaggaagc
1001 acttaagctt actgaggagc ttaaggggaa actcgataag gttgaaggcc
1051 ttatcaggga tctaaaggag ttcatcaccg gatgaactac agggaagtcc
1101 aggatcttcc tggatttaaa agggcgttaa ccggaacagc gtccttggga
1151 gcagaaggct ttaatcagcc ttcaggaaaa acacaatcca gccttgatac
1201 actcataagg cagaataaca ctctgttatt cttggccggg gtttctgacg
1251 ataggatcca aggaatcgag gaggaactag tcgagatcag aaaggcaatc
1301 tcaaaaagag agactccaga tctctctggt gtggtgggtc agcttcagca
1351 gctgactata aaaggaaagg ctccagaacc aagaggtgta cttcgtgtat
1401 accaagatcc ttatctgcaa actaggaggc agataaatga ggagaccagg
1451 ggccgcagca gtcacagaag tcggtcttca aggggagggt cggaaccccc
1501 aaggagaacg ccagatcctc cacgaggatc agattcgtga ttacagaagg
1551 atggcagaag cacggtacca actacagcgt caggtagccc gtgttctagg
1601 aaggccatac agaaggacgc tggaaaggct tatgaatcca gatcaaaact
1651 tggaggattc attaagcagg agagcaagga tagtacctgc tgaggtacta
1701 tattcgtcca ctgaagggac agaaaaccag agggtgtata ttcacagaag
1751 tgaggaagag atcacttgcc tggacaatca gcaagtggat ctaccactca
1801 ttacccctca aagccatgcg cagctcctga gacaaaatta cagattcata
1851 catatcggag caatacaggt tagggttcaa gccctacaya ggacgcatgc
1901 rgggaccatg gtgttggtcc taaayacgga yagacggtgg aacggggact
1951 tatccctgtt tggaggaatt gaaggcgatc ttacggaagg tgccttcatg
2001 acatatatca tccccaacgt aacaatgaca gttgaggatt tctgccaaaa
2051 tatcatggta gaattccaga ctaggggata ctctgagtgg gttcatgggt
2101 cgaacctact aatcactcga ggaatggtag gaagattatc caacactcca
2151 aatgttggat ttaactataa cgtctccgct gtaaccgatt atttggttag
2201 caaaggcgta cgagccctac caggacgacg atacagtacg gctgacatcc
2251 agggcctaag gtggaacgtc aggagacctc gtgagatcat cccaagaagg
2301 cccacagaga tgataagcag aaacctccta gggggaggat tttccctatc
2351 ctttagggaa tatcagccaa taacagctga gcaaaggagg gcagctcagc
2401 atcctgagga agatgctcta gaagaattgg agcatgaagt gttgggagta
2451 ctagacgtag aagtcgcaga ttgggacaat ctggatttcc cagaaccaga
2501 cccaacaatg gtgaacatag agatccctga gccagaacca caacaaccgg
2551 ctctagaacc agtagagcac ttaactgatg aagtgctagg tttccgaaga
2601 gaagcaagct cttggaacta cttgggatca gacagtgaag aagaagcact
2651 gctagaattc ctgtattaca ggaatactcc agatgatgga ccatactgga
2701 atatggaata tagtatggca tatcagcagt tagcagctgc tttggaagaa
2751 gacgtctcgc caaacgacgt ggaaagtttg gacaaaaata cagtggaagg
2801 tggaacttcc actccttttt tcgggggaga agaggcagta ccccctaaaa
2851 attctgaaaa cagttgcatc atgtgcaaca agaaaggcat cccaggggat
2901 aaaattctct gccagaactg tatggatatc actgatgata gtgatgatga
2951 aagagaagtt caacgacaag aaaggagaaa aaggatgcag aggcaaaagg
3001 aaaagagggt aaccccctct acatctgacc aaactcacca tgaggtactt
3051 ggggcgataa atgatgaaat gcctaatacc gcagaagagt ttgaagaaat
3101 ggtggaaaga ttgtatgcgc aggtgtcaca acaagctccg gaagcatcaa
3151 accctgctgc accatccacc tcaagagggc agtcagccca wtccccwtcc
3201 rgscccccag aagacatatc aatgggccaa ccctcctatg cacctgcaag
3251 gccaggaaca acgacggaga taagtggacc cccgactttc aggacagact
3301 caaggttctt gaaaagaggg actaataacg aaaattggtc gttgccacca
3351 gcacagcaac aaggaggagt gctgttaact ctcccggaac aaatgggatt
3401 gttaaatgat gttttcatga gatgggagac aactacactg aaccatatgt
3451 ctctcatgaa catccaggac acccaggaaa aagtggatta catggaaaac
3501 ctactgggtg aaacagcaaa actggcatgg atccaatgga gaacggtcta
3551 tgaggatgag tataagacta tcgtaggcca agccgaaggg agaatgggga
3601 cacagaatgt catctcccaa gtcagaagga tcctgacctt aagtgaccca
3651 gtgcaagggt caacagcagt acaggaccag gcatacagag atctagaacg
3701 cctacaatgt aatgatgtta aggatatggt caaattcctt aatgattata
3751 tgaggctagc aaccaaaact ggtcgactgt acatgggagc tgagctttcc
3801 gagaagctct ggataaagat gccgggtgac cttggtacta aaatcaagga
3851 agagttcaac aaggctcacc ctggagcaca aatagctgtc ataccgcgca
3901 tattctttgc gcataaatac cttgaagata ggtgtaaaga agctgctttt
3951 gcaaggagct taaagagtgt atccttctgc aaggatatac caattcaagg
4001 gtattatggt aatgacaagc caaagtatac ccccagaaaa gcaaggacat
4051 acaaggggaa gcctcatgaa acacacgtga ggattgatag gcgaaagaac
4101 ctggatagga actcacactg taagtgtttt atatgtgagc aaccggggca
4151 ctacgcaagg gattgcccaa accaaaaaag gaacatcaac agagtaatga
4201 tgttcaacca ggtcaatata ccagacaatt acgatattgt ctcggtatct
4251 gaaaatgcag aagatagcga tgctatctac agcctgactg aaggagatga
4301 tgcggaggaa accaattttg gattggtaca cgaatccgta cacatgatta
4351 ctcatcaggt gatagggtca tggagggcct atatwgagcc aagtgaaacg
4401 caaaaggtct gtcgccatca atggcaagac catcaggaga ttgarctccc
4451 aggagaagat acttgcctct ggtgcaagca tcatattaac atcaggacta
4501 ggagccattg tcctgcatgt ttgctcacag tctgcaacat ctgttcgttg
4551 cgatatttgg gaagagaagt cccaccaaag gctcaagaaa gggtcttacc
4601 ctttcctgat caaagcgccc ttattcaaca acaacaagcg tacatgaatt
4651 gggcagatca agatcgagcg cgcctcaaac aggaggtgaa tgacgaaaga
4701 agaagaggac aactcctctt tgaagaagaa aggagacgag ctgaaaggct
4751 cggtgatgaa atcgctcagc taaaactgag gatggaatct atggaggagg
4801 aacaaaagct aaaaaatgac ctctatactc agacagaaaa ggacctcaaa
4851 aatagaataa gggtcctgaa ggataagaag atggagctta aagaaasctt
4901 aaaaaaggct aagargmtga ggtacgaagc mttagaaaag gaaaggaaat
4951 caggacggaa agttctgaag atgagaggag tagcacccag tctgaagagg
5001 tgctagattc ggaaaactag caagggagca ggaggagaat ggtggctcct
5051 gttctcaaat aacagaatcc accatagytg ttgaggagct tagacctcaa
5101 wctgagatcg tagggatgat acaaggtgcc gtatctcctc agatcgtatc
5151 ttcggcatca ggagagctga ataacagact ttacaacatg aaagtctgta
5201 taagaatacg aggttgccca gaattctcgg tcaatgcaat tttagatact
5251 ggggcaacag tttgctgcat agaggaagaa agggttccca aagaaggact
5301 agaagagagc aaaatgactg ctcaattcac aggccttaac tcaacccagc
5351 aaacaaggaa gaagctcaag gaaggatata tgttgatagg agaacatatg
5401 ttcccacttc ctttcgtata tgctctcaac cccatgagaa taggaagagg
5451 catccagttc ataattggat gtaatttcat cagaaggatg aaagggggac
5501 tccgtataga aggaccaaca gtcaccttct atagaaatgt ttctacaata
5551 gaaacacagg agaaatcgac tgtagcagct actataggaa gcatcaatga
5601 aggaagaacc ttaattttcc caaggtttag aaaagaagta gccgccctga
5651 taaaggaagg atttattggg aataatcctc tccttcactg gacaaaaaat
5701 agggtttact gtaaattgca gatcaaaaac actgatctaa tcatacaaga
5751 ccctccactg aagcatgtca cacctgctgc aagagagttc tttaagagcc
5801 aaataagtga cctgctaaag gctaagctta tccggccttc aaagagtaag
5851 cataggacca cggccttcat ggttgaatca ggaacgattg tggatcctaa
5901 aacaggaaaa gagattagag gcaaacaaag gatggtttac aactacaagc
5951 gcctcaatga caatacagaa aaagatcagt attccctacc gggaatcaac
6001 actattgtca gtagaatatc tggaaaaaag atattctcaa aatttgatct
6051 caaagcaggc tttcatcaga taagaatgga ggaaaagtcc aaaccatgga
6101 ctgctttttg gactccagaa gggctctatg aatttgaagt catgcccttt
6151 ggactgatga atgcaccagc agattttcag agaaagatgg ataatgcatt
6201 taggggaaca gatgcattta tcgcagtata tattgatgac atattggtat
6251 tctctgaaaa tgaagaagag catgaggatc atctcttaaa tttggcccaa
6301 attgtgagaa gagaagggct tattctaagc ccaacaaaaa tgaaaattgg
6351 agtcaaagaa gtggatttcc taggaataaa gatccaagga aataagatcc
6401 aattgcagga gcatatcctt aaaaagatag gagactttaa ggaaaaggat
6451 ttgctcacaa agaaagggtt aagatcttgg ctaggcatcc ttaactatgc
6501 taggcaatat attcctaatt taggaaaatt gctaggccca ttatatggga
6551 agacatcgcc taccggagag ataaggttta atgcccaaga ttggaagttg
6601 gtacgtgaaa tcaaaagaaa agtccaacaa cttccccctt tggaaatacc
6651 cccaaaggac tgctgtatag tccttgaagc tgacggttgc atggaaggat
6701 ggggagccgt ctgtaaatgg aaacagtccg catatgatcc acgttcaaag
6751 gaacgtatcg cagcatatgc atctggcaag tttcagccta tcaaatcaac
6801 aattgatgct gaaatctttg caataatgaa tgccatggaa gccttcaaga
6851 tttactatct tgacaaaaag gaaatgataa taaggacaga ttgtgaagca
6901 atagtgtcct tcttcaataa aagtgcttcc aacaagccaa gcagggctcg
6951 gtggatatca ttcaccgact atataactgg aacggggatt aagataagga
7001 tagagcatat agatggcaag gataatattc ttgcagatta tttatctagg
7051 cttgtatttt ctctgattat tgcagaatgg aaaacgcaag agaaatcaat
7101 agcccccctt caagccccca gaataactct aaccaaagrg tcatgcagca
7151 agcagcaaga gcctttgctc ttgagagagc tatctatgaa acgaccgcta
7201 gaagacaaag aggaccaagg accatggagc atcctgctgt taacccagcc
7251 aatagaacac ctgctcaaag gttcgaggaa tggtcaagac aggagcaaac
7301 ctatcacaga gactggatca atgaccagtt tgaaaatagc cttaatgatc
7351 ttggacctct cgccagaggt tataatttca acttatggct tgaaagaagg
7401 ctccttagag aaggcctgcc aggagatgtg ggtgcacaac tccaagcaag
7451 cagacaggcc ttactgcatg aggcggccca ggaggccagg agaggcgtaa
7501 tgacgcttca aaggatgatc agaaatagag cccagcataa cagacgatac
7551 atcaccagag acaatgctgc tggtgactta caaggctcat actctgaagc
7601 ccaggcccaa gtggaagctg cggctatcat cctgatggcc ctgattgagg
7651 atctttaggt tgcttaacct aaagagacga cagaggacgg tggtaggtcc
7701 ggacaacgac tatcacctta tctgctatgt agtgcggcat gtgcactatt
7751 ttgtttttgt cggccatttg ctgacttttg tgtctagggt tttgaaaagc
7801 agatgcttct cggtgaagcg agtcttttct tttaagaaaa gagagataag
7851 ataaggtatc ttatcttgct ttgtcggcca ctcttccttt aggccatttg
7901 tcatcttatg aagatatcat cctatataag gatctcatat tttcatgaat
7951 gacatataga agaaagaggg aatcacccaa taaaaaaaaa aaaaaaaaca
8001 gagagaaaat cggcaaaatc atcacgactg agccgacgat ccaatagttt
8051 aaaaatttcc gctgtgtgtg ctagtgtctt tt