Sequence of DPV Equine arteritis virus
Equine arteritis virus (EAV) RNA genome
ACC No: X53459
Dated: 2001-06-25 | Length: 12704 | CRC: 462137819
!!NA_SEQUENCE 1.0
ID TOEAV standard; RNA; VRL; 12704 BP.
XX
AC X53459;
XX
SV X53459.3
XX
DT 16-JUL-1991 (Rel. 28, Created)
DT 25-JUN-2001 (Rel. 68, Last updated, Version 13)
XX
DE Equine arteritis virus (EAV) RNA genome
XX
KW complete genome; envelope protein; glycoprotein 2b; glycoprotein 3;
KW glycoprotein 4; glycoprotein 5; membrane protein; nucleocapsid gene; ORF1a;
KW ORF1ab; ORF2; ORF2a; ORF3; ORF4; ORF5; ORF6; ORF7; replicase polyprotein;
KW ribosomal frameshift signal.
XX
OS Equine arteritis virus
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales;
OC Arteriviridae; Arterivirus.
XX
RN [1]
RC revised by [3]
RA Spaan W.J.M.;
RT ;
RL Submitted (25-MAY-1990) to the EMBL/GenBank/DDBJ databases.
RL Spaan W.J.M., Dept. of Virology, Institute of Medical Microbiology, Faculty
RL of Medicine, State University of Leiden, The Netherlands.
XX
RN [2]
RX MEDLINE; 91237805.
RA den Boon J.A., Chirnside E.D., De Vries A.A.F., Snijder E.J., Spaan W.J.M.;
RT "Equine Arteritis Virus is not a Togavirus but belongs to the
RT Coronaviruslike Superfamily";
RL J. Virol. 65:2910-2920(1991).
XX
RN [3]
RP 1-12704
RA Snijder E.J.;
RT ;
RL Submitted (22-JUN-2001) to the EMBL/GenBank/DDBJ databases.
RL E.J. Snijder, Department of Virology, Leiden University Medical Center LUMC
RL P4-26, PO Box 9600, 2300 RC Leiden, The Netherlands
XX
RN [4]
RA Snijder E.J., Meulenberg J.J.M.;
RT "The molecular biology of arteriviruses";
RL J. Gen. Virol. 79:961-979(1998).
XX
RN [5]
RA Snijder E.J., van Tol H., Pedersen K.W., Raamsman M.J.B., De Vries A.A.F.;
RT "Identification of a novel structural protein of arteriviruses";
RL J. Virol. 73:6335-6345(1999).
XX
DR SPTREMBL; Q91DM1; Q91DM1.
DR SPTREMBL; Q91DM2; Q91DM2.
DR SWISS-PROT; P19810; NCAP_EAV.
DR SWISS-PROT; P28991; VENV_EAV.
DR SWISS-PROT; P28992; YOR2_EAV.
DR SWISS-PROT; P28993; YOR3_EAV.
DR SWISS-PROT; P28994; YOR4_EAV.
DR SWISS-PROT; P28995; YOR5_EAV.
XX
CC See also to for overlapping sequence.
XX
FH Key Location/Qualifiers
FH
FT source 1. .12704
FT /db_xref="taxon:11047"
FT /organism="Equine arteritis virus"
FT /strain="Bucyrus"
FT /cell_line="EAV-infected BHK-21 cells"
FT misc_feature 1. .211
FT /note="EAV leader sequence"
FT CDS 225. .5408
FT /note="ORF1a"
FT /product="replicase ORF1a polyprotein"
FT /protein_id="CAC42774.2"
FT /translation="MATFSATGFGGSFVRDWSLDLPDACEHGAGLCCEVDGSTLCAECF
FT RGCEGMEQCPGLFMGLLKLASPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRV
FT VDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTTDQEQDGF
FT CWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMR
FT GSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYGGYNPPGDGACGYRCLAFMNGA
FT TVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKG
FT VGLCLDESCFRGICNCQRMSGPPPAPVSAAVLDHILEAATFGNVRVVTPEGQPRPVPAP
FT RVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQEPLASAGV
FT ASDSAPKWRVAKTVYSSAERFRTELVQRARSVGDVLVQALPLKTPAVQRYTMTLKMMRS
FT RFSWHCDVWYPLAVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASM
FT DHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVF
FT DLCSFAILYLCRNRCWRCFGRCVRVGPATHVLGSTGQRVSKLALIDLCDHFSKPTIDVV
FT GMATGWSGCYTGTAAMERQCASTVDPHSFDQKKAGATVYLTPPVNSGSALQCLNVMWKR
FT PIGSTVLGEQTGAVVTAVKSISFSPPCCVSTTLPTRPGVTVVDHALYNRLTASGVDPAL
FT LRVGQGDFLKLNPGFRLIGGWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVP
FT VTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLT
FT LASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAAL
FT AVYSLMGLRVKVNVPMCVTPAHFLLLARSAGQSREQMLRVSAAAPTNSLLGVARDCYVT
FT GTTRLYIPKEGGMVFEGLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGR
FT ANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDE
FT EGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLS
FT SLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRESSLSGPQLLLIACFMW
FT SYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVT
FT GNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAA
FT ALGHTLLLDVFSASGRFDRTFMMKYFLEGGVKESVTASVTRAYGKPITQESLTATLAAL
FT TDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKL
FT ADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEEANDT
FT PVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGT
FT TVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYEGLDQDKVLDLTNMLKVDPTE
FT LSSKDKAKARQLAHLLLDLANPVEAVNQLN"
FT CDS join(225. .5405,5405. .9751)
FT /db_xref="SPTREMBL:Q91DM2"
FT /note="ORF1ab"
FT /note="slippery sequence causes -1 frameshift"
FT /product="replicase ORF1b polyprotein"
FT /protein_id="CAC42775.2"
FT /translation="MATFSATGFGGSFVRDWSLDLPDACEHGAGLCCEVDGSTLCAECF
FT RGCEGMEQCPGLFMGLLKLASPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRV
FT VDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTTDQEQDGF
FT CWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMR
FT GSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYGGYNPPGDGACGYRCLAFMNGA
FT TVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKG
FT VGLCLDESCFRGICNCQRMSGPPPAPVSAAVLDHILEAATFGNVRVVTPEGQPRPVPAP
FT RVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQEPLASAGV
FT ASDSAPKWRVAKTVYSSAERFRTELVQRARSVGDVLVQALPLKTPAVQRYTMTLKMMRS
FT RFSWHCDVWYPLAVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASM
FT DHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVF
FT DLCSFAILYLCRNRCWRCFGRCVRVGPATHVLGSTGQRVSKLALIDLCDHFSKPTIDVV
FT GMATGWSGCYTGTAAMERQCASTVDPHSFDQKKAGATVYLTPPVNSGSALQCLNVMWKR
FT PIGSTVLGEQTGAVVTAVKSISFSPPCCVSTTLPTRPGVTVVDHALYNRLTASGVDPAL
FT LRVGQGDFLKLNPGFRLIGGWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVP
FT VTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLT
FT LASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAAL
FT AVYSLMGLRVKVNVPMCVTPAHFLLLARSAGQSREQMLRVSAAAPTNSLLGVARDCYVT
FT GTTRLYIPKEGGMVFEGLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGR
FT ANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDE
FT EGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLS
FT SLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRESSLSGPQLLLIACFMW
FT SYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVT
FT GNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAA
FT ALGHTLLLDVFSASGRFDRTFMMKYFLEGGVKESVTASVTRAYGKPITQESLTATLAAL
FT TDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKL
FT ADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEEANDT
FT PVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGT
FT TVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYEGLDQDKVLDLTNMLKVDPTE
FT LSSKDKAKARQLAHLLLDLANPVEAVNQLNLRAPHIFPGDVGRRTFADSKDKGFVALHS
FT RTMFLAARDFLFNIKFVCDEEFTKTPKDTLLGYVRACPGYWFIFRRTHRSLIDAYWDSM
FT ECVYALPTISDFDVSPGDVAVTGERWDFESPGGGRAKRLTADLVHAFQGFHGASYSYDD
FT KVAAAVSGDPYRSDGVLYNTRWGNIPYSVPTNALEATACYRAGCEAVTDGTNVIATIGP
FT FPEQQPIPDIPKSVLDNCADISCDAFIAPAAETALCGDLEKYNLSTQGFVLPSVFSMVR
FT AYLKEEIGDAPPLYLPSTVPSKNSQAGINGAEFPTKSLQSYCLIDDMVSQSMKSNLQTA
FT TMATCKRQYCSKYKIRSILGTNNYIGLGLRACLSGVTAAFQKAGKDGSPIYLGKSKFDP
FT IPAPDKYCLETDLESCDRSTPALVRWFATNLIFELAGQPELVHSYVLNCCHDLVVAGSV
FT AFTKRGGLSSGDPITSISNTIYSLVLYTQHMLLCGLEGYFPEIAEKYLDGSLELRDMFK
FT YVRVYIYSDDVVLTTPNQHYAASFDRWVPHLQALLGFKVDPKKTVNTSSPSFLGCRFKQ
FT VDGKCYLASLQDRVTRSLLYHIGAKNPSEYYEAAVSIFKDSIICCDEDWWTDLHRRISG
FT AARTDGVEFPTIEMLTSFRTKQYESAVCTVCGAAPVAKSACGGWFCGNCVPYHAGHCHT
FT TSLFANCGHDIMYRSTYCTMCEGSPKQMVPKVPHPILDHLLCHIDYGSKEELTLVVADG
FT RTTSPPGRYKVGHKVVAVVADVGGNIVFGCGPGSHIAVPLQDTLKGVVVNKALKNAAAS
FT EYVEGPPGSGKTFHLVKDVLAVVGSATLVVPTHASMLDCINKLKQAGADPYFVVPKYTV
FT LDFPRPGSGNITVRLPQVGTSEGETFVDEVAYFSPVDLARILTQGRVKGYGDLNQLGCV
FT GPASVPRNLWLRHFVSLEPLRVCHRFGAAVCDLIKGIYPYYEPAPHTTKVVFVPNPDFE
FT KGVVITAYHKDRGLGHRTIDSIQGCTFPVVTLRLPTPQSLTRPRAVVAVTRASQELYIY
FT DPFDQLSGLLKFTKEAEAQDLIHGPPTACHLGQEIDLWSNEGLEYYKEVNLLYTHVPIK
FT DGVIHSYPNCGPACGWEKQSNKISCLPRVAQNLGYHYSPDLPGFCPIPKELAEHWPVVS
FT NDRYPNCLQITLQQVCELSKPCSAGYMVGQSVFVQTPGVTSYWLTEWVDGKARALPDSL
FT FSSGRFETNSRAFLDEAEEKFAAAHPHACLGEINKSTVGGSHFIFSQYLPPLLPADAVA
FT LVGASLAGKAAKAACSVVDVYAPSFEPYLHPETLSRVYKIMIDFKPCRLMVWRNATFYV
FT QEGVDAVTSALAAVSKLIKVPANEPVSFHVASGYRTNALVAPQAKISIGAYAAEWALST
FT EPPPAGYAIVRRYIVKRLLSSTEVFLCRRGVVSSTSVQTICALEGCKPLFNFLQIGSVI
FT GPV"
FT misc_feature 5399. .5405
FT /note="ribosomal frameshift slippery sequence"
FT CDS 9751. .9954
FT /db_xref="SPTREMBL:Q91DM1"
FT /note="ORF2a"
FT /product="envelope (E) protein"
FT /protein_id="CAC42776.1"
FT /translation="MGLVWSLISNSIQTIIADFAISVIDAALFFLMLLALAVVTVFLFW
FT LIVAIGRSLVARCSRGARYRPV"
FT CDS 9824. .10507
FT /db_xref="SWISS-PROT:P28992"
FT /note="ORF2b"
FT /product="glycoprotein 2b (GP2b)"
FT /protein_id="CAA37541.1"
FT /translation="MQRFSFSCYLHWLLLLCFFSGSLLPSAAAWWRGVHEVRVTDLFKD
FT LQCDNLRAKDAFPSLGYALSIGQSRLSYMLQDWLLAAHRKEVMPSNIMPMPGLTPDCFD
FT HLESSSYAPFINAYRQAILSQYPQELQLEAINCKLLAVVAPALYHNYHLANLTGPATWV
FT VPTVGQLHYYASSSIFASSVEVLAAIILLFACIPLVTRVYISFTRLMSPSRRTSSGTLP
FT RRKIL"
FT CDS 10306. .10797
FT /db_xref="SWISS-PROT:P28993"
FT /note="ORF3"
FT /product="glycoprotein 3 (GP3)"
FT /protein_id="CAA37542.1"
FT /translation="MGRAYSGPVALLCFFLYFCFICGSVGSNNTTICMHTTSDTSVHLF
FT YAANVTFPSHFQRHFAAAQDFVVHTGYEYAGVTMLVHLFANLVLTFPSLVNCSRPVNVF
FT ANASCVQVVCSHTNSTTGLGQLSFSFVDEDLRLHIRPTLICWFALLLVHFLPMPRCRGS
FT "
FT CDS 10700. .11158
FT /db_xref="SWISS-PROT:P28994"
FT /note="ORF4"
FT /product="glycoprotein 4 (GP4)"
FT /protein_id="CAA37543.1"
FT /translation="MKIYGCISGLLLFVGLPCCWCTFYPCHAAEARNFTYISHGLGHVH
FT GHEGCRNFINVTHSAFLYLNPTTPTAPAITHCLLLVLAAKMEHPNATIWLQLQPFGYHV
FT AGDVIVNLEEDKRHPYFKLLRAPALPLGFVAIVYVLLRLVRWAQRCYL"
FT CDS 11146. .11913
FT /db_xref="SWISS-PROT:P28995"
FT /note="ORF5"
FT /product="glycoprotein 5 (GP5)"
FT /protein_id="CAA37544.1"
FT /translation="MLSMIVLLFLLWGAPSHAYFSYYTAQRFTDFTLCMLTDRGVIANL
FT LRYDEHTALYNCSASKTCWYCTFLDEQIITFGTDCDDTYAVPVAEVLEQAHGPYSALFD
FT DMPPFIYYGREFGIVVLDVFMFYPVLVLFFLSVLPYATLILEMCVSILFIIYGIYSGAY
FT LAMGIFAATLAIHSIVVLRQLLWLCLAWRYRCTLHASFISAEGKVYPVDPGLPVAAVGN
FT RLLVPGRPTIDYAVAYGSKVNLVRLGAAEVWEP"
FT CDS 11901. .12389
FT /db_xref="SWISS-PROT:P28991"
FT /note="ORF6"
FT /product="hypothetical protein"
FT /protein_id="CAA37545.1"
FT /translation="MGAIDSFCGDGILGEYLDYFILSVPLLLLLTRYVASGLVYVLTAL
FT FYSFVLAAYIWFVIVGRAFSTAYAFVLLAAFLLLVMRMIVGMMPRLRSIFNHRQLVVAD
FT FVDTPSGPVPIPRSTTQVVVRGNGYTAVGNKLVDGVKTITSAGRLFSKRTAATAYKLQ"
FT CDS 12313. .12645
FT /db_xref="SWISS-PROT:P19810"
FT /note="ORF7"
FT /product="hypothetical protein"
FT /protein_id="CAA37546.1"
FT /translation="MASRRSRPQAASFRNGRRRQPTSYNDLLRMFGQMRVRKPPAQPTQ
FT AIIAEPGDLRHDLNQQERATLSSNVQRFFMIGHGSLTADAGGLTYTVSWVPTKQIQRKV
FT APPAGP"
FT polyA_site 12704
XX
SQ Sequence 12704 BP; 2692 A; 3258 C; 3305 G; 3449 T; 0 other;
X53459 Length: 12704 May 27, 2002 15:30 Type: N Check: 8573 ..
1 gctcgaagtg tgtatggtgc catatacggc tcaccaccat atacactgca
51 agaattacta ttcttgtggg cccctctcgg taaatcctag agggctttcc
101 tctcgttatt gcgagattcg tcgttagata acggcaagtt ccctttctta
151 ctatcctatt ttcatcttgt ggcttgacgg gtcactgcca tcgtcgtcga
201 tctctatcaa ctacccttgc gactatggca accttctccg ctactggatt
251 tggagggagt tttgttaggg actggtccct ggacttaccc gacgcttgtg
301 agcatggcgc gggattgtgc tgcgaagtgg acggctccac cttatgcgcc
351 gagtgttttc gcggttgcga aggaatggag caatgtcctg gcttgttcat
401 gggactgtta aaactggctt cgccagttcc agtgggacat aagttcctga
451 ttggttggta tcgagctgcc aaagtcaccg ggcgttacaa tttccttgag
501 ctgttgcaac accctgcttt cgcccagctg cgtgtggttg atgctaggtt
551 agccattgaa gaggcaagtg tgtttatttc cactgaccac gcgtctgcta
601 agcgtttccc tggcgctaga tttgcgctga caccggtgta tgctaacgct
651 tgggttgtga gcccggctgc taacagtttg atagtgacca ctgaccagga
701 acaagatggg ttctgctggt taaaactttt gccacctgac cgccgtgagg
751 ctggtttgcg gttgtattac aaccattacc gcgaacaaag gaccgggtgg
801 ctgtctaaaa caggacttcg cttatggctt ggagacctgg gtttgggcat
851 caatgcgagc tctggagggc tgaaattcca cattatgagg ggttcgcctc
901 agcgagcttg gcatatcaca acacgcagct gcaagctgaa gagctactac
951 gtttgtgaca tctctgaagc agactggtcc tgtttgcctg ctggcaacta
1001 cggcggctac aatccaccag gggacggagc ttgcggttac aggtgcttgg
1051 ccttcatgaa tggcgccact gttgtgtcgg ctggttgcag ttctgacttg
1101 tggtgtgatg atgagttggc ttatcgagtc tttcaattgt cacccacgtt
1151 cacggttacc atcccaggtg ggcgagtttg tccgaatgcc aagtacgcaa
1201 tgatttgtga caagcagcac tggcgcgtca aacgtgcaaa gggcgtcggc
1251 ctgtgtctcg atgaaagctg tttcaggggc atctgcaatt gccaacgcat
1301 gagtggacca ccacctgcac ccgtgtcagc cgccgtgtta gatcacatac
1351 tggaggcggc gacgtttggc aacgttcgcg tggttacacc tgaagggcag
1401 ccacgccccg taccagcgcc gcgagttcgt cccagcgcca actcttctgg
1451 agatgtcaaa gatccggcgc ccgttccgcc agtaccaaaa ccaaggacca
1501 agcttgccac accgaaccca actcaggcgc ccatcccagc accgcgcacg
1551 cgacttcaag gggcctcaac acaggagcca ctggcgagtg caggagttgc
1601 ttctgactcg gcacctaaat ggcgtgtggc caaaactgtg tacagctccg
1651 cggagcgctt tcggaccgaa ctggtacaac gtgctcggtc cgttggggac
1701 gttcttgttc aagcgctacc gctcaaaacc ccagcagtgc agcggtatac
1751 catgactctg aagatgatgc gttcacgctt cagttggcac tgcgacgtgt
1801 ggtacccttt ggctgtaatc gcttgtttgc tccctatatg gccatctctt
1851 gctttgctcc ttagctttgc cattgggttg atacccagtg tgggcaataa
1901 tgttgttctg acagcgcttc tggtttcatc agctaattat gttgcgtcaa
1951 tggaccatca atgtgaaggt gcggcttgct tagccttgct ggaagaagaa
2001 cactattata gagcggtccg ttggcgcccg attacaggcg cgctgtcgct
2051 tgtgctcaat ttactggggc aggtaggcta tgtagctcgt tccacctttg
2101 atgcagctta tgttccttgc actgtgttcg atctttgcag ctttgctatt
2151 ctgtacctct gccgcaatcg ttgctggaga tgcttcggac gctgtgtgcg
2201 agttgggcct gccacgcatg ttttgggctc caccgggcaa cgagtttcca
2251 aactggcgct cattgatttg tgtgaccact tttcaaagcc caccatcgat
2301 gttgtgggca tggcaactgg ttggagcgga tgttacacag gaaccgccgc
2351 aatggagcgt cagtgtgcct ctacggtgga ccctcactcg ttcgaccaga
2401 agaaggcagg agcgactgtt tacctcaccc cccctgtcaa cagcgggtca
2451 gcgctgcagt gcctcaatgt catgtggaag cgaccaattg ggtccactgt
2501 ccttggggaa caaacaggag ctgttgtgac ggcggtcaag agtatctctt
2551 tctcacctcc ctgctgcgtc tctaccactt tgcccacccg acccggtgtg
2601 accgttgtcg accatgctct ttacaaccgg ttgactgctt caggggtcga
2651 tcccgcttta ttgcgtgttg ggcaaggtga ttttctaaaa cttaatccgg
2701 ggttccggct gataggtgga tggatttatg ggatatgcta ttttgtgttg
2751 gtggttgtgt caacttttac ctgcttacct atcaaatgtg gcattggcac
2801 ccgcgaccct ttctgccgca gagtgttttc tgtacccgtc accaagaccc
2851 aagagcactg ccatgctgga atgtgtgcta gcgctgaagg catctctctg
2901 gactctctgg ggttaactca gttacaaagt tactggatcg cagccgtcac
2951 tagcggatta gtgatcttgt tggtctgcca ccgcctggcc atcagcgcct
3001 tggacttgtt gactctagct tcccctttag tgttgcttgt gttcccttgg
3051 gcatctgtgg ggcttttact tgcttgcagt ctcgctggtg ctgctgtgaa
3101 aatacagttg ttggcgacgc tttttgtgaa tctgttcttt ccccaagcta
3151 cccttgtcac tatgggatac tgggcgtgcg tggcggcttt ggccgtttac
3201 agtttgatgg gcttgcgagt gaaagtgaat gtgcccatgt gtgtgacacc
3251 tgcccatttt ctgctgctgg cgaggtcagc tggacagtca agagagcaga
3301 tgctccgggt cagcgctgct gcccccacca attcactgct tggagtggct
3351 cgtgattgtt atgtcacagg cacaactcgg ctgtacatac ccaaggaagg
3401 cgggatggtg tttgaagggc tattcaggtc accgaaggcg cgcggcaacg
3451 tcggcttcgt ggctggtagc agctacggca cagggtcagt gtggaccagg
3501 aacaacgagg tcgtcgtact gacagcgtca cacgtggttg gccgcgctaa
3551 catggccact ctgaagatcg gtgacgcaat gctgactctg actttcaaaa
3601 agaatggcga cttcgccgag gcagtgacga cacagtccga gctcccaggc
3651 aattggccac agttgcattt cgcccaacca acaaccgggc ccgcttcatg
3701 gtgcactgcc acaggagatg aagaaggctt gctcagtggc gaggtttgtc
3751 tggcgtggac tactagtggc gactctggat ctgcagtggt tcagggtgac
3801 gctgtggtag gggtccacac cggttcgaac acaagtggtg ttgcctacgt
3851 gaccacccca agcggaaaac tccttggcgc cgacaccgtg actttgtcat
3901 cactgtcaaa gcatttcaca ggccctttga catcaatccc gaaggacatc
3951 cctgacaaca ttattgccga tgttgatgct gttcctcgtt ctctggccat
4001 gctgattgat ggcttatcca atagagagag cagcctttct ggacctcagt
4051 tgttgttaat tgcttgtttt atgtggtctt atcttaacca acctgcttac
4101 ttgccttatg tgctgggctt ctttgccgct aacttcttcc tgccaaaaag
4151 tgttggccgc cctgtggtca ctgggcttct atggttgtgc tgcctcttca
4201 caccgctttc catgcgcttg tgcttgttcc atctggtctg tgctaccgtc
4251 acgggaaacg tgatatcttt gtggttctac atcactgccg ctggcacgtc
4301 ttacctttct gagatgtggt tcggaggcta tcccaccatg ttgtttgtgc
4351 cacggttcct agtgtaccag ttccccggct gggctattgg cacagtacta
4401 gcggtatgca gcatcaccat gctggctgct gccctcggtc acaccctgtt
4451 actggatgtg ttctccgcct caggtcgctt tgacaggact ttcatgatga
4501 aatacttcct ggagggagga gtgaaagaga gtgtcaccgc ctcagtcacc
4551 cgcgcttatg gcaaaccaat tacccaggag agtctcactg caacattagc
4601 tgccctcact gatgatgact tccaattcct ctctgatgtg cttgactgtc
4651 gggccgtccg atcggcaatg aatctgcgtg ccgctctcac aagttttcaa
4701 gtggcgcagt atcgtaacat ccttaatgca tccttgcaag tcgatcgtga
4751 cgctgctcgt agtcgcagac taatggcaaa actggctgat tttgcggttg
4801 aacaagaagt aacagctgga gaccgtgttg tggttatcga cggtctggac
4851 cgcatggctc acttcaaaga cgatttggtg ctggttcctt tgaccaccaa
4901 agtagtaggc ggttctaggt gcaccatttg tgacgtcgtt aaggaagaag
4951 ccaatgacac cccagttaag ccaatgccca gcaggagacg ccgcaagggc
5001 ctgcctaaag gtgctcagtt ggagtgggac cgtcaccagg aagagaagag
5051 gaacgccggt gatgatgatt ttgcggtctc gaatgattat gtcaagagag
5101 tgccaaagta ctgggatccc agcgacaccc gaggcacgac agtgaaaatc
5151 gccggcacta cctatcagaa agtggttgac tattcaggca atgtgcatta
5201 cgtggagcat caggaagatc tgctagacta cgtgctgggc aaggggagct
5251 atgaaggcct agatcaggac aaagtgttgg acctcacaaa catgcttaaa
5301 gtggacccca cggagctctc ctccaaagac aaagccaagg cgcgtcagct
5351 tgctcatctg ctgttggatc tggctaaccc agttgaggca gtgaatcagt
5401 taaactgaga gcgccccaca tctttcccgg cgatgtgggg cgtcggacct
5451 ttgctgactc taaagacaag ggtttcgtgg ctctacacag tcgcacaatg
5501 tttttagctg cccgggactt tttatttaac atcaaatttg tgtgcgacga
5551 agagttcaca aagaccccaa aagacacact gcttgggtac gtacgcgcct
5601 gccctggtta ctggtttatt ttccgtcgta cgcaccggtc gctgattgat
5651 gcatactggg acagtatgga gtgcgtttac gcgcttccca ccatatctga
5701 ttttgatgtg agcccaggtg acgtcgcagt gacgggcgag cgatgggatt
5751 ttgaatctcc cggaggaggc cgtgcaaaac gtctcacagc tgatctggtg
5801 cacgcttttc aagggttcca cggagcctct tattcctatg atgacaaggt
5851 ggcagctgct gtcagtggtg acccgtatcg gtcggacggc gtcttgtata
5901 acacccgttg gggcaacatt ccatattctg tcccaaccaa tgctttggaa
5951 gccacagctt gctaccgtgc tggatgtgag gccgttaccg acgggaccaa
6001 cgtcatcgca acaattgggc ccttcccgga gcaacaaccc ataccggaca
6051 tcccaaagag cgtgcttgac aactgcgctg acatcagctg tgacgctttc
6101 atagcgcccg ctgcagagac agccctgtgt ggagatttag agaaatacaa
6151 cctatccacg cagggttttg tgttgcctag tgttttctcc atggtgcggg
6201 cgtacttaaa agaggagatt ggagacgctc caccactcta cttgccatct
6251 actgtaccat ctaaaaattc acaagccgga attaacggcg ctgagtttcc
6301 tacaaagtct ttacagagct actgtttgat tgatgacatg gtgtcacagt
6351 ccatgaaaag caatctacaa accgccacca tggcgacttg taaacggcaa
6401 tactgttcca aatacaagat taggagcatt ctgggcacca acaattacat
6451 tggcctaggt ttgcgtgcct gcctttcggg ggttacggcc gcattccaaa
6501 aagctggaaa ggatgggtca ccgatttatt tgggcaagtc aaaattcgac
6551 ccgataccag ctcctgacaa gtactgcctt gaaacagacc tggagagttg
6601 tgatcgctcc accccggctt tggtgcgttg gttcgctact aatcttattt
6651 ttgagctagc tggccagccc gagttggtgc acagctacgt gttgaattgc
6701 tgtcacgatc tagttgtggc gggtagtgta gcattcacca aacgcggggg
6751 tttgtcatct ggagacccta tcacttccat ttccaatacc atctattcat
6801 tggtgctgta cacccagcac atgttgctat gtggacttga aggctatttc
6851 ccagagattg cagaaaaata tcttgatggc agcctggagc tgcgggacat
6901 gttcaagtac gttcgagtgt acatctactc ggacgatgtg gttctaacca
6951 cacccaacca gcattacgcg gccagctttg accgctgggt cccccacctg
7001 caggcgctgc taggtttcaa ggttgaccca aagaaaactg tgaacaccag
7051 ctccccttcc tttttgggct gccggttcaa gcaagtggac ggcaagtgtt
7101 atctagccag tcttcaggac cgcgttacac gctctctgtt ataccacatt
7151 ggtgcaaaga atccctcaga gtactatgaa gctgctgttt ccatctttaa
7201 ggactccatt atctgctgtg atgaagactg gtggacggac ctccatcgac
7251 gtatcagtgg cgctgcgcgt accgacggag ttgagttccc caccattgaa
7301 atgttaacat ccttccgcac caagcagtat gagagtgccg tgtgcacagt
7351 ttgtggggcc gcccccgtgg ccaagtctgc ttgtggaggg tggttctgtg
7401 gcaattgtgt cccgtaccac gcgggtcatt gtcacacaac ctcgctcttc
7451 gccaactgcg ggcacgacat catgtaccgc tccacttact gcacaatgtg
7501 tgagggttcc ccaaaacaga tggtaccaaa agtgcctcac ccgatcctgg
7551 atcatttgct gtgccacatt gattacggca gtaaagagga actaactctg
7601 gtagtggcgg atggtcgaac aacatcaccg cccgggcgct acaaagtggg
7651 tcacaaggta gtcgccgtgg ttgcagatgt gggaggcaac attgtgtttg
7701 ggtgcggtcc tggatcacac atcgcagtac cacttcagga tacgctcaag
7751 ggcgtggtgg tgaataaagc tctgaagaac gccgccgcct ctgagtacgt
7801 ggaaggaccc cctgggagtg ggaagacttt tcacctggtc aaagatgtgc
7851 tagccgtggt cggtagcgcg accttggttg tgcccaccca cgcgtccatg
7901 ctggactgca tcaacaagct caaacaagcg ggcgccgatc catactttgt
7951 ggtgcccaag tatacagttc ttgactttcc ccggcctggc agtggaaaca
8001 tcacagtgcg actgccacag gtcggaacca gtgagggaga aacctttgtg
8051 gatgaggtgg cctacttctc accagtggat ctggcgcgca ttttaaccca
8101 gggtcgagtc aagggttacg gtgatttaaa tcagctcggg tgcgtcggac
8151 ccgcgagcgt gccacgtaac ctttggctcc gacattttgt cagcctggag
8201 cccttgcgag tgtgccatcg attcggcgct gctgtgtgtg atttgatcaa
8251 gggcatttat ccttattatg agccagctcc acataccact aaagtggtgt
8301 ttgtgccaaa tccagacttt gagaaaggtg tagtcatcac cgcctaccac
8351 aaagatcgcg gtcttggtca ccgcacaatt gattcaattc aaggctgtac
8401 attccctgtt gtgactcttc gactgcccac accccaatca ctgacgcgcc
8451 cgcgcgcagt tgtggcggtt actagggcgt ctcaggaatt atacatctac
8501 gacccctttg atcagcttag cgggttgttg aagttcacca aggaagcaga
8551 ggcgcaggac ttgatccatg gcccacctac agcatgccac ctgggccaag
8601 aaattgacct ttggtccaat gagggcctcg aatattacaa ggaagtcaac
8651 ctgctgtaca cacacgtccc catcaaggat ggtgtaatac acagttaccc
8701 taattgtggc cctgcctgtg gctgggaaaa gcaatccaac aaaatttcgt
8751 gcctcccgag agtggcacaa aatttgggct accactattc cccagactta
8801 ccaggatttt gccccatacc aaaagaactc gctgagcatt ggcccgtagt
8851 gtccaatgat agatacccga attgcttgca aattacctta cagcaagtat
8901 gtgaactcag taaaccgtgc tcagcgggct atatggttgg acaatctgtt
8951 ttcgtgcaga cgcctggtgt gacatcttac tggcttactg aatgggtcga
9001 cggcaaagcg cgtgctctac cagattcctt attctcgtcc ggtaggttcg
9051 agactaacag ccgcgctttc ctcgatgaag ccgaggaaaa gtttgccgcc
9101 gctcaccctc atgcctgttt gggagaaatt aataagtcca ccgtgggagg
9151 atcccacttc atcttttccc aatatttacc accattgcta cccgcagacg
9201 ctgttgccct ggtaggtgct tcattggctg ggaaagctgc taaagctgct
9251 tgcagcgttg ttgatgtcta tgctccatca tttgaacctt atctacaccc
9301 tgagacactg agtcgcgtgt acaagattat gatcgatttc aagccgtgta
9351 ggcttatggt gtggagaaac gcgacctttt atgtccaaga gggtgttgat
9401 gcagttacat cagcactagc agctgtgtcc aaactcatca aagtgccggc
9451 caatgagcct gtttcattcc atgtggcatc agggtacaga accaacgcgc
9501 tggtagcgcc ccaggctaaa atttcaattg gagcctacgc cgccgagtgg
9551 gcactgtcaa ctgaaccgcc acctgctggt tatgcgatcg tgcggcgata
9601 tattgtaaag aggctcctca gctcaacaga agtgttcttg tgccgcaggg
9651 gtgttgtgtc ttccacctca gtgcagacca tttgtgcact agagggatgt
9701 aaacctctgt tcaacttctt acaaattggt tcagtcattg ggcccgtgtg
9751 atgggcttag tgtggtcact gatttcaaat tctattcaga ctattattgc
9801 tgattttgct atttctgtga ttgatgcagc gcttttcttt ctcatgctac
9851 ttgcattggc tgttgttact gtgtttcttt tctggctcat tgttgccatc
9901 ggccgcagct tggtggcgcg gtgttcacga ggtgcgcgtt acagacctgt
9951 ttaaggattt gcagtgcgac aacctgcgcg cgaaagatgc cttcccgagt
10001 ctgggatatg ctctgtcgat tggccagtcg aggctatcgt atatgctgca
10051 ggattggttg cttgctgcgc accgcaagga agttatgcct tccaatatca
10101 tgcctatgcc cggtcttact cctgattgct ttgaccatct ggagtcttct
10151 agctatgctc catttatcaa tgcctatcgg caggcaattt tgagtcaata
10201 cccacaagag ctccagctcg aagccatcaa ctgtaaattg cttgctgtgg
10251 ttgcaccggc attgtatcat aattaccatc tagccaattt gaccggaccg
10301 gccacatggg tcgtgcctac agtgggccag ttgcactatt atgcttcttc
10351 ctctattttt gcttcatctg tggaagtgtt ggcagcaata atactactat
10401 ttgcatgcat accactagtg acacgagtgt acatctcttt tacgcggcta
10451 atgtcacctt cccgtcgcac ttccagcggc actttgccgc ggcgcaagat
10501 tttgtagtgc acacgggtta tgaatatgcc ggggtcacta tgttagtgca
10551 cttgtttgcc aacttggttc tgacatttcc gagcttagtt aattgttccc
10601 gccctgtgaa tgtctttgct aatgcttctt gcgtgcaagt ggtttgtagt
10651 cataccaact caactactgg cttgggtcaa ctttcttttt cctttgtaga
10701 tgaagatcta cggctgcata tcaggcctac tcttatttgt tggtttgcct
10751 tgttgttggt gcactttcta cccatgccac gctgcagagg ctcgtaattt
10801 tacttacatt agtcatggat tgggccacgt gcacggtcat gaggggtgta
10851 ggaattttat taatgtcact cattctgcat ttctttatct taatcccacc
10901 actcccactg cgccggctat aactcattgt ttacttctgg ttctggcagc
10951 caaaatggaa cacccaaacg ctactatctg gctgcagctg cagccgtttg
11001 ggtatcatgt ggctggcgat gtcattgtca acttggaaga ggacaagagg
11051 catccttact ttaaactttt gagagcgccg gctttaccgc ttggttttgt
11101 ggctatagtt tatgttcttt tacgactggt acgttgggct caacgatgtt
11151 atctatgatt gtattgctat tcttgctttg gggtgcgcca tcacatgctt
11201 acttctcata ctacaccgct cagcgcttca cagacttcac cttgtgtatg
11251 ctgacggatc gcggcgttat tgccaatttg ctgcgatatg atgagcacac
11301 tgctttgtac aattgttccg ccagtaaaac ctgttggtat tgcacattcc
11351 tggacgaaca gattatcacg tttggaaccg attgtgatga cacctacgcg
11401 gtcccagttg ctgaggtcct ggaacaggcg catggaccgt acagtgcgct
11451 gtttgatgac atgccccctt ttatttacta tggccgtgaa ttcggcatag
11501 ttgtgttgga tgtgtttatg ttctatcccg ttttagttct gtttttctta
11551 tcagtactac cctatgctac gcttattctt gaaatgtgtg tatctattct
11601 gtttataatc tatggcattt acagcggggc ctacttggcc atgggcatat
11651 ttgcggccac gcttgctata cattcaattg tggtcctccg ccaattactg
11701 tggttatgcc tggcttggcg ataccgctgt acgcttcacg cgtcctttat
11751 atcagctgag gggaaagtgt accccgtaga ccccggactc ccggttgccg
11801 ccgtgggcaa tcggttgtta gtcccaggta ggcccactat cgattatgca
11851 gtggcctacg gcagcaaagt caaccttgtg aggttggggg cagctgaggt
11901 atgggagcca tagattcatt ttgtggtgac gggattttag gtgagtatct
11951 agattacttt attctgtccg tcccactctt gctgttgctt actaggtatg
12001 tagcatctgg gttagtgtat gttttgactg ccttgttcta ttcctttgta
12051 ttagcagctt atatttggtt tgttatagtt ggaagagcct tttctactgc
12101 ttatgctttt gtgcttttgg ctgcttttct gttattagta atgaggatga
12151 ttgtgggtat gatgcctcgt cttcggtcca ttttcaacca tcgccaactg
12201 gtggtagctg attttgtgga cacacctagt ggacctgttc ccatcccccg
12251 ctcaactact caggtagtgg ttcgcggcaa cgggtacacc gcagttggta
12301 acaagcttgt cgatggcgtc aagacgatca cgtccgcagg ccgcctcttt
12351 tcgaaacgga cggcggcgac agcctacaag ctacaatgac ctactgcgca
12401 tgtttggtca gatgcgggtc cgcaaaccgc ccgcgcaacc cactcaggct
12451 attattgcag agcctggaga ccttaggcat gatttaaatc aacaggagcg
12501 cgccaccctt tcgtcgaacg tacaacggtt cttcatgatt gggcatggtt
12551 cactcactgc agatgccgga ggactcacgt acaccgtcag ttgggttcct
12601 accaaacaaa tccagcgcaa agttgcgcct ccagcagggc cgtaagacgt
12651 ggatattctc ctgtgtggcg tcatgttgaa gtagttatta gccacccagg
12701 aacc