Sequence of DPV Equine arteritis virus
Equine arteritis virus (EAV) RNA genome
ACC No: X53459
Dated: 2001-06-25 | Length: 12704 | CRC: 462137819
!!NA_SEQUENCE 1.0
ID   TOEAV standard; RNA; VRL; 12704 BP.
XX
AC   X53459;
XX
SV   X53459.3
XX
DT   16-JUL-1991 (Rel. 28, Created)
DT   25-JUN-2001 (Rel. 68, Last updated, Version 13)
XX
DE   Equine arteritis virus (EAV) RNA genome
XX
KW   complete genome; envelope protein; glycoprotein 2b; glycoprotein 3;
KW   glycoprotein 4; glycoprotein 5; membrane protein; nucleocapsid gene; ORF1a;
KW   ORF1ab; ORF2; ORF2a; ORF3; ORF4; ORF5; ORF6; ORF7; replicase polyprotein;
KW   ribosomal frameshift signal.
XX
OS   Equine arteritis virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales;
OC   Arteriviridae; Arterivirus.
XX
RN   [1]
RC   revised by [3]
RA   Spaan W.J.M.;
RT   ;
RL   Submitted (25-MAY-1990) to the EMBL/GenBank/DDBJ databases.
RL   Spaan W.J.M., Dept. of Virology, Institute of Medical Microbiology, Faculty
RL   of Medicine, State University of Leiden, The Netherlands.
XX
RN   [2]
RX   MEDLINE; 91237805.
RA   den Boon J.A., Chirnside E.D., De Vries A.A.F., Snijder E.J., Spaan W.J.M.;
RT   "Equine Arteritis Virus is not a Togavirus but belongs to the
RT   Coronaviruslike Superfamily";
RL   J. Virol. 65:2910-2920(1991).
XX
RN   [3]
RP   1-12704
RA   Snijder E.J.;
RT   ;
RL   Submitted (22-JUN-2001) to the EMBL/GenBank/DDBJ databases.
RL   E.J. Snijder, Department of Virology, Leiden University Medical center LUMC
RL   P4-26, PO Box 9600, 2300 RC Leiden, The Netherlands
XX
RN   [4]
RA   Snijder E.J., Meulenberg J.J.M.;
RT   "The molecular biology of arteriviruses";
RL   J. Gen. Virol. 79:961-979(1998).
XX
RN   [5]
RA   Snijder E.J., van Tol H., Pedersen K.W., Raamsman M.J.B., De Vries A.A.F.;
RT   "Identification of a novel structural protein of arteriviruses";
RL   J. Virol. 73:6335-6345(1999).
XX
DR   SPTREMBL; Q91DM1; Q91DM1.
DR   SPTREMBL; Q91DM2; Q91DM2.
DR   SWISS-PROT; P19810; NCAP_EAV.
DR   SWISS-PROT; P28991; VENV_EAV.
DR   SWISS-PROT; P28992; YOR2_EAV.
DR   SWISS-PROT; P28993; YOR3_EAV.
DR   SWISS-PROT; P28994; YOR4_EAV.
DR   SWISS-PROT; P28995; YOR5_EAV.
XX
CC   See also [...] to [...] for overlapping sequence.
XX FH Key Location/Qualifiers FH FT source 1. .12704 FT /db_xref="taxon:11047" FT /organism="Equine arteritis virus" FT /strain="Bucyrus" FT /cell_line="EAV-infected BHK-21 cells" FT misc_feature 1. .211 FT /note="EAV leader sequence" FT CDS 225. .5408 FT /note="ORF1a" FT /product="replicase ORF1a polyprotein" FT /protein_id="CAC42774.2" FT /translation="MATFSATGFGGSFVRDWSLDLPDACEHGAGLCCEVDGSTLCAECF FT RGCEGMEQCPGLFMGLLKLASPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRV FT VDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTTDQEQDGF FT CWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMR FT GSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYGGYNPPGDGACGYRCLAFMNGA FT TVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKG FT VGLCLDESCFRGICNCQRMSGPPPAPVSAAVLDHILEAATFGNVRVVTPEGQPRPVPAP FT RVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQEPLASAGV FT ASDSAPKWRVAKTVYSSAERFRTELVQRARSVGDVLVQALPLKTPAVQRYTMTLKMMRS FT RFSWHCDVWYPLAVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASM FT DHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVF FT DLCSFAILYLCRNRCWRCFGRCVRVGPATHVLGSTGQRVSKLALIDLCDHFSKPTIDVV FT GMATGWSGCYTGTAAMERQCASTVDPHSFDQKKAGATVYLTPPVNSGSALQCLNVMWKR FT PIGSTVLGEQTGAVVTAVKSISFSPPCCVSTTLPTRPGVTVVDHALYNRLTASGVDPAL FT LRVGQGDFLKLNPGFRLIGGWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVP FT VTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLT FT LASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAAL FT AVYSLMGLRVKVNVPMCVTPAHFLLLARSAGQSREQMLRVSAAAPTNSLLGVARDCYVT FT GTTRLYIPKEGGMVFEGLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGR FT ANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDE FT EGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLS FT SLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRESSLSGPQLLLIACFMW FT SYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVT FT GNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAA FT ALGHTLLLDVFSASGRFDRTFMMKYFLEGGVKESVTASVTRAYGKPITQESLTATLAAL FT TDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKL FT 
ADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEEANDT FT PVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGT FT TVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYEGLDQDKVLDLTNMLKVDPTE FT LSSKDKAKARQLAHLLLDLANPVEAVNQLN" FT CDS join(225. .5405,5405. .9751) FT /db_xref="SPTREMBL:Q91DM2" FT /note="ORF1ab" FT /note="slippery sequence causes -1 frameshift" FT /product="replicase ORF1b polyprotein" FT /protein_id="CAC42775.2" FT /translation="MATFSATGFGGSFVRDWSLDLPDACEHGAGLCCEVDGSTLCAECF FT RGCEGMEQCPGLFMGLLKLASPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRV FT VDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTTDQEQDGF FT CWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMR FT GSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYGGYNPPGDGACGYRCLAFMNGA FT TVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKG FT VGLCLDESCFRGICNCQRMSGPPPAPVSAAVLDHILEAATFGNVRVVTPEGQPRPVPAP FT RVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQEPLASAGV FT ASDSAPKWRVAKTVYSSAERFRTELVQRARSVGDVLVQALPLKTPAVQRYTMTLKMMRS FT RFSWHCDVWYPLAVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASM FT DHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVF FT DLCSFAILYLCRNRCWRCFGRCVRVGPATHVLGSTGQRVSKLALIDLCDHFSKPTIDVV FT GMATGWSGCYTGTAAMERQCASTVDPHSFDQKKAGATVYLTPPVNSGSALQCLNVMWKR FT PIGSTVLGEQTGAVVTAVKSISFSPPCCVSTTLPTRPGVTVVDHALYNRLTASGVDPAL FT LRVGQGDFLKLNPGFRLIGGWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVP FT VTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLT FT LASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAAL FT AVYSLMGLRVKVNVPMCVTPAHFLLLARSAGQSREQMLRVSAAAPTNSLLGVARDCYVT FT GTTRLYIPKEGGMVFEGLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGR FT ANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDE FT EGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLS FT SLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRESSLSGPQLLLIACFMW FT SYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVT FT GNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAA FT ALGHTLLLDVFSASGRFDRTFMMKYFLEGGVKESVTASVTRAYGKPITQESLTATLAAL 
FT TDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKL FT ADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEEANDT FT PVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGT FT TVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYEGLDQDKVLDLTNMLKVDPTE FT LSSKDKAKARQLAHLLLDLANPVEAVNQLNLRAPHIFPGDVGRRTFADSKDKGFVALHS FT RTMFLAARDFLFNIKFVCDEEFTKTPKDTLLGYVRACPGYWFIFRRTHRSLIDAYWDSM FT ECVYALPTISDFDVSPGDVAVTGERWDFESPGGGRAKRLTADLVHAFQGFHGASYSYDD FT KVAAAVSGDPYRSDGVLYNTRWGNIPYSVPTNALEATACYRAGCEAVTDGTNVIATIGP FT FPEQQPIPDIPKSVLDNCADISCDAFIAPAAETALCGDLEKYNLSTQGFVLPSVFSMVR FT AYLKEEIGDAPPLYLPSTVPSKNSQAGINGAEFPTKSLQSYCLIDDMVSQSMKSNLQTA FT TMATCKRQYCSKYKIRSILGTNNYIGLGLRACLSGVTAAFQKAGKDGSPIYLGKSKFDP FT IPAPDKYCLETDLESCDRSTPALVRWFATNLIFELAGQPELVHSYVLNCCHDLVVAGSV FT AFTKRGGLSSGDPITSISNTIYSLVLYTQHMLLCGLEGYFPEIAEKYLDGSLELRDMFK FT YVRVYIYSDDVVLTTPNQHYAASFDRWVPHLQALLGFKVDPKKTVNTSSPSFLGCRFKQ FT VDGKCYLASLQDRVTRSLLYHIGAKNPSEYYEAAVSIFKDSIICCDEDWWTDLHRRISG FT AARTDGVEFPTIEMLTSFRTKQYESAVCTVCGAAPVAKSACGGWFCGNCVPYHAGHCHT FT TSLFANCGHDIMYRSTYCTMCEGSPKQMVPKVPHPILDHLLCHIDYGSKEELTLVVADG FT RTTSPPGRYKVGHKVVAVVADVGGNIVFGCGPGSHIAVPLQDTLKGVVVNKALKNAAAS FT EYVEGPPGSGKTFHLVKDVLAVVGSATLVVPTHASMLDCINKLKQAGADPYFVVPKYTV FT LDFPRPGSGNITVRLPQVGTSEGETFVDEVAYFSPVDLARILTQGRVKGYGDLNQLGCV FT GPASVPRNLWLRHFVSLEPLRVCHRFGAAVCDLIKGIYPYYEPAPHTTKVVFVPNPDFE FT KGVVITAYHKDRGLGHRTIDSIQGCTFPVVTLRLPTPQSLTRPRAVVAVTRASQELYIY FT DPFDQLSGLLKFTKEAEAQDLIHGPPTACHLGQEIDLWSNEGLEYYKEVNLLYTHVPIK FT DGVIHSYPNCGPACGWEKQSNKISCLPRVAQNLGYHYSPDLPGFCPIPKELAEHWPVVS FT NDRYPNCLQITLQQVCELSKPCSAGYMVGQSVFVQTPGVTSYWLTEWVDGKARALPDSL FT FSSGRFETNSRAFLDEAEEKFAAAHPHACLGEINKSTVGGSHFIFSQYLPPLLPADAVA FT LVGASLAGKAAKAACSVVDVYAPSFEPYLHPETLSRVYKIMIDFKPCRLMVWRNATFYV FT QEGVDAVTSALAAVSKLIKVPANEPVSFHVASGYRTNALVAPQAKISIGAYAAEWALST FT EPPPAGYAIVRRYIVKRLLSSTEVFLCRRGVVSSTSVQTICALEGCKPLFNFLQIGSVI FT GPV" FT misc_feature 5399. .5405 FT /note="ribosomal frameshift slippery sequence" FT CDS 9751. 
.9954 FT /db_xref="SPTREMBL:Q91DM1" FT /note="ORF2a" FT /product="envelope (E) protein" FT /protein_id="CAC42776.1" FT /translation="MGLVWSLISNSIQTIIADFAISVIDAALFFLMLLALAVVTVFLFW FT LIVAIGRSLVARCSRGARYRPV" FT CDS 9824. .10507 FT /db_xref="SWISS-PROT:P28992" FT /note="ORF2b" FT /product="glycoprotein 2b (GP2b)" FT /protein_id="CAA37541.1" FT /translation="MQRFSFSCYLHWLLLLCFFSGSLLPSAAAWWRGVHEVRVTDLFKD FT LQCDNLRAKDAFPSLGYALSIGQSRLSYMLQDWLLAAHRKEVMPSNIMPMPGLTPDCFD FT HLESSSYAPFINAYRQAILSQYPQELQLEAINCKLLAVVAPALYHNYHLANLTGPATWV FT VPTVGQLHYYASSSIFASSVEVLAAIILLFACIPLVTRVYISFTRLMSPSRRTSSGTLP FT RRKIL" FT CDS 10306. .10797 FT /db_xref="SWISS-PROT:P28993" FT /note="ORF3" FT /product="glycoprotein 3 (GP3)" FT /protein_id="CAA37542.1" FT /translation="MGRAYSGPVALLCFFLYFCFICGSVGSNNTTICMHTTSDTSVHLF FT YAANVTFPSHFQRHFAAAQDFVVHTGYEYAGVTMLVHLFANLVLTFPSLVNCSRPVNVF FT ANASCVQVVCSHTNSTTGLGQLSFSFVDEDLRLHIRPTLICWFALLLVHFLPMPRCRGS FT " FT CDS 10700. .11158 FT /db_xref="SWISS-PROT:P28994" FT /note="ORF4" FT /product="glycoprotein 4 (GP4)" FT /protein_id="CAA37543.1" FT /translation="MKIYGCISGLLLFVGLPCCWCTFYPCHAAEARNFTYISHGLGHVH FT GHEGCRNFINVTHSAFLYLNPTTPTAPAITHCLLLVLAAKMEHPNATIWLQLQPFGYHV FT AGDVIVNLEEDKRHPYFKLLRAPALPLGFVAIVYVLLRLVRWAQRCYL" FT CDS 11146. .11913 FT /db_xref="SWISS-PROT:P28995" FT /note="ORF5" FT /product="glycoprotein 5 (GP5)" FT /protein_id="CAA37544.1" FT /translation="MLSMIVLLFLLWGAPSHAYFSYYTAQRFTDFTLCMLTDRGVIANL FT LRYDEHTALYNCSASKTCWYCTFLDEQIITFGTDCDDTYAVPVAEVLEQAHGPYSALFD FT DMPPFIYYGREFGIVVLDVFMFYPVLVLFFLSVLPYATLILEMCVSILFIIYGIYSGAY FT LAMGIFAATLAIHSIVVLRQLLWLCLAWRYRCTLHASFISAEGKVYPVDPGLPVAAVGN FT RLLVPGRPTIDYAVAYGSKVNLVRLGAAEVWEP" FT CDS 11901. .12389 FT /db_xref="SWISS-PROT:P28991" FT /note="ORF6" FT /product="hypothetical protein" FT /protein_id="CAA37545.1" FT /translation="MGAIDSFCGDGILGEYLDYFILSVPLLLLLTRYVASGLVYVLTAL FT FYSFVLAAYIWFVIVGRAFSTAYAFVLLAAFLLLVMRMIVGMMPRLRSIFNHRQLVVAD FT FVDTPSGPVPIPRSTTQVVVRGNGYTAVGNKLVDGVKTITSAGRLFSKRTAATAYKLQ" FT CDS 12313. 
.12645 FT /db_xref="SWISS-PROT:P19810" FT /note="ORF7" FT /product="hypothetical protein" FT /protein_id="CAA37546.1" FT /translation="MASRRSRPQAASFRNGRRRQPTSYNDLLRMFGQMRVRKPPAQPTQ FT AIIAEPGDLRHDLNQQERATLSSNVQRFFMIGHGSLTADAGGLTYTVSWVPTKQIQRKV FT APPAGP" FT polyA_site 12704 XX SQ Sequence 12704 BP; 2692 A; 3258 C; 3305 G; 3449 T; 0 other; X53459 Length: 12704 May 27, 2002 15:30 Type: N Check: 8573 .. 1 gctcgaagtg tgtatggtgc catatacggc tcaccaccat atacactgca 51 agaattacta ttcttgtggg cccctctcgg taaatcctag agggctttcc 101 tctcgttatt gcgagattcg tcgttagata acggcaagtt ccctttctta 151 ctatcctatt ttcatcttgt ggcttgacgg gtcactgcca tcgtcgtcga 201 tctctatcaa ctacccttgc gactatggca accttctccg ctactggatt 251 tggagggagt tttgttaggg actggtccct ggacttaccc gacgcttgtg 301 agcatggcgc gggattgtgc tgcgaagtgg acggctccac cttatgcgcc 351 gagtgttttc gcggttgcga aggaatggag caatgtcctg gcttgttcat 401 gggactgtta aaactggctt cgccagttcc agtgggacat aagttcctga 451 ttggttggta tcgagctgcc aaagtcaccg ggcgttacaa tttccttgag 501 ctgttgcaac accctgcttt cgcccagctg cgtgtggttg atgctaggtt 551 agccattgaa gaggcaagtg tgtttatttc cactgaccac gcgtctgcta 601 agcgtttccc tggcgctaga tttgcgctga caccggtgta tgctaacgct 651 tgggttgtga gcccggctgc taacagtttg atagtgacca ctgaccagga 701 acaagatggg ttctgctggt taaaactttt gccacctgac cgccgtgagg 751 ctggtttgcg gttgtattac aaccattacc gcgaacaaag gaccgggtgg 801 ctgtctaaaa caggacttcg cttatggctt ggagacctgg gtttgggcat 851 caatgcgagc tctggagggc tgaaattcca cattatgagg ggttcgcctc 901 agcgagcttg gcatatcaca acacgcagct gcaagctgaa gagctactac 951 gtttgtgaca tctctgaagc agactggtcc tgtttgcctg ctggcaacta 1001 cggcggctac aatccaccag gggacggagc ttgcggttac aggtgcttgg 1051 ccttcatgaa tggcgccact gttgtgtcgg ctggttgcag ttctgacttg 1101 tggtgtgatg atgagttggc ttatcgagtc tttcaattgt cacccacgtt 1151 cacggttacc atcccaggtg ggcgagtttg tccgaatgcc aagtacgcaa 1201 tgatttgtga caagcagcac tggcgcgtca aacgtgcaaa gggcgtcggc 1251 ctgtgtctcg atgaaagctg tttcaggggc atctgcaatt gccaacgcat 1301 gagtggacca ccacctgcac ccgtgtcagc cgccgtgtta 
gatcacatac 1351 tggaggcggc gacgtttggc aacgttcgcg tggttacacc tgaagggcag 1401 ccacgccccg taccagcgcc gcgagttcgt cccagcgcca actcttctgg 1451 agatgtcaaa gatccggcgc ccgttccgcc agtaccaaaa ccaaggacca 1501 agcttgccac accgaaccca actcaggcgc ccatcccagc accgcgcacg 1551 cgacttcaag gggcctcaac acaggagcca ctggcgagtg caggagttgc 1601 ttctgactcg gcacctaaat ggcgtgtggc caaaactgtg tacagctccg 1651 cggagcgctt tcggaccgaa ctggtacaac gtgctcggtc cgttggggac 1701 gttcttgttc aagcgctacc gctcaaaacc ccagcagtgc agcggtatac 1751 catgactctg aagatgatgc gttcacgctt cagttggcac tgcgacgtgt 1801 ggtacccttt ggctgtaatc gcttgtttgc tccctatatg gccatctctt 1851 gctttgctcc ttagctttgc cattgggttg atacccagtg tgggcaataa 1901 tgttgttctg acagcgcttc tggtttcatc agctaattat gttgcgtcaa 1951 tggaccatca atgtgaaggt gcggcttgct tagccttgct ggaagaagaa 2001 cactattata gagcggtccg ttggcgcccg attacaggcg cgctgtcgct 2051 tgtgctcaat ttactggggc aggtaggcta tgtagctcgt tccacctttg 2101 atgcagctta tgttccttgc actgtgttcg atctttgcag ctttgctatt 2151 ctgtacctct gccgcaatcg ttgctggaga tgcttcggac gctgtgtgcg 2201 agttgggcct gccacgcatg ttttgggctc caccgggcaa cgagtttcca 2251 aactggcgct cattgatttg tgtgaccact tttcaaagcc caccatcgat 2301 gttgtgggca tggcaactgg ttggagcgga tgttacacag gaaccgccgc 2351 aatggagcgt cagtgtgcct ctacggtgga ccctcactcg ttcgaccaga 2401 agaaggcagg agcgactgtt tacctcaccc cccctgtcaa cagcgggtca 2451 gcgctgcagt gcctcaatgt catgtggaag cgaccaattg ggtccactgt 2501 ccttggggaa caaacaggag ctgttgtgac ggcggtcaag agtatctctt 2551 tctcacctcc ctgctgcgtc tctaccactt tgcccacccg acccggtgtg 2601 accgttgtcg accatgctct ttacaaccgg ttgactgctt caggggtcga 2651 tcccgcttta ttgcgtgttg ggcaaggtga ttttctaaaa cttaatccgg 2701 ggttccggct gataggtgga tggatttatg ggatatgcta ttttgtgttg 2751 gtggttgtgt caacttttac ctgcttacct atcaaatgtg gcattggcac 2801 ccgcgaccct ttctgccgca gagtgttttc tgtacccgtc accaagaccc 2851 aagagcactg ccatgctgga atgtgtgcta gcgctgaagg catctctctg 2901 gactctctgg ggttaactca gttacaaagt tactggatcg cagccgtcac 2951 tagcggatta gtgatcttgt tggtctgcca ccgcctggcc atcagcgcct 3001 
tggacttgtt gactctagct tcccctttag tgttgcttgt gttcccttgg 3051 gcatctgtgg ggcttttact tgcttgcagt ctcgctggtg ctgctgtgaa 3101 aatacagttg ttggcgacgc tttttgtgaa tctgttcttt ccccaagcta 3151 cccttgtcac tatgggatac tgggcgtgcg tggcggcttt ggccgtttac 3201 agtttgatgg gcttgcgagt gaaagtgaat gtgcccatgt gtgtgacacc 3251 tgcccatttt ctgctgctgg cgaggtcagc tggacagtca agagagcaga 3301 tgctccgggt cagcgctgct gcccccacca attcactgct tggagtggct 3351 cgtgattgtt atgtcacagg cacaactcgg ctgtacatac ccaaggaagg 3401 cgggatggtg tttgaagggc tattcaggtc accgaaggcg cgcggcaacg 3451 tcggcttcgt ggctggtagc agctacggca cagggtcagt gtggaccagg 3501 aacaacgagg tcgtcgtact gacagcgtca cacgtggttg gccgcgctaa 3551 catggccact ctgaagatcg gtgacgcaat gctgactctg actttcaaaa 3601 agaatggcga cttcgccgag gcagtgacga cacagtccga gctcccaggc 3651 aattggccac agttgcattt cgcccaacca acaaccgggc ccgcttcatg 3701 gtgcactgcc acaggagatg aagaaggctt gctcagtggc gaggtttgtc 3751 tggcgtggac tactagtggc gactctggat ctgcagtggt tcagggtgac 3801 gctgtggtag gggtccacac cggttcgaac acaagtggtg ttgcctacgt 3851 gaccacccca agcggaaaac tccttggcgc cgacaccgtg actttgtcat 3901 cactgtcaaa gcatttcaca ggccctttga catcaatccc gaaggacatc 3951 cctgacaaca ttattgccga tgttgatgct gttcctcgtt ctctggccat 4001 gctgattgat ggcttatcca atagagagag cagcctttct ggacctcagt 4051 tgttgttaat tgcttgtttt atgtggtctt atcttaacca acctgcttac 4101 ttgccttatg tgctgggctt ctttgccgct aacttcttcc tgccaaaaag 4151 tgttggccgc cctgtggtca ctgggcttct atggttgtgc tgcctcttca 4201 caccgctttc catgcgcttg tgcttgttcc atctggtctg tgctaccgtc 4251 acgggaaacg tgatatcttt gtggttctac atcactgccg ctggcacgtc 4301 ttacctttct gagatgtggt tcggaggcta tcccaccatg ttgtttgtgc 4351 cacggttcct agtgtaccag ttccccggct gggctattgg cacagtacta 4401 gcggtatgca gcatcaccat gctggctgct gccctcggtc acaccctgtt 4451 actggatgtg ttctccgcct caggtcgctt tgacaggact ttcatgatga 4501 aatacttcct ggagggagga gtgaaagaga gtgtcaccgc ctcagtcacc 4551 cgcgcttatg gcaaaccaat tacccaggag agtctcactg caacattagc 4601 tgccctcact gatgatgact tccaattcct ctctgatgtg cttgactgtc 4651 gggccgtccg 
atcggcaatg aatctgcgtg ccgctctcac aagttttcaa 4701 gtggcgcagt atcgtaacat ccttaatgca tccttgcaag tcgatcgtga 4751 cgctgctcgt agtcgcagac taatggcaaa actggctgat tttgcggttg 4801 aacaagaagt aacagctgga gaccgtgttg tggttatcga cggtctggac 4851 cgcatggctc acttcaaaga cgatttggtg ctggttcctt tgaccaccaa 4901 agtagtaggc ggttctaggt gcaccatttg tgacgtcgtt aaggaagaag 4951 ccaatgacac cccagttaag ccaatgccca gcaggagacg ccgcaagggc 5001 ctgcctaaag gtgctcagtt ggagtgggac cgtcaccagg aagagaagag 5051 gaacgccggt gatgatgatt ttgcggtctc gaatgattat gtcaagagag 5101 tgccaaagta ctgggatccc agcgacaccc gaggcacgac agtgaaaatc 5151 gccggcacta cctatcagaa agtggttgac tattcaggca atgtgcatta 5201 cgtggagcat caggaagatc tgctagacta cgtgctgggc aaggggagct 5251 atgaaggcct agatcaggac aaagtgttgg acctcacaaa catgcttaaa 5301 gtggacccca cggagctctc ctccaaagac aaagccaagg cgcgtcagct 5351 tgctcatctg ctgttggatc tggctaaccc agttgaggca gtgaatcagt 5401 taaactgaga gcgccccaca tctttcccgg cgatgtgggg cgtcggacct 5451 ttgctgactc taaagacaag ggtttcgtgg ctctacacag tcgcacaatg 5501 tttttagctg cccgggactt tttatttaac atcaaatttg tgtgcgacga 5551 agagttcaca aagaccccaa aagacacact gcttgggtac gtacgcgcct 5601 gccctggtta ctggtttatt ttccgtcgta cgcaccggtc gctgattgat 5651 gcatactggg acagtatgga gtgcgtttac gcgcttccca ccatatctga 5701 ttttgatgtg agcccaggtg acgtcgcagt gacgggcgag cgatgggatt 5751 ttgaatctcc cggaggaggc cgtgcaaaac gtctcacagc tgatctggtg 5801 cacgcttttc aagggttcca cggagcctct tattcctatg atgacaaggt 5851 ggcagctgct gtcagtggtg acccgtatcg gtcggacggc gtcttgtata 5901 acacccgttg gggcaacatt ccatattctg tcccaaccaa tgctttggaa 5951 gccacagctt gctaccgtgc tggatgtgag gccgttaccg acgggaccaa 6001 cgtcatcgca acaattgggc ccttcccgga gcaacaaccc ataccggaca 6051 tcccaaagag cgtgcttgac aactgcgctg acatcagctg tgacgctttc 6101 atagcgcccg ctgcagagac agccctgtgt ggagatttag agaaatacaa 6151 cctatccacg cagggttttg tgttgcctag tgttttctcc atggtgcggg 6201 cgtacttaaa agaggagatt ggagacgctc caccactcta cttgccatct 6251 actgtaccat ctaaaaattc acaagccgga attaacggcg ctgagtttcc 6301 tacaaagtct ttacagagct 
actgtttgat tgatgacatg gtgtcacagt 6351 ccatgaaaag caatctacaa accgccacca tggcgacttg taaacggcaa 6401 tactgttcca aatacaagat taggagcatt ctgggcacca acaattacat 6451 tggcctaggt ttgcgtgcct gcctttcggg ggttacggcc gcattccaaa 6501 aagctggaaa ggatgggtca ccgatttatt tgggcaagtc aaaattcgac 6551 ccgataccag ctcctgacaa gtactgcctt gaaacagacc tggagagttg 6601 tgatcgctcc accccggctt tggtgcgttg gttcgctact aatcttattt 6651 ttgagctagc tggccagccc gagttggtgc acagctacgt gttgaattgc 6701 tgtcacgatc tagttgtggc gggtagtgta gcattcacca aacgcggggg 6751 tttgtcatct ggagacccta tcacttccat ttccaatacc atctattcat 6801 tggtgctgta cacccagcac atgttgctat gtggacttga aggctatttc 6851 ccagagattg cagaaaaata tcttgatggc agcctggagc tgcgggacat 6901 gttcaagtac gttcgagtgt acatctactc ggacgatgtg gttctaacca 6951 cacccaacca gcattacgcg gccagctttg accgctgggt cccccacctg 7001 caggcgctgc taggtttcaa ggttgaccca aagaaaactg tgaacaccag 7051 ctccccttcc tttttgggct gccggttcaa gcaagtggac ggcaagtgtt 7101 atctagccag tcttcaggac cgcgttacac gctctctgtt ataccacatt 7151 ggtgcaaaga atccctcaga gtactatgaa gctgctgttt ccatctttaa 7201 ggactccatt atctgctgtg atgaagactg gtggacggac ctccatcgac 7251 gtatcagtgg cgctgcgcgt accgacggag ttgagttccc caccattgaa 7301 atgttaacat ccttccgcac caagcagtat gagagtgccg tgtgcacagt 7351 ttgtggggcc gcccccgtgg ccaagtctgc ttgtggaggg tggttctgtg 7401 gcaattgtgt cccgtaccac gcgggtcatt gtcacacaac ctcgctcttc 7451 gccaactgcg ggcacgacat catgtaccgc tccacttact gcacaatgtg 7501 tgagggttcc ccaaaacaga tggtaccaaa agtgcctcac ccgatcctgg 7551 atcatttgct gtgccacatt gattacggca gtaaagagga actaactctg 7601 gtagtggcgg atggtcgaac aacatcaccg cccgggcgct acaaagtggg 7651 tcacaaggta gtcgccgtgg ttgcagatgt gggaggcaac attgtgtttg 7701 ggtgcggtcc tggatcacac atcgcagtac cacttcagga tacgctcaag 7751 ggcgtggtgg tgaataaagc tctgaagaac gccgccgcct ctgagtacgt 7801 ggaaggaccc cctgggagtg ggaagacttt tcacctggtc aaagatgtgc 7851 tagccgtggt cggtagcgcg accttggttg tgcccaccca cgcgtccatg 7901 ctggactgca tcaacaagct caaacaagcg ggcgccgatc catactttgt 7951 ggtgcccaag tatacagttc ttgactttcc 
ccggcctggc agtggaaaca 8001 tcacagtgcg actgccacag gtcggaacca gtgagggaga aacctttgtg 8051 gatgaggtgg cctacttctc accagtggat ctggcgcgca ttttaaccca 8101 gggtcgagtc aagggttacg gtgatttaaa tcagctcggg tgcgtcggac 8151 ccgcgagcgt gccacgtaac ctttggctcc gacattttgt cagcctggag 8201 cccttgcgag tgtgccatcg attcggcgct gctgtgtgtg atttgatcaa 8251 gggcatttat ccttattatg agccagctcc acataccact aaagtggtgt 8301 ttgtgccaaa tccagacttt gagaaaggtg tagtcatcac cgcctaccac 8351 aaagatcgcg gtcttggtca ccgcacaatt gattcaattc aaggctgtac 8401 attccctgtt gtgactcttc gactgcccac accccaatca ctgacgcgcc 8451 cgcgcgcagt tgtggcggtt actagggcgt ctcaggaatt atacatctac 8501 gacccctttg atcagcttag cgggttgttg aagttcacca aggaagcaga 8551 ggcgcaggac ttgatccatg gcccacctac agcatgccac ctgggccaag 8601 aaattgacct ttggtccaat gagggcctcg aatattacaa ggaagtcaac 8651 ctgctgtaca cacacgtccc catcaaggat ggtgtaatac acagttaccc 8701 taattgtggc cctgcctgtg gctgggaaaa gcaatccaac aaaatttcgt 8751 gcctcccgag agtggcacaa aatttgggct accactattc cccagactta 8801 ccaggatttt gccccatacc aaaagaactc gctgagcatt ggcccgtagt 8851 gtccaatgat agatacccga attgcttgca aattacctta cagcaagtat 8901 gtgaactcag taaaccgtgc tcagcgggct atatggttgg acaatctgtt 8951 ttcgtgcaga cgcctggtgt gacatcttac tggcttactg aatgggtcga 9001 cggcaaagcg cgtgctctac cagattcctt attctcgtcc ggtaggttcg 9051 agactaacag ccgcgctttc ctcgatgaag ccgaggaaaa gtttgccgcc 9101 gctcaccctc atgcctgttt gggagaaatt aataagtcca ccgtgggagg 9151 atcccacttc atcttttccc aatatttacc accattgcta cccgcagacg 9201 ctgttgccct ggtaggtgct tcattggctg ggaaagctgc taaagctgct 9251 tgcagcgttg ttgatgtcta tgctccatca tttgaacctt atctacaccc 9301 tgagacactg agtcgcgtgt acaagattat gatcgatttc aagccgtgta 9351 ggcttatggt gtggagaaac gcgacctttt atgtccaaga gggtgttgat 9401 gcagttacat cagcactagc agctgtgtcc aaactcatca aagtgccggc 9451 caatgagcct gtttcattcc atgtggcatc agggtacaga accaacgcgc 9501 tggtagcgcc ccaggctaaa atttcaattg gagcctacgc cgccgagtgg 9551 gcactgtcaa ctgaaccgcc acctgctggt tatgcgatcg tgcggcgata 9601 tattgtaaag aggctcctca gctcaacaga agtgttcttg 
tgccgcaggg 9651 gtgttgtgtc ttccacctca gtgcagacca tttgtgcact agagggatgt 9701 aaacctctgt tcaacttctt acaaattggt tcagtcattg ggcccgtgtg 9751 atgggcttag tgtggtcact gatttcaaat tctattcaga ctattattgc 9801 tgattttgct atttctgtga ttgatgcagc gcttttcttt ctcatgctac 9851 ttgcattggc tgttgttact gtgtttcttt tctggctcat tgttgccatc 9901 ggccgcagct tggtggcgcg gtgttcacga ggtgcgcgtt acagacctgt 9951 ttaaggattt gcagtgcgac aacctgcgcg cgaaagatgc cttcccgagt 10001 ctgggatatg ctctgtcgat tggccagtcg aggctatcgt atatgctgca 10051 ggattggttg cttgctgcgc accgcaagga agttatgcct tccaatatca 10101 tgcctatgcc cggtcttact cctgattgct ttgaccatct ggagtcttct 10151 agctatgctc catttatcaa tgcctatcgg caggcaattt tgagtcaata 10201 cccacaagag ctccagctcg aagccatcaa ctgtaaattg cttgctgtgg 10251 ttgcaccggc attgtatcat aattaccatc tagccaattt gaccggaccg 10301 gccacatggg tcgtgcctac agtgggccag ttgcactatt atgcttcttc 10351 ctctattttt gcttcatctg tggaagtgtt ggcagcaata atactactat 10401 ttgcatgcat accactagtg acacgagtgt acatctcttt tacgcggcta 10451 atgtcacctt cccgtcgcac ttccagcggc actttgccgc ggcgcaagat 10501 tttgtagtgc acacgggtta tgaatatgcc ggggtcacta tgttagtgca 10551 cttgtttgcc aacttggttc tgacatttcc gagcttagtt aattgttccc 10601 gccctgtgaa tgtctttgct aatgcttctt gcgtgcaagt ggtttgtagt 10651 cataccaact caactactgg cttgggtcaa ctttcttttt cctttgtaga 10701 tgaagatcta cggctgcata tcaggcctac tcttatttgt tggtttgcct 10751 tgttgttggt gcactttcta cccatgccac gctgcagagg ctcgtaattt 10801 tacttacatt agtcatggat tgggccacgt gcacggtcat gaggggtgta 10851 ggaattttat taatgtcact cattctgcat ttctttatct taatcccacc 10901 actcccactg cgccggctat aactcattgt ttacttctgg ttctggcagc 10951 caaaatggaa cacccaaacg ctactatctg gctgcagctg cagccgtttg 11001 ggtatcatgt ggctggcgat gtcattgtca acttggaaga ggacaagagg 11051 catccttact ttaaactttt gagagcgccg gctttaccgc ttggttttgt 11101 ggctatagtt tatgttcttt tacgactggt acgttgggct caacgatgtt 11151 atctatgatt gtattgctat tcttgctttg gggtgcgcca tcacatgctt 11201 acttctcata ctacaccgct cagcgcttca cagacttcac cttgtgtatg 11251 ctgacggatc gcggcgttat tgccaatttg 
ctgcgatatg atgagcacac 11301 tgctttgtac aattgttccg ccagtaaaac ctgttggtat tgcacattcc 11351 tggacgaaca gattatcacg tttggaaccg attgtgatga cacctacgcg 11401 gtcccagttg ctgaggtcct ggaacaggcg catggaccgt acagtgcgct 11451 gtttgatgac atgccccctt ttatttacta tggccgtgaa ttcggcatag 11501 ttgtgttgga tgtgtttatg ttctatcccg ttttagttct gtttttctta 11551 tcagtactac cctatgctac gcttattctt gaaatgtgtg tatctattct 11601 gtttataatc tatggcattt acagcggggc ctacttggcc atgggcatat 11651 ttgcggccac gcttgctata cattcaattg tggtcctccg ccaattactg 11701 tggttatgcc tggcttggcg ataccgctgt acgcttcacg cgtcctttat 11751 atcagctgag gggaaagtgt accccgtaga ccccggactc ccggttgccg 11801 ccgtgggcaa tcggttgtta gtcccaggta ggcccactat cgattatgca 11851 gtggcctacg gcagcaaagt caaccttgtg aggttggggg cagctgaggt 11901 atgggagcca tagattcatt ttgtggtgac gggattttag gtgagtatct 11951 agattacttt attctgtccg tcccactctt gctgttgctt actaggtatg 12001 tagcatctgg gttagtgtat gttttgactg ccttgttcta ttcctttgta 12051 ttagcagctt atatttggtt tgttatagtt ggaagagcct tttctactgc 12101 ttatgctttt gtgcttttgg ctgcttttct gttattagta atgaggatga 12151 ttgtgggtat gatgcctcgt cttcggtcca ttttcaacca tcgccaactg 12201 gtggtagctg attttgtgga cacacctagt ggacctgttc ccatcccccg 12251 ctcaactact caggtagtgg ttcgcggcaa cgggtacacc gcagttggta 12301 acaagcttgt cgatggcgtc aagacgatca cgtccgcagg ccgcctcttt 12351 tcgaaacgga cggcggcgac agcctacaag ctacaatgac ctactgcgca 12401 tgtttggtca gatgcgggtc cgcaaaccgc ccgcgcaacc cactcaggct 12451 attattgcag agcctggaga ccttaggcat gatttaaatc aacaggagcg 12501 cgccaccctt tcgtcgaacg tacaacggtt cttcatgatt gggcatggtt 12551 cactcactgc agatgccgga ggactcacgt acaccgtcag ttgggttcct 12601 accaaacaaa tccagcgcaa agttgcgcct ccagcagggc cgtaagacgt 12651 ggatattctc ctgtgtggcg tcatgttgaa gtagttatta gccacccagg 12701 aacc