Sequence of DPV Alphamesonivirus 1

Nam Dinh virus isolate 02VN178, complete genome.

ACC No: DQ458789

Dated: 2012-05-04 | Length: 20192 | CRC: 1601306517

                
ID   DQ458789; SV 2; linear; genomic RNA; STD; VRL; 20192 BP.
XX
AC   DQ458789;
XX
DT   02-APR-2007 (Rel. 91, Created)
DT   04-MAY-2012 (Rel. 112, Last updated, Version 4)
XX
DE   Nam Dinh virus isolate 02VN178, complete genome.
XX
KW   .
XX
OS   Nam Dinh virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales;
OC   unclassified Nidovirales.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-20192
RX   PUBMED; 21931546.
RA   Nga P.T., Parquet Mdel C., Lauber C., Parida M., Nabeshima T., Yu F.,
RA   Thuy N.T., Inoue S., Ito T., Okamoto K., Ichinose A., Snijder E.J.,
RA   Morita K., Gorbalenya A.E.;
RT   "Discovery of the first insect nidovirus, a missing evolutionary link in
RT   the emergence of the largest RNA virus genomes";
RL   PLoS Pathog. 7(9):E1002215-E1002215(2011).
XX
RN   [2]
RP   1-20192
RA   Nga P.T., Parida M., Parquet M.D.C., Thuy N.T., Suu P.T., Khan A.H.,
RA   Salda L.T.D., Yu F., Inoue S., Ito T., Morita K.;
RT   "Identification of a novel mosquito virus in Viet Nam related to the
RT   members of the order Nidovirales";
RL   Unpublished.
XX
RN   [3]
RP   1-20192
RA   Nga P.T., Parida M., Parquet M.D.C., Thuy N.T., Suu P.T., Khan A.H.,
RA   Salda L.T.D., Yu F., Inoue S., Ito T., Morita K.;
RT   ;
RL   Submitted (22-MAR-2006) to the INSDC.
RL   Virology, Institute of Tropical Medicine, Sakamoto 1-12-4, Nagasaki
RL   852-8013, Japan
XX
RN   [4]
RC   Sequence update by submitter
RP   1-20192
RA   Nga P.T., Parquet M.D.C., Lauber C., Parida M., Nabeshima T., Yu F.,
RA   Thuy N.T., Inoue S., Ito T., Okamoto K., Ichinose A., Snijder E.J.,
RA   Morita K., Gorbalenya A.E.;
RT   ;
RL   Submitted (07-JUL-2011) to the INSDC.
RL   Virology, Institute of Tropical Medicine, Sakamoto 1-12-4, Nagasaki
RL   852-8013, Japan
XX
CC   On Jul 26, 2011 this sequence version replaced gi:108744356.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .20192
FT                   /organism="Nam Dinh virus"
FT                   /host="mosquito"
FT                   /isolate="02VN178"
FT                   /mol_type="genomic RNA"
FT                   /country="Viet Nam"
FT                   /collection_date="2002"
FT                   /db_xref="taxon:325676"
FT   gene            361. .15638
FT                   /gene="ORF1ab"
FT   CDS             join(361. .7851,7851. .15638)
FT                   /codon_start=1
FT                   /ribosomal_slippage
FT                   /gene="ORF1ab"
FT                   /product="pp1ab polyprotein"
FT                   /note="non-structural polyprotein; ORF1b expressed via -1
FT                   ribosomal frameshifting; contains two transmembrane domains
FT                   that flank a 3C-like proteinase (ORF1a-encoded), as well as
FT                   RNA-dependent RNA polymerase, Zm-Hel1 helicase, 3'-5'
FT                   exoribonuclease, N7-methyltransferase and
FT                   2'-O-methyltransferase domains (ORF1b-encoded); cleavage
FT                   sites not determined"
FT                   /db_xref="GOA:G1K4K6"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="UniProtKB/TrEMBL:G1K4K6"
FT                   /protein_id="ABG02430.2"
FT                   /translation="MTYHDYALKDNVVLKRDQKLALDNFVTEVIQFWTPILTTLLLLAY
FT                   ALRKIMHNPFVGPISDNPLKRALQWIIFVFTRRNLYYQTPVFARDESRLNIFLHNDFAR
FT                   LDRNTLNGYCKLCNLYGHNHTDKHNPTIDALVLAKTCKLLRYNDKVTKPLAYTVHNIRA
FT                   YEKNTKTFADTFGTTTTNIPTKYALAPKKAVSELTTIESNLGPIYVNNTIAYPHLGFIA
FT                   YDNKQHLQELLANVTVVLDTIMVYTQYELDDATMNIRKSDITLSFVNDFDLTNALTNEL
FT                   KDPRTPWLLKAKKTSNKASEQDDDTEAEDTQKNKRKGKQLKPQTQLLQHTLAKQTKFAR
FT                   RQPFLSFGPTYMTLLCLISIMSPTYATVCTTYEPLDQADLYCNNLQNLTIEKYHAYANY
FT                   EQLNRQCFSIDGAEFKDLIRLSVSNALNLNNVIKPVPRDDYILKAFSNALPLNTHVLSD
FT                   YNTILDLQILMQFYNLNGSNVLYTETYSESEDYAGKVVQLLAQGTGGICKAPACILFTG
FT                   LATTVTDVEVKVTERLTKRIKHQEHGKPLHINPSCRKTCYCMYKPKVKPEPVETVKYAP
FT                   QAEFYTQLRYFQNHELQMYDDFEMGVLRYNNYTLNTFIYSNETCILTRGVHCVYNPEHF
FT                   AITRVYNNLGNYLECGVNQEFCESLQQEFMFNEPQLVITESMAVEAPTQYHKICDNHYT
FT                   SLQVKYPLIEKLFWSNFXVSVNRALAVKEPATFIIVHDTVAIIKTVIADIIEIMEKCYD
FT                   STAIKLTHHDFNKLEYMDDYSQILVKYKPLLEQHKIMLIEDIDLITVPAARALFSIFDT
FT                   YTPLVNGVFVLGTLNHKRYNETLAFNYYNDQTPTFYVDGILNANWQLLEDHTRQPLITR
FT                   VADNVHTILARQLVTPPAKIAATMPKVPTLNNTSKVINTLTHYSAHLLQVVGSDSTQAY
FT                   TYINSSVHNITDYVNDSVHNITDYVYTVYNSTKNHIVTRYNNMLMAAYDIQLNFYNQYP
FT                   LRQDFYHKGIRAFDLGQVCDFLHHTDTIVLYQDCINQKLDTIYVIKLRYGQNANGYHMY
FT                   PLKQPHTKQTIYELSDAIGFVYKDSKYNYFRTLFTNPGEYVLTIRENYLEYCKSDFSPT
FT                   PAFAPDATLQCYAYITGIQVIDNFIAEFGLFLMLYTAALIIILALAITIRDSTMMMFLK
FT                   LSIIFAYTFGPLLLTPKVFGSYIFVSLYNMLPYTSNTSYGCLLMMGALAITVIDLFAYM
FT                   TQRYRSEFTKNILQLATLLFEIVAITKYILIPYVCTSYGLVLTIIVSYVAYRYIQSQRP
FT                   NYLKATVSNATAHADWVAYRNTTREKTDEAAKSNLSKIINTSIADINKDQFLECVYLAA
FT                   CHRATVAASTYNPKHYLHIPNYNTKIMFARDNELMNYSVLSTDLKNKSAASNPSISHIV
FT                   LELPVAINPLIKYTTKTSVSSLRGAVVNGYIYIQRHLFGSKKQEFEACYNNGKGLLNCK
FT                   NLDRSKYDIDSAELIGTLIRIPLHDKQSIPHISLHPDPLSYNGPVTLYLSRYDTELNKD
FT                   VLCVHTGFMSEGHHDIKTVFGDCGGMLFDPKGRLLGLHCAGSDDVVFMDTTTGKSNIWT
FT                   SYKLQHPSEIMITLNNEINLPNPTNYDFETTKVVYQHPLRNVCATLETLQHLTNKTNVK
FT                   LPYDPRLLSDFNITAEQYAQYGYNIDYNNFINNFNRYTTTTIGTKSFETCIKYGLMDNK
FT                   KVEYYNQTATIFNPPEHSSSGFDNTMDVLYVFVYMFTHTHPAFYIAAACVFCLFFVKMN
FT                   KYLKMILSSIIFTIPHIYVNYYYGLVYMPLKWRKQITALAIRYNPYTAVALRYNKNLNI
FT                   AKDVAKELGTPKNLCTHLSTLLKCIKPYAAFNDLSQVINNVDDLMANWANTYNAEELLK
FT                   QYIDEIYKLYPILFVVFEKIENYEDQIKTILSYISDTGEFDLNGFEIHFDEKEHTTNII
FT                   DTNVEDIHDKLMAEKASLIALKNMNLEFDIETINNANIGELVRYLIISSTPDTLDRDLL
FT                   SRTTELLVRRIHQLRDDSEHNENLITLLSEIYKHKDFLTASHLTSNLRDRNYIMNNLVR
FT                   VIALFNKQINMQVAQKQYEARRIEELRKKESKQIMEQNNRIRKMQRQNQNIASAIVHMV
FT                   HACFANRFMLQNEAQKIMKALLGSDLELDPTDAEMQYYTAYRNGQVLTNQAIVTNFTTL
FT                   TTILWTGNGYQTVPSMCGAQEFTCTATHKHGYFNCTMEIKDAWYKHAEECTKCKSYYRT
FT                   NKHPRCGAIYDTTVKRYPTLSNFIARYRSCPACMPCTQCLSRREPGCESASYHIADTAH
FT                   YQNQAYLTPINIKPDNLEYNFVDINNGDVNAIYNGRIWLMRRTTAITPPPARYRNITNL
FT                   KLKQTDPEGYYYISEVCPTDLAILNAMINQIQLKLLDRTVLNNENHVENANTIQFNNPL
FT                   NDTTLDDLRTKHKHLLVMKLRPDSEHHFIEVLNFVRMNNLPIFIAHVTYADNTVNHATI
FT                   YINYLQAWRNEILDDVTTTCDILEKIIKHPLDFSRWARTLSRNSNVARYHQLCTTTDAG
FT                   IRHTIDISCNKTSISYIDEVNNNVNVKIKSHIVKEHKIYEMLINQYPNLFLIEHKLVND
FT                   IIPHLLRYNMTALSFADLYGLIKEENWHPIYDTLPQVTYHKINDDLLLKIKSHTPSPQH
FT                   TCCMLCRRFLVEFGLLLHKLNYKVFETTRAMLTHYDFVLTADNVDLNGILDFEDYKLRK
FT                   CTVAYDVKSQLRIMQPYYHALYSFYEHTGMYFISQPIYNSIVDPSLDLIQQFESAVEAT
FT                   RNLPLDAKFDDTPLYRPTIQHLAEYLKLNIYAMEPEPLWNCYDTMDCPQIELPGIDNAI
FT                   TSIITKPTRPLSEYIELNHTTVKNFDGDIYCKVNHNEINNLQDILYCMPTDATIHELYI
FT                   VDHPYELESHNRMLRTSLNIWLHNLYDANVNLSHFDSINYDKTRKASFPIVGTVPAITL
FT                   RDCEICQDEIPDDLKDVYDFGSCVHAKAQLSDYTTPRKLNPLIEFDPALLRHGEFLPNN
FT                   DYAYTMKTKPDHLIDRELKDYIDSTGLTALIPPLDINPAVHDPETTYSSSYYIKTPSET
FT                   SIRQDLELFNQNTAGSVSPTVFLMAIELLHQLLTEEISASDGKPNCPMVPSEVPVRNKH
FT                   KSAGTPYRKFGDSEFMRELYGNYRDAIVYHKRHSADQQLTLTINKVAPSKNHRDRTILA
FT                   ISINKSEPGRSLYRWNLDKIKYTSSLGGPILIGFTAQYGGWDKLYKYLYKNSPADNPDT
FT                   AEHAVLGGKDYPKWDRRISNMLQLTTTTVLYSLIDPNTQRKLNNATPAQTWHEYMAETT
FT                   QVLYDYLVFGNELYQKPGGVTSGNSRTADGNSLLHLLIDFYAIISQLIQSTPENVHLEV
FT                   NLRNALCKTVFTRIPSDYIDSSCVTLRNTDTLHTIRRRVAKGAYLSDDGLIVIDPRIIR
FT                   YDDFMSVSHLISHYMIAQNKHKYHIDAIQRYAREFLSQDTIKFGDMVYPIPEFGRMYTA
FT                   MLLSDNKNTLDPQINITRLLALFSYLYIYYFKYEDQPTHPILKFLDALRTYIENKLNTT
FT                   DEIFLDCIKVPDLQDVEFDLKNCDLYENFDYLWGLDQSSAYMEYLCKYKHRYRNLSLFK
FT                   RQLIQHHEEAQLHNENKLMNKGKLITYNCYVCGENAYLTCATCERAFCNSADTNHGSHM
FT                   EQHLQYSGHTCLYLNCKTVKCQHCFTMDINLLYTTGRDHYCESHKPKNAVRILNYNANT
FT                   KLPPLLYLCVTDTKRVTFYEQCYINYTKAHPTYAISKEQFMSLIQLYLHQDYTLPVNQL
FT                   ANRIRVSLQLSSYGVVRPYHQLIMQLTKLESKVLDSSVVDIPITLINSQEIGTYYIEIP
FT                   REHKLDQHSTYSYLLGTREVSFTPNYYRLSSTNTHIWQTDTQIPNYCTFIRQRRLNTLS
FT                   AILRNTTQHVPEFTRLLLEWNQQLPITAKPFAEFKPSLKIPAQPNVTDNINTLLKELNV
FT                   KRFKIMFGGPGTGKSHTLSILINHLHEKGLRILVYTPSHQSANALLYKIANLIKRRTIQ
FT                   NPGLVRIITDGMKEEIKPHPYITYRTNMLDKDRICVTTIQSFSTVQHVKDVDLVILDEF
FT                   SLTSDNYLLTGLAHLKPSTRVLFSGDPRQLSGVDEIRKPLQSRFHTLINYYTETYPREV
FT                   HVLKYHFRCHPSIFQYFKDLYYADKDMECATSIADRIIRPLNPINTVQVSEPTFRNQGV
FT                   ILNQDEADKVLEILVLVNQTLALHSSYEYQPTIAIICSYKSQLQNFISLQQQKILSENV
FT                   NLSTIDSAQGDEFDIVILCLSQINNFTLNPNRFNVAISRAKSVLFITVPPIDKNPAFLF
FT                   KDVYATLHKHNLTYFKIYNTSGKAILSLDSPTTLKTKAEKMTYTNVRHLDKNTHTMQRK
FT                   FPMNIVMDDCICFDAEFLNPRDNVQEPVMLSYGFSSKYGKRRIAGIPVRYIKDKFNRIV
FT                   PQKYNYKDNNKPLTSTYTCDWMRKQHPEQYKHLLTSVMQGIRNDTTVDLKPLLNFCVDN
FT                   MHVKPVIVTWSGASDHCFLKAHTLYPDIATVCNITIRCTSQPIYASPQGRHTYYLCQYH
FT                   AHQLKDHVNITHFVNLEIIDLKVDRNQYTDERTLRVYHNDYLKLTLDLDNVASNSLTDC
FT                   HTRYCRTVHAPATPHDPLDDAIMTQCIYQSFVLSHLENLAYEPQANLKAFTSMDYRLKN
FT                   FNPEMCKLRRELQKVWYSQYTNTNKTHCNMGCGKEPLQQALHNIDVLQGKSNPQNNMNT
FT                   HTCDSEEHIYFDSHWYKAGGFTKPSYIFSDINKEHYYKLGTTGLCLYLNSKYAKYVHEY
FT                   RTVSGNDVFKSLYSPYCDLGRKPHQAEIEPSCSIPDCIITSNIGERFQTLVCNVHKDQM
FT                   DIISKISQATKYGYQFIYTGKTLLNNHAALSKAPHNWDHLTLEIPGYNTRKQHSSHMTT
FT                   KALGILHILQDSMLYTNRKTLNPNLPVILPGSASYLGDTVLANEMAKTLKQTKFIHIDP
FT                   RLKIDNNTTHHRKTLMEMLDIGYTTELIISDIHDNKNPWIPELMTYTLKYLVDTGTLIM
FT                   KITSRGATEDVLQQLEDLSKNFTYVRVCNLNAVTFSSELWIVFANKRKPPVQGWTSHEL
FT                   RAELRKHWYSMTRSIIQPLMRSRQSVFRYSPK"
FT   gene            361. .7872
FT                   /gene="ORF1a"
FT   CDS             361. .7872
FT                   /codon_start=1
FT                   /gene="ORF1a"
FT                   /product="pp1a polyprotein"
FT                   /note="non-structural polyprotein; contains two
FT                   transmembrane domains that flank a 3C-like proteinase;
FT                   cleavage sites not determined"
FT                   /db_xref="UniProtKB/TrEMBL:G0KUE2"
FT                   /protein_id="AEK87148.1"
FT                   /translation="MTYHDYALKDNVVLKRDQKLALDNFVTEVIQFWTPILTTLLLLAY
FT                   ALRKIMHNPFVGPISDNPLKRALQWIIFVFTRRNLYYQTPVFARDESRLNIFLHNDFAR
FT                   LDRNTLNGYCKLCNLYGHNHTDKHNPTIDALVLAKTCKLLRYNDKVTKPLAYTVHNIRA
FT                   YEKNTKTFADTFGTTTTNIPTKYALAPKKAVSELTTIESNLGPIYVNNTIAYPHLGFIA
FT                   YDNKQHLQELLANVTVVLDTIMVYTQYELDDATMNIRKSDITLSFVNDFDLTNALTNEL
FT                   KDPRTPWLLKAKKTSNKASEQDDDTEAEDTQKNKRKGKQLKPQTQLLQHTLAKQTKFAR
FT                   RQPFLSFGPTYMTLLCLISIMSPTYATVCTTYEPLDQADLYCNNLQNLTIEKYHAYANY
FT                   EQLNRQCFSIDGAEFKDLIRLSVSNALNLNNVIKPVPRDDYILKAFSNALPLNTHVLSD
FT                   YNTILDLQILMQFYNLNGSNVLYTETYSESEDYAGKVVQLLAQGTGGICKAPACILFTG
FT                   LATTVTDVEVKVTERLTKRIKHQEHGKPLHINPSCRKTCYCMYKPKVKPEPVETVKYAP
FT                   QAEFYTQLRYFQNHELQMYDDFEMGVLRYNNYTLNTFIYSNETCILTRGVHCVYNPEHF
FT                   AITRVYNNLGNYLECGVNQEFCESLQQEFMFNEPQLVITESMAVEAPTQYHKICDNHYT
FT                   SLQVKYPLIEKLFWSNFXVSVNRALAVKEPATFIIVHDTVAIIKTVIADIIEIMEKCYD
FT                   STAIKLTHHDFNKLEYMDDYSQILVKYKPLLEQHKIMLIEDIDLITVPAARALFSIFDT
FT                   YTPLVNGVFVLGTLNHKRYNETLAFNYYNDQTPTFYVDGILNANWQLLEDHTRQPLITR
FT                   VADNVHTILARQLVTPPAKIAATMPKVPTLNNTSKVINTLTHYSAHLLQVVGSDSTQAY
FT                   TYINSSVHNITDYVNDSVHNITDYVYTVYNSTKNHIVTRYNNMLMAAYDIQLNFYNQYP
FT                   LRQDFYHKGIRAFDLGQVCDFLHHTDTIVLYQDCINQKLDTIYVIKLRYGQNANGYHMY
FT                   PLKQPHTKQTIYELSDAIGFVYKDSKYNYFRTLFTNPGEYVLTIRENYLEYCKSDFSPT
FT                   PAFAPDATLQCYAYITGIQVIDNFIAEFGLFLMLYTAALIIILALAITIRDSTMMMFLK
FT                   LSIIFAYTFGPLLLTPKVFGSYIFVSLYNMLPYTSNTSYGCLLMMGALAITVIDLFAYM
FT                   TQRYRSEFTKNILQLATLLFEIVAITKYILIPYVCTSYGLVLTIIVSYVAYRYIQSQRP
FT                   NYLKATVSNATAHADWVAYRNTTREKTDEAAKSNLSKIINTSIADINKDQFLECVYLAA
FT                   CHRATVAASTYNPKHYLHIPNYNTKIMFARDNELMNYSVLSTDLKNKSAASNPSISHIV
FT                   LELPVAINPLIKYTTKTSVSSLRGAVVNGYIYIQRHLFGSKKQEFEACYNNGKGLLNCK
FT                   NLDRSKYDIDSAELIGTLIRIPLHDKQSIPHISLHPDPLSYNGPVTLYLSRYDTELNKD
FT                   VLCVHTGFMSEGHHDIKTVFGDCGGMLFDPKGRLLGLHCAGSDDVVFMDTTTGKSNIWT
FT                   SYKLQHPSEIMITLNNEINLPNPTNYDFETTKVVYQHPLRNVCATLETLQHLTNKTNVK
FT                   LPYDPRLLSDFNITAEQYAQYGYNIDYNNFINNFNRYTTTTIGTKSFETCIKYGLMDNK
FT                   KVEYYNQTATIFNPPEHSSSGFDNTMDVLYVFVYMFTHTHPAFYIAAACVFCLFFVKMN
FT                   KYLKMILSSIIFTIPHIYVNYYYGLVYMPLKWRKQITALAIRYNPYTAVALRYNKNLNI
FT                   AKDVAKELGTPKNLCTHLSTLLKCIKPYAAFNDLSQVINNVDDLMANWANTYNAEELLK
FT                   QYIDEIYKLYPILFVVFEKIENYEDQIKTILSYISDTGEFDLNGFEIHFDEKEHTTNII
FT                   DTNVEDIHDKLMAEKASLIALKNMNLEFDIETINNANIGELVRYLIISSTPDTLDRDLL
FT                   SRTTELLVRRIHQLRDDSEHNENLITLLSEIYKHKDFLTASHLTSNLRDRNYIMNNLVR
FT                   VIALFNKQINMQVAQKQYEARRIEELRKKESKQIMEQNNRIRKMQRQNQNIASAIVHMV
FT                   HACFANRFMLQNEAQKIMKALLGSDLELDPTDAEMQYYTAYRNGQVLTNQAIVTNFTTL
FT                   TTILWTGNGYQTVPSMCGAQEFTCTATHKHGYFNCTMEIKDAWYKHAEECTKCKSYYRT
FT                   NKHPRCGAIYDTTVKRYPTLSNFIARYRSCPACMPCTQCLSRREPGCESASYHIADTAH
FT                   YQNQAYLTPINIKPDNLEYNFVDINNGDVNAIYNGRIWLMRRTTAITPPPARYRNITNL
FT                   KLKQTDPEGYYYISEVCPTDLAILNAMINQIQLKLLDRTVLNNENHVENANTIQFNNPL
FT                   NDTTLDDLRTKHKHLLVMKLRPDSEHHFIEVLNFVRMNNLPIFIAHVTYADNTVNHATI
FT                   YINYLQAWRNEILDDVTTTCDILEKIIKHPLDFQGGLVL"
FT   gene            15660. .18359
FT                   /gene="ORF2a"
FT   CDS             15660. .18359
FT                   /codon_start=1
FT                   /gene="ORF2a"
FT                   /product="putative spike protein"
FT                   /note="putative structural protein; S; p2a; contains
FT                   stretches of hydrophobic residues, potential N-linked
FT                   glycosylation signals (NXS/T), and cysteine residues at
FT                   locations flanked by hydrophobic regions"
FT                   /db_xref="UniProtKB/TrEMBL:G1K4K8"
FT                   /protein_id="ABG02427.2"
FT                   /translation="MINSKCPLLFQTTTTSMPNQAHRNRPGPPIILKPETQWHQNHNAN
FT                   ASSSSKLHRSPLNNHPKDNQNLNATNLMLQHLPSLRSRKQLQQAPTTPKPAVNFTKSEK
FT                   NSMLETTWDGGKMKRLDQQSSSSLNLKWHPELTKSTIAINLRILTISSILSVLAYLFKT
FT                   QPLSAMQFTTTKNSLLKKKMSTFVNYLMPSMQSSCVHVKHLTLVPFLLLLLMLPNANCS
FT                   TRIDLSTHHIVSYNKPLIVVDDFLKTTLKYNFGTDLYNSAINYKTSFEQLLNNFKTPYQ
FT                   PLVDAFRVLFSYLGIEPVAHPFKDYFNADSPCPLQTTTTTGDVTTIGEHFQEILDDGNL
FT                   ELEPLASYWLRHTEDIFVYTRSQLWAFICPSEFAQASIFLPNYTEAIYNVSTAFCKTVY
FT                   YDTPTNAFNAEICNKVNFITPAKAQKRSKRWDSSYVCGWPLVSSAAKVLGGECTTNIDI
FT                   GTLKSSLNAIQNFSYANTELIHDLQSQLSVVNARTNLHYNQLQQLVTAINDHQAKYVND
FT                   INNLINHIKNTTNTMENRINVNSIIMSYTNSLFRVYQNIVDYRFAYIETLSSIQQHYHF
FT                   PSEHLHAFNVPLQAKLREHGFSIPIIDSNIPYSYGKVRYLNVTGINFYDLEFDIYIPVI
FT                   KLIHEKDSKYYHSTLSALPIGINTTLVTYNTYQGNAICTDTYCLESPINGFCREGESYW
FT                   YCGQHYIRTLHKITSLYTKPTKFTXSAMFIPPHTMYFVHNTTYSLNYGSSLQALAGSIL
FT                   MLTCNSTVQIPGYNFNSNDFVSCTDMNVNNVFIHPSLRVNDANFYIPPTRVDLLEKLYK
FT                   RDIIPILNHIQKANDITIDTTADEELKQQYETLKSDFNAKYDALNIENRRIHALINSMH
FT                   SIQSEPSYILYMVIAVIVFIVLKFLRII"
FT   gene            15674. .16312
FT                   /gene="ORF2b"
FT   CDS             15674. .16312
FT                   /codon_start=1
FT                   /gene="ORF2b"
FT                   /product="putative nucleocapsid protein"
FT                   /note="putative structural protein; N; p2b; highly
FT                   hydrophilic, enriched with proline and acidic residues as
FT                   well as basic residues relative to other virion proteins"
FT                   /db_xref="GOA:G1K4K7"
FT                   /db_xref="UniProtKB/TrEMBL:G1K4K7"
FT                   /protein_id="ABG02426.1"
FT                   /translation="MPATISNNNNVNAQPGTSKQTRPTNNSKTGNAMAPKPQRQRKQQQ
FT                   QASSQSPKQPPKGQPKPKRNQPNAAASAQPKVKKAIATGPNYTETSGKLYKIGKEFDAR
FT                   NHMGWRKNEKTGSTVQFLFKPKMASRIDQVYYRNQFEDPDHFIHTFGVGVFVQDSTLER
FT                   NAIYNHQKLTTEEKDEYVRKLSDAFNAILLRTRQAFDSGSLPALTVDAA"
FT   gene            18402. .18878
FT                   /gene="ORF3"
FT   CDS             18402. .18878
FT                   /codon_start=1
FT                   /gene="ORF3"
FT                   /product="putative small glycoprotein"
FT                   /note="putative structural protein; p3; contains stretches
FT                   of hydrophobic residues, potential N-linked glycosylation
FT                   signals (NXS/T), and cysteine residues at locations flanked
FT                   by hydrophobic regions; similar to membrane (M) protein of
FT                   Nidoviruses"
FT                   /db_xref="UniProtKB/TrEMBL:G1K4K9"
FT                   /protein_id="ABG02428.1"
FT                   /translation="MIVKITILFSILAVAMAADTTPEVVSPSTKLCEASSTQHCTAMGY
FT                   DYCKSISGVQSCYCSHVQNFTSVMDVIDKNLKCSITSSKYLDPHYWFRDLLAASVTLLV
FT                   IFTAITWAYLIPTYAKIDAIYTNSTSKAKQLHYIPLLPRQSDGSYTLLPGRSYK"
FT   gene            18754. .19104
FT                   /gene="ORF4"
FT   CDS             18754. .19104
FT                   /codon_start=1
FT                   /gene="ORF4"
FT                   /product="putative small glycoprotein"
FT                   /note="putative structural protein; putative p4; contains
FT                   stretches of hydrophobic residues, potential N-linked
FT                   glycosylation signals (NXS/T), and cysteine residues at
FT                   locations flanked by hydrophobic regions; similar to
FT                   membrane (M) protein of Nidoviruses"
FT                   /db_xref="UniProtKB/TrEMBL:G1K4L0"
FT                   /protein_id="ABG02429.1"
FT                   /translation="MLKSMLFIRTQPLKQNNCTTSPYYHGSLTAVIRSSPDDRINKLQS
FT                   RIQSENRLRWLFNNFSNCITCSYKLATFLYYIFTAIYYGFCLIMLYILWIYFTQLTNQI
FT                   KLVYHNFSNPYN"
XX
SQ   Sequence 20192 BP; 6933 A; 4432 C; 3151 G; 5674 T; 2 other;

dq458789 Length: 20192  04-MAY-2012  Type: N  Check: 5547  ..

       1  actaaagaaa cttttgtttt ctcccataat actactacta caagtatcaa
      51  ccccgtccgt ctgtcagaga cgctaaactc tgataactaa acctagccac
     101  atcagttgct taaagaacct cttgagacac tctcccactt aacatctttt
     151  aggaatcttc gatgctacaa caacttggct agtaaacaat aaatccgcat
     201  acttcacagt tgtaagaggc cataggtcca aactttgaaa ggtttgtttc
     251  tattgtgtca aacacttaga ttaacagagg ctatattagt gctcatcacg
     301  ttaacaaagt aatcttgcgc aatagtatga gtttgttgta aaacgtcttg
     351  atacgacacc atgacatacc acgattacgc tcttaaggac aatgttgtcc
     401  tcaagagaga tcaaaaacta gctttggaca actttgtcac cgaagttatc
     451  caattctgga cccccattct gaccacgcta ctcttgcttg cctatgcact
     501  caggaaaatc atgcataatc cattcgtcgg acccatttcg gataatcctc
     551  tgaaacgtgc cctacaatgg atcatctttg tgttcactcg ccgaaacctg
     601  tattatcaaa caccagtttt tgcccgtgac gagtctcgcc taaacatttt
     651  tctccacaat gattttgcac gcttggacag aaacaccctt aatggatatt
     701  gtaaactatg taatctatat ggacataatc acacagataa acataatcct
     751  accatagacg cactagtttt agctaaaact tgtaaattac tccgctataa
     801  tgataaggta actaaaccat tggcctacac tgtacataac atacgggctt
     851  atgagaagaa taccaaaaca tttgcagata cctttggtac aaccactaca
     901  aatatcccaa ctaaatatgc tctagcacct aagaaagcag taagcgaact
     951  taccactatt gaatctaacc taggacctat ttacgttaat aacactatag
    1001  cttacccaca tcttggcttt attgcttatg ataataaaca acacctccag
    1051  gaactcctgg ctaatgtaac tgtagtttta gacacaatta tggtttatac
    1101  acaatacgag ctagatgacg ccactatgaa catacgcaaa agtgacataa
    1151  cacttagttt tgtaaatgac tttgatttga ctaatgcctt aaccaacgag
    1201  ctcaaagatc ctcgtacacc ttggttgcta aaagctaaga aaacatcaaa
    1251  caaagcatca gaacaagatg atgatacaga agctgaagac acccagaaaa
    1301  ataaacgcaa gggaaaacaa ttaaaaccac aaacacagct actgcaacat
    1351  acacttgcta aacaaacaaa attcgctcgt cgccaaccgt ttttatcatt
    1401  cggtcccact tacatgacac ttctatgtct tatttctatt atgagcccta
    1451  cttacgctac ggtttgcaca acttatgaac cacttgatca agcagatctg
    1501  tactgtaaca acttgcaaaa cctaaccatc gaaaaatacc acgcttatgc
    1551  aaattatgaa caattgaata gacaatgttt ttccatcgat ggtgctgagt
    1601  ttaaggactt gatacgttta tcggtatcca atgctctaaa tttgaacaac
    1651  gtgatcaaac ctgtacccag ggatgattac atacttaagg ctttctccaa
    1701  tgccctacca ctgaacacac acgtattatc cgactacaac accattttgg
    1751  acttgcaaat acttatgcaa ttctacaact taaatggatc aaacgttttg
    1801  tatactgaaa cttattcaga gagtgaggat tatgcaggta aggtcgtaca
    1851  attgttggcc caaggtacag gaggcatttg taaagcacct gcttgcattc
    1901  tattcaccgg tcttgccacc actgttacag atgtggaggt taaggtcact
    1951  gagcgcttaa ctaaacgtat aaaacatcag gaacacggaa agccactaca
    2001  tatcaacccc tcctgtcgta agacatgtta ttgcatgtat aaacccaaag
    2051  ttaaaccaga acccgtagaa acagtcaaat atgcaccaca ggctgaattc
    2101  tatacacaac ttcgctattt ccaaaaccat gaactacaga tgtatgatga
    2151  ttttgaaatg ggtgtgctac ggtataacaa ttacacactc aatacattca
    2201  tctactctaa cgaaacctgc attctaactc gtggtgtaca ttgtgtttac
    2251  aaccctgaac actttgccat cacacgtgtt tacaacaatt taggaaatta
    2301  cctagagtgc ggtgtaaatc aggaattttg tgaatcattg caacaggaat
    2351  ttatgtttaa cgaaccacaa ttagtcatta ctgaaagtat ggctgttgaa
    2401  gcccccacac aatatcacaa aatttgtgat aatcactaca catcattgca
    2451  agtaaaatac cccttgattg aaaaattatt ttggagtaac ttchatgtta
    2501  gtgtaaaccg cgctttagct gttaaagaac ccgccacgtt cataatagtc
    2551  cacgatacag tagcaatcat caaaacggta attgctgaca ttattgaaat
    2601  tatggaaaaa tgttatgaca gcacagctat aaagctcaca catcacgact
    2651  tcaacaaact ggaatatatg gatgattata gccagattct agtaaagtat
    2701  aagccacttt tggaacaaca taagattatg cttattgaag acatcgactt
    2751  aattacagta ccagctgctc gtgccttatt ttctattttt gatacttaca
    2801  cacccttagt gaatggagtc tttgttttgg gcacgcttaa ccataagcgt
    2851  tacaatgaaa cattggcttt caattactat aacgatcaaa cacctacatt
    2901  ttatgtggat ggtatcttaa atgctaattg gcagctcctg gaagaccaca
    2951  ctcgacaacc gttaattaca cgcgtcgctg ataatgtaca caccattctg
    3001  gctagacaac tcgtaacacc acctgctaaa attgcagcca ctatgcctaa
    3051  agtacccact ttgaataaca cctccaaggt tatcaacacg ctaacacatt
    3101  attccgccca cctcttacaa gttgtaggaa gtgacagcac acaagcatat
    3151  acctatatta acagttcggt gcataatatc accgattatg ttaatgactc
    3201  agtacataac attacagatt atgtatacac ggtgtataac tccacgaaga
    3251  accacatagt tacacgctac aacaatatgc ttatggctgc ttatgacata
    3301  caactgaatt tttacaatca gtaccctttg cgtcaagact tttaccataa
    3351  aggtattcgt gcgtttgatc taggtcaagt ttgcgatttc cttcaccaca
    3401  ctgacacaat tgttttgtat caggattgta taaatcaaaa gttagacacg
    3451  atttatgtca taaaattgcg ttacggacaa aatgctaatg gttaccacat
    3501  gtatcccctt aaacaaccac acactaaaca gaccatctat gaattgagcg
    3551  acgccattgg cttcgtttat aaagatagca aatacaacta ctttagaaca
    3601  ctgtttacga atccagggga atatgtttta acaattcgcg aaaattactt
    3651  ggaatactgc aaatcagact ttagcccaac accagctttc gctccagatg
    3701  caactcttca atgctatgca tacatcacag gtatacaggt tattgacaat
    3751  tttattgctg aatttggttt gtttctcatg ttatacacag ctgccttaat
    3801  aattattttg gcgctagcta ttacaattcg agatagcact atgatgatgt
    3851  ttttaaaact gtctataatt tttgcctaca catttggacc actgttgcta
    3901  acacctaaag tgttcggctc atatatcttt gtttcactct acaacatgct
    3951  accatataca agtaacacca gttatggttg tttacttatg atgggggccc
    4001  tcgcaattac agtcattgac ttatttgcat acatgacaca aaggtaccgc
    4051  tcagaattta ccaagaacat cttgcaattg gccacattac tttttgaaat
    4101  tgtggctatc actaagtata ttttgattcc gtacgtctgc accagctatg
    4151  gtctagtcct aacgatcata gttagctacg ttgcataccg ttacatacaa
    4201  tcacaacgac caaactacct aaaagctaca gtctctaatg ctactgcaca
    4251  tgcagactgg gttgcttaca gaaatacaac gcgtgagaaa acggatgaag
    4301  ctgcgaaatc aaatttgagt aagatcatta acactagcat tgccgatata
    4351  aataaggacc agttcctaga atgcgtatat ctggctgctt gccaccgtgc
    4401  cactgtcgcc gcttcaactt ataaccccaa gcactatttg cacataccaa
    4451  actataatac taaaattatg tttgcgcgtg acaatgaatt gatgaactat
    4501  tcagtgctat caaccgattt aaagaacaag agcgcagcat caaacccctc
    4551  aatttcacat atcgttcttg aactgcctgt tgccattaac cctctaatta
    4601  agtacactac taaaacaagt gtatcaagtc tacgaggagc agtagtcaat
    4651  ggatatattt atattcagcg ccatctgttc ggtagtaaga aacaagaatt
    4701  cgaggcatgt tataataatg gtaaggggct tctaaattgt aagaatctgg
    4751  accgctctaa atatgacatt gattcagcag aattaatagg tacattaatt
    4801  agaatcccac tacacgacaa acaaagtatc ccacatatca gcttacatcc
    4851  agatccatta agttataatg gaccggttac cctctacttg tcacgttacg
    4901  acacggaact aaacaaagat gtactttgtg tacatactgg tttcatgtca
    4951  gaaggacacc acgatattaa gactgtgttt ggcgattgtg gaggtatgct
    5001  atttgacccc aaaggcagat tattaggctt gcattgcgct ggttctgatg
    5051  atgttgtctt tatggataca accacaggaa aatctaacat ttggactagt
    5101  tacaaattgc aacacccatc tgaaattatg ataactttga ataatgaaat
    5151  caatttgccg aatccgacta attatgattt cgagactact aaggttgttt
    5201  atcaacaccc tttgcgtaac gtatgtgcca ctctagaaac actccagcat
    5251  ttaactaaca agactaacgt taaattgcca tatgacccac gtttgttgtc
    5301  agatttcaac attactgctg aacaatatgc ccagtatggc tacaacattg
    5351  actataacaa tttcatcaac aactttaatc gctacacaac tacaactata
    5401  gggaccaaaa gcttcgaaac ctgtataaag tacggactca tggacaataa
    5451  gaaagtcgaa tattacaacc aaactgctac catcttcaat cctccagagc
    5501  atagttctag tggttttgat aacactatgg atgtgttgta tgtgtttgtg
    5551  tatatgttta cacacacaca cccagccttc tatatagccg ctgcttgtgt
    5601  attttgtctt ttctttgtca aaatgaataa gtaccttaaa atgatcctta
    5651  gctccatcat ctttacaatc ccccacattt acgtcaatta ttattatggc
    5701  ttggtttaca tgccattgaa atggcgtaag caaatcactg ctttagccat
    5751  ccgctacaat ccctacacgg ctgtggcact gcgctataat aagaatttga
    5801  acattgcgaa ggatgtagca aaggaactcg gtacccctaa aaatttgtgt
    5851  acgcatttat caacgctctt gaaatgtatt aaaccctacg ccgcctttaa
    5901  cgacctcagt caggtgatca acaatgttga tgatctaatg gctaattggg
    5951  ctaatacata taatgccgaa gagcttctta aacaatacat cgatgaaatc
    6001  tacaagttgt acccaatact attcgttgtt ttcgagaaga ttgagaatta
    6051  tgaagatcag attaaaacta ttttatcata tattagcgat actggtgaat
    6101  tcgatctgaa tggctttgaa atccattttg atgagaagga acacacgact
    6151  aacatcatag acacaaatgt tgaggacata catgacaagt tgatggctga
    6201  aaaagctagt ttgatagctc ttaagaatat gaacctagag ttcgatattg
    6251  aaaccattaa caatgccaac attggtgaac tcgtacgcta tttgataatt
    6301  agttctactc cagacacact cgatcgcgac ttgctatcca ggaccactga
    6351  attactggtt agacgtatac atcagttacg tgatgactcc gaacacaacg
    6401  aaaacctgat cacactattg tcggaaattt acaaacataa agacttctta
    6451  acagcatctc atttgacctc taaccttcgt gatcgtaatt acatcatgaa
    6501  caatctggta cgcgtgatag ctttgtttaa taaacaaata aacatgcaag
    6551  tagcccagaa acagtatgag gcgcgccgca ttgaagaatt gcgtaagaaa
    6601  gaatctaaac aaattatgga acaaaacaac cgtatccgta aaatgcaacg
    6651  tcaaaatcag aacattgcta gtgcaatagt tcatatggtt catgcatgct
    6701  tcgctaaccg ctttatgtta caaaatgaag cccagaagat aatgaaagcc
    6751  cttctcggct ctgacctgga attggacccc actgacgctg aaatgcaata
    6801  ttacacagca tatcgcaacg gtcaagtact aactaatcaa gcaattgtta
    6851  ccaatttcac cacactcacg acaatactat ggacaggtaa tggttatcaa
    6901  accgtaccca gtatgtgtgg cgctcaagaa tttacttgca ctgctacgca
    6951  caaacatggt tattttaatt gcaccatgga gattaaggat gcttggtata
    7001  agcatgctga agaatgtaca aaatgtaaga gttactaccg aacaaataaa
    7051  catccacgtt gtggtgccat ttatgacact accgtgaaac gttatccaac
    7101  actcagtaac ttcattgctc gttaccgtag ctgtccagct tgtatgcctt
    7151  gtacacaatg tttgtcacgc cgtgaacctg gttgtgaaag tgccagttat
    7201  catattgctg acacagcaca ttatcagaat caagcatatt taacacctat
    7251  aaatatcaag ccagacaacc tcgaatacaa cttcgttgat atcaacaatg
    7301  gagatgtaaa cgcaatatac aatggtcgca tatggttaat gagacgtaca
    7351  acagcaatta caccaccacc agcacgttac cgtaacatca ctaatctcaa
    7401  actcaaacag accgatcctg aaggctatta ctacatatct gaagtatgtc
    7451  ccactgactt ggctattttg aacgccatga tcaatcaaat tcaactaaag
    7501  cttttggatc gtactgtcct aaacaatgaa aaccacgtgg aaaatgccaa
    7551  cactatacaa tttaacaacc cactaaatga cacgacacta gacgatttgc
    7601  gcactaaaca taagcattta ttggttatga aattgagacc cgactcggaa
    7651  catcacttca ttgaggtttt gaactttgtt agaatgaaca atttaccaat
    7701  atttattgcc cacgtaactt acgcagacaa caccgttaat catgccacca
    7751  tatatataaa ttatttgcag gcatggcgca atgaaattct tgacgatgtg
    7801  actacaacat gtgatatctt ggagaaaatc attaagcacc ctttggattt
    7851  tcaaggtggg ctcgtactct aagcaggaac agtaatgtag cccgatatca
    7901  ccaactatgt accaccactg atgctggtat aagacacacc atcgatattt
    7951  cctgcaacaa aacttccatc agttatatcg atgaagtaaa caacaacgtc
    8001  aatgtcaaaa taaaatcgca cattgtaaag gaacacaaaa tttatgaaat
    8051  gttaattaac caatatccta acctctttct tattgaacac aagctggtaa
    8101  acgatatcat cccacattta ttacgctata atatgacagc acttagtttt
    8151  gctgacctat atggcttaat taaagaggaa aattggcacc ctatttatga
    8201  tactctacca caagtgacat atcataaaat taatgacgat ctactactaa
    8251  aaattaaatc gcacactcca tctccacagc acacctgttg tatgctatgc
    8301  cgtcgttttc tagttgaatt cggcttgctt ttgcataaac taaattataa
    8351  ggtattcgaa acaacacgcg ctatgttaac tcattatgat tttgtattga
    8401  ccgcagacaa tgttgatcta aatggtatac tggattttga agattacaaa
    8451  ctaagaaaat gtacagtagc atatgatgtt aaatcacaat tgcgcattat
    8501  gcaaccttac taccacgcat tgtactcttt ctatgaacat acaggtatgt
    8551  acttcattag ccaacccatc tacaactcta tagtggatcc cagtctcgat
    8601  ctaattcaac agtttgaatc ggcagttgag gctactcgaa atctaccatt
    8651  ggacgctaag tttgatgaca caccattata tagaccaact atacaacatc
    8701  tcgccgaata tctaaaattg aacatatatg ctatggagcc ggaacctcta
    8751  tggaattgct acgatactat ggattgtccg caaatagaat taccgggaat
    8801  agacaacgct attacaagca taattacaaa accaacaaga cctctctcgg
    8851  aatacatcga attgaaccac accacagtta aaaattttga cggtgacata
    8901  tactgtaaag taaatcacaa cgaaattaac aaccttcagg atatacttta
    8951  ctgtatgccc acagacgcta caattcatga actatacata gttgaccacc
    9001  cctacgagtt agaatcacac aaccgcatgc tacgcactag tttaaatatt
    9051  tggcttcata atctttatga tgctaatgta aacttatcac actttgattc
    9101  aataaattat gacaaaacac gcaaagctag tttccccatt gttggtactg
    9151  taccagcaat aacattgcgc gactgcgaaa tttgtcaaga cgaaatacca
    9201  gatgacctga aggatgtcta tgattttgga tcttgtgtgc atgcaaaagc
    9251  ccagttgtct gattatacaa caccacgtaa gttgaaccca ttgatagaat
    9301  ttgatccagc actacttcgt catggagaat ttctaccaaa taatgattac
    9351  gcatacacta tgaagactaa accagaccac ttaattgatc gagaactaaa
    9401  agattacatc gattcaactg gtttaacagc tttaatacca ccactggaca
    9451  tcaaccccgc tgtacatgac cctgaaacaa catattcaag ttcgtattat
    9501  ataaaaacac catcagaaac atctatacga caagaccttg aattgtttaa
    9551  tcagaacacc gctggctcag tttcacctac agtcttttta atggctatag
    9601  aattattaca tcaactgtta actgaagaaa tttctgcttc agacggcaag
    9651  ccaaactgtc ctatggtgcc ctcagaagta cctgtacgca acaaacataa
    9701  atcagccggt actccatacc gaaaatttgg tgattcagaa ttcatgcgcg
    9751  aattatatgg taactatcgt gacgctattg tttatcataa gcgccattct
    9801  gcagatcaac agctaacact aactatcaat aaggttgccc cttcgaaaaa
    9851  tcatcgcgat cgtacaatcc tcgccattag tataaataaa tcagaaccag
    9901  gacgctcact ttatcgttgg aatttggata aaataaaata cacctccagt
    9951  ttaggtggtc caattctaat cggttttaca gcacaatacg gtggttggga
   10001  taaactctat aaatatcttt ataaaaattc ccccgcagac aacccagaca
   10051  cagcagaaca tgcagtgctt ggtggaaagg attatccaaa atgggatcgc
   10101  cgtatttcta acatgctaca actaacaact acaactgttt tatacagttt
   10151  gatagatcca aacactcaga gaaaactcaa taacgctaca cccgcacaaa
   10201  cttggcacga atatatggct gaaacaacac aagtcttata tgactatctc
   10251  gtctttggca atgaattata tcagaaacct ggaggtgtaa cttcaggtaa
   10301  tagtcgcaca gctgatggca attcactact tcacttattg attgactttt
   10351  atgctataat tagtcaattg attcaatcaa caccagaaaa cgtacatcta
   10401  gaagtgaatt tacgtaacgc tttgtgtaaa acagttttta ccagaatacc
   10451  ctcggattac atagattcaa gctgtgtaac acttagaaac actgatacat
   10501  tacacacaat tcgccgacgc gtagccaaag gagcttattt aagcgacgac
   10551  ggtttaatcg ttatagaccc acgcattata aggtatgacg actttatgtc
   10601  tgttagtcac cttattagcc attacatgat agcacaaaac aagcacaaat
   10651  atcacatcga cgctatccaa cgctatgcaa gagaattcct atcacaagac
   10701  actattaagt ttggtgatat ggtttaccca atacctgagt ttggacgcat
   10751  gtacaccgca atgctcctga gtgacaataa aaacacttta gacccacaaa
   10801  ttaatatcac gcgtttattg gcactatttt catatttata tatatactat
   10851  ttcaagtatg aagatcaacc cactcatcca atattaaaat ttcttgatgc
   10901  gctaagaacc tacatagaaa ataaactgaa tacaacggat gaaattttct
   10951  tagactgcat caaagttcct gatttacagg atgtagaatt tgaccttaaa
   11001  aattgtgatt tatacgaaaa ttttgactac ttatgggggc ttgaccaatc
   11051  aagtgcctat atggaatatc tctgtaaata caaacaccgc tatcgtaatt
   11101  tatcgctgtt taaacgtcaa cttatacaac accacgaaga agctcaattg
   11151  cataatgaaa ataagcttat gaataaagga aaattaatca cgtacaattg
   11201  ctatgtttgt ggcgaaaatg cgtatttaac atgtgctaca tgtgaacgcg
   11251  cattttgcaa tagtgcagat accaatcatg gctcacatat ggaacaacat
   11301  ctacaatatt caggtcatac ctgtttatac ctaaattgta aaactgtaaa
   11351  atgtcaacat tgttttacga tggacatcaa cttactatac accactggcc
   11401  gtgaccacta ttgtgaatca cataagccta aaaatgccgt acgtatacta
   11451  aactacaatg ctaatacaaa attaccaccg ctcctttatc tatgtgtaac
   11501  agacacaaag cgtgtgacat tttatgaaca atgttacatt aattacacaa
   11551  aagcacaccc gacctatgca ataagtaaag aacaatttat gagccttatt
   11601  cagctgtatc tacatcaaga ttacacacta ccagttaatc aattagctaa
   11651  ccggattaga gttagtctac aattgagttc atatggtgta gtcagaccat
   11701  atcatcaatt aattatgcag cttacaaaat tagaaagcaa agtcttagac
   11751  tcaagtgttg ttgatatacc aattacactc atcaattcac aagaaattgg
   11801  gacatactac attgaaatac ctcgggaaca caaactcgac caacattcca
   11851  cctattccta tctactagga actcgcgagg taagtttcac acccaattac
   11901  taccgcctaa gcagcacaaa cactcatata tggcaaactg acacacaaat
   11951  tccaaattat tgtactttta tacgacagcg tcgtctaaat accttaagcg
   12001  ctattctacg caacacaaca caacatgtgc cagaatttac gcgtttgcta
   12051  ttagaatgga atcaacaatt accgattaca gctaaaccct ttgcagaatt
   12101  taaaccttca ttgaaaattc cagctcagcc gaatgtgact gacaatatta
   12151  atacgctgct aaaggagctg aatgtaaaac gttttaaaat catgtttggc
   12201  gggcctggta caggaaaatc tcacacacta tctattctca taaaccattt
   12251  acatgagaag ggtctgcgaa ttctagtgta cacaccatca caccaatctg
   12301  ccaatgcttt gctatataaa atagcaaact tgattaaaag acgcactata
   12351  caaaaccccg gattagtcag aattattaca gatggcatga aagaagaaat
   12401  caagccacac ccatatataa cttatcgtac aaatatgcta gacaaagacc
   12451  gcatttgcgt gacaactata caaagttttt caactgtaca gcatgttaag
   12501  gatgtagatt tagtaattct tgacgaattc agtttaactt cggataatta
   12551  cctactaacc ggccttgcac atctaaaacc ttctacacgt gttttgtttt
   12601  ctggtgaccc cagacaactt agcggtgtgg atgagattag aaaaccacta
   12651  caatcacgtt ttcatacttt gattaattat tacactgaaa cctacccgcg
   12701  agaagtgcat gtgttaaaat accactttag atgccaccca agtatattcc
   12751  agtatttcaa ggatctgtat tatgcagata aagacatgga atgtgcgaca
   12801  tctattgcag atcgtattat acgcccactg aatccaatta atacagtgca
   12851  agtcagcgaa cccactttta gaaatcaagg tgtaatatta aatcaagatg
   12901  aagccgataa ggtcctagaa attctagtgc ttgtaaatca aacactagca
   12951  ctccattcaa gttatgaata ccaacccact attgcaatta tatgtagtta
   13001  caaatcacaa cttcaaaatt ttatctcact acagcaacag aaaattcttt
   13051  cagagaatgt caatttaagc actatcgatt ctgctcaagg cgacgaattc
   13101  gatatagtaa tactatgtct ttcccaaatt aacaacttca cgttaaatcc
   13151  taatcgattt aatgtagcaa tttcaagggc taagtcagtg ttgtttataa
   13201  cagttccccc tattgacaaa aaccccgcat ttctttttaa agatgtgtac
   13251  gcaactttgc ataaacacaa tttgacatac tttaagattt acaacactag
   13301  tggtaaagca atactttctt tagattcacc aacaacgtta aagaccaaag
   13351  cagagaaaat gacatataca aatgtacgac atctcgacaa gaacacccat
   13401  actatgcaac gaaaatttcc aatgaacata gttatggacg actgtatatg
   13451  ctttgatgct gaattcttaa accctagaga caacgtacaa gaaccagtaa
   13501  tgctttcata tggtttctcc agtaaatatg gcaaacgacg tatagcaggc
   13551  attccagtgc gctatataaa agacaaattc aacagaatag tcccacaaaa
   13601  gtacaattac aaggataata acaaaccatt aacatctact tacacctgcg
   13651  actggatgag gaaacaacac cctgaacaat ataaacacct cctaacatca
   13701  gttatgcaag gaatccgtaa tgacaccact gtagatctta agccactgct
   13751  taatttttgt gtagataaca tgcatgtgaa acctgttatc gtcacatggt
   13801  ctggcgctag tgaccactgt ttcttaaaag cacacacgct atatccagac
   13851  attgcaacag tatgtaatat aactatacgc tgcacgtcac aaccaattta
   13901  tgcttcacca caaggccgac acacttatta cctctgccaa tatcacgcac
   13951  accaacttaa agaccatgtt aatataactc attttgtaaa tctcgagatc
   14001  atagatctca aggtagaccg caatcaatat acagatgaga gaaccttaag
   14051  agtgtatcac aacgattact taaagctaac attggacctc gataatgtag
   14101  cctctaatag tctaactgac tgtcatacta gatactgcag aaccgtacat
   14151  gcacccgcaa caccacatga cccacttgat gatgccatca tgacacagtg
   14201  tatttatcaa tcttttgtat tgtcacatct tgaaaattta gcatacgaac
   14251  cacaagctaa cctcaaggcg tttacatcta tggactaccg ccttaaaaat
   14301  ttcaatcctg agatgtgtaa actaagacgc gaattacaaa aagtttggta
   14351  ctcacaatac acaaacacta acaaaacaca ctgcaatatg ggctgcggaa
   14401  aagaaccatt acaacaagct cttcataaca tcgacgtatt acaaggtaaa
   14451  tcaaatccac agaataacat gaacacccac acctgtgatt ctgaagagca
   14501  tatatatttt gatagtcact ggtataaagc tggtggtttt acaaagcctt
   14551  cgtacatttt tagcgacatt aataaagaac actattacaa actcgggacc
   14601  actggcttat gtctatactt aaatagtaaa tacgccaaat acgtacatga
   14651  ataccgcact gttagcggta acgacgtttt taaatcacta tattcaccat
   14701  attgtgattt aggcagaaaa ccacaccaag cagaaataga acctagctgc
   14751  tcaatacccg attgtataat cacatctaac ataggcgaac gttttcaaac
   14801  tttagtttgt aacgtacaca aagatcaaat ggacattatt agcaaaatct
   14851  cacaagccac taagtatggg tatcaattta tctatactgg taaaactttg
   14901  ttaaataacc acgcagcctt atctaaagct ccacataatt gggatcatct
   14951  aacattagaa attcctggct ataacacacg taaacagcat tccagtcata
   15001  tgactactaa agctttaggt atactacata tactacaaga tagtatgtta
   15051  tatacaaacc gtaaaacact gaatcctaac ttaccggtta ttttacctgg
   15101  ttcggctagt tatctcggtg atacagtact tgctaatgaa atggctaaaa
   15151  ccctcaaaca aacaaaattc atacacattg atccacgcct aaaaatagat
   15201  aacaacacaa cacaccacag aaaaacacta atggaaatgc tagacatagg
   15251  ttatacaacg gaattaataa tttcagacat ccatgataat aaaaatccat
   15301  ggattcccga gttaatgaca tacactttaa agtaccttgt agatactgga
   15351  accctcatta tgaaaatcac aagtcgcgga gcgactgaag acgtactaca
   15401  acaacttgag gacctttcta aaaattttac atacgtaaga gtgtgtaatt
   15451  taaatgctgt aactttttcc tcagaattat ggatagtctt cgcgaataag
   15501  cgtaagccac ccgtacaagg ctggacatca catgaactga gggctgagtt
   15551  acgtaagcat tggtattcta tgacacgcag tataattcaa cctctaatgc
   15601  gttctagaca aagtgtattc agatactctc ccaaataacc cactagttaa
   15651  tcccatttta tgattaactc aaaatgcccg ctactatttc aaacaacaac
   15701  aacgtcaatg cccaaccagg cacatcgaaa cagacccggc ccaccaataa
   15751  ttctaaaacc ggaaacgcaa tggcaccaaa accacaacgc caacgcaagc
   15801  agcagcagca agcttcatcg cagtccccta aacaaccacc caaaggacaa
   15851  ccaaaaccta aacgcaacca acctaatgct gcagcatctg cccagcctaa
   15901  ggtcaagaaa gcaattgcaa caggccccaa ctacaccgaa accagcggta
   15951  aactttacaa aatcggaaaa gaattcgatg ctagaaacca catgggatgg
   16001  aggaaaaatg aaaagactgg atcaacagtc cagttcctct ttaaacctaa
   16051  aatggcatcc cgaattgacc aagtctacta tcgcaatcaa tttgaggatc
   16101  ctgaccattt catccatact ttcggtgttg gcgtatttgt tcaagactca
   16151  acccttgagc gcaatgcaat ttacaaccac caaaaactca ctactgaaga
   16201  aaaagatgag tacgttcgta aactatctga tgccttcaat gcaatcctcc
   16251  tgcgtacacg tcaagcattt gactctggtt cccttcctgc tcttactgtt
   16301  gatgctgcct aatgctaact gctctacccg tatagatttg agcacacacc
   16351  acatagtttc atataacaag ccacttatag tggtagatga ttttctgaag
   16401  acaactttaa aatataattt tggcactgat ttgtataata gtgctataaa
   16451  ttataaaacc tctttcgagc aacttctgaa taactttaaa acaccttacc
   16501  aaccacttgt tgacgccttc cgcgttttat ttagttattt aggtatagaa
   16551  cccgtagccc atccattcaa ggattacttc aatgctgatt ccccttgccc
   16601  cttgcaaact acaacaacca ctggcgatgt aaccactata ggtgaacatt
   16651  tccaagaaat cttggatgat ggtaatttag aattggaacc actagctagt
   16701  tattggctta gacatacaga agatattttt gtatacacac gatcacaact
   16751  atgggccttt atatgtcctt ctgaatttgc acaagctagt atatttttac
   16801  ccaattatac tgaagccatt tataatgtat caaccgcttt ttgtaaaact
   16851  gtctattatg acacccctac aaatgccttc aatgctgaaa tttgtaataa
   16901  agttaatttc atcacaccag ccaaagcaca aaaacgcagc aaacgttggg
   16951  attcttccta cgtttgtggc tggccacttg tatctagcgc cgctaaagtg
   17001  ctgggaggtg aatgtacaac taacatcgac attggtactt taaaatcaag
   17051  tctaaatgct attcaaaatt tctcttatgc aaatacagaa ttaatccacg
   17101  acttacaatc acagcttagt gttgtaaatg ctcgcacaaa tcttcactat
   17151  aatcaacttc aacaactcgt cacagctata aatgatcatc aagctaaata
   17201  tgtgaatgac ataaacaact taattaacca tattaagaat acaactaaca
   17251  ctatggaaaa tcgcattaat gttaactcca ttattatgtc ttatactaat
   17301  tcactctttc gtgtatatca aaatatagtt gattatcgct tcgcgtatat
   17351  agaaacactt agttccatac agcaacacta tcattttccc tctgaacatc
   17401  tacatgcttt taatgtccct cttcaggcta aacttcgaga gcatgggttt
   17451  tcaataccta ttatagattc aaatatccct tactcttatg gtaaagttag
   17501  atatctaaat gtaactggca taaattttta tgaccttgaa tttgatatct
   17551  acatccctgt aataaaactc attcatgaaa aagatagtaa atactaccat
   17601  tcaacgcttt cagcgctccc cattggaatt aatactacac tagtaactta
   17651  caatacttat cagggtaatg ctatatgcac agatacgtat tgtcttgagt
   17701  cacctattaa cggattttgc cgcgaaggtg agagttattg gtattgtggc
   17751  caacattata ttagaacact tcataaaata acctctttgt ataccaaacc
   17801  cacaaaattc actraaagtg ccatgtttat accaccacat actatgtact
   17851  ttgtacataa taccacttat tcattaaact atggatcctc acttcaagca
   17901  ctagctggtt ctatactgat gctaacttgc aactctacag tccagatccc
   17951  tgggtacaac ttcaactcca acgattttgt ttcgtgcaca gatatgaatg
   18001  ttaataatgt ttttatacat ccctccctac gtgtcaatga cgcaaacttc
   18051  tatataccgc ccactcgagt agatctactt gaaaaacttt ataaacgcga
   18101  tataatccct atactaaatc atatacaaaa agctaatgac ataactatag
   18151  acaccacggc tgacgaagaa ttaaaacaac aatatgaaac acttaaaagc
   18201  gatttcaatg ctaaatacga cgcattgaat atagaaaata gacgaataca
   18251  cgcactaatt aatagtatgc attccataca atcggaaccc tcatatattc
   18301  tatacatggt tatcgcagta atcgttttta tagtcctcaa atttcttaga
   18351  atcatataat catactacta ctatcaaaat ataaaaaccc cctttttaaa
   18401  tatgatagtc aaaattacca tcctttttag catcctcgct gtcgctatgg
   18451  cagcagacac gaccccagaa gtggtgtcac cctccactaa gctctgtgaa
   18501  gcaagttcta cacaacactg tactgcaatg ggctacgatt actgcaaaag
   18551  catctctggt gtccagagtt gttactgctc ccatgtacaa aatttcacaa
   18601  gcgttatgga cgtcatcgac aaaaatttga aatgctcaat tacgtctagc
   18651  aaatatctag acccacacta ctggtttcgc gacctcctgg cggctagcgt
   18701  cacacttttg gtcatattca ctgctattac ttgggcttat cttattccta
   18751  cttatgctaa aatcgatgct atttatacga actcaacctc taaagcaaaa
   18801  caattgcact acatccccct actaccacgg cagtctgacg gcagttatac
   18851  gctcctcccc ggacgatcgt ataaataaac tgcaatcacg tatccagtca
   18901  gaaaatcgac ttaggtggtt gtttaataac ttctctaatt gcatcacttg
   18951  ttcgtataag ttagctacat ttttatatta catatttacc gcaatatact
   19001  atggcttctg ccttattatg ttatatatat tatggattta ctttacccaa
   19051  ttaactaacc aaattaaact tgtatatcac aactttagta atccatataa
   19101  ctaggtttta aatcacatca ctaaataatg taataatacc cactaataca
   19151  aaacacttta ttacatccac ataaaaccgc ctgagtttag ttaaagctgt
   19201  atactttacg cctccttgga ggattctaga cagaccattc tagacagcac
   19251  taatttaatc acgcgtttct taacacgcat ttaacacagc aaaatacaaa
   19301  aatttttacc tatgccaaat gccaaattta ctacacacac ctaaattcac
   19351  accacattga taaactaaac accacttaaa attcaaaatc actctataca
   19401  ttcttaggaa agtcatgttg gaaagtacgc tagatcttgt tgttggcagg
   19451  aagcgtagtg ctcatgttta tgtgtccggc ctcgggccta atcgaagatt
   19501  tttatacata cggacacaag gcctggaaca agccgatcat tcaagtaaat
   19551  aacatccact gaacaaaagt tgaaccactg aatgagacta atgtatagaa
   19601  tagagacgca aacacactac gcgggatcga accggaaacc acacattgta
   19651  ccggccattc tttgtacacc acttattata gtttagatgt cagctattat
   19701  gaattgtttt gtatttctta ccactatagt ctcgcctgta agagagattg
   19751  tacgcaatat acaacacact acaattctag tagacattga acagcagggt
   19801  attgcccgcc taacttataa tacgccagtg tactgatcac tttccatggc
   19851  ggaataacca ccaacacata ccacactatc taacattaca tacatggaca
   19901  aaacacaaca gcagaaatat caacagacaa ttaagaccga acaccactgg
   19951  tgagtaggtg tactgaactc cgaggagacg taggtacatg gaattgttat
   20001  agactgcaga tatcaataca tatcttgtgc gagaaaatac attgcgagag
   20051  acgcattgag tagtgaagca ttaggcaccc gaaaacggtt agggcttagt
   20101  agtatggcgc ttggcattac aggtaaaaaa aaaaaaaaaa aaaaaaaaaa
   20151  aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa