Sequence of DPV Alphamesonivirus 1
Nam Dinh virus isolate 02VN178, complete genome.
ACC No: DQ458789
Dated: 2012-05-04 | Length: 20192 | CRC: 1601306517
ID DQ458789; SV 2; linear; genomic RNA; STD; VRL; 20192 BP.
XX
AC DQ458789;
XX
DT 02-APR-2007 (Rel. 91, Created)
DT 04-MAY-2012 (Rel. 112, Last updated, Version 4)
XX
DE Nam Dinh virus isolate 02VN178, complete genome.
XX
KW .
XX
OS Nam Dinh virus
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales;
OC unclassified Nidovirales.
XX
RN [1]
RC Publication Status: Online-Only
RP 1-20192
RX PUBMED; 21931546.
RA Nga P.T., Parquet Mdel C., Lauber C., Parida M., Nabeshima T., Yu F.,
RA Thuy N.T., Inoue S., Ito T., Okamoto K., Ichinose A., Snijder E.J.,
RA Morita K., Gorbalenya A.E.;
RT "Discovery of the first insect nidovirus, a missing evolutionary link in
RT the emergence of the largest RNA virus genomes";
RL PLoS Pathog. 7(9):E1002215-E1002215(2011).
XX
RN [2]
RP 1-20192
RA Nga P.T., Parida M., Parquet M.D.C., Thuy N.T., Suu P.T., Khan A.H.,
RA Salda L.T.D., Yu F., Inoue S., Ito T., Morita K.;
RT "Identification of a novel mosquito virus in Viet Nam related to the
RT members of the order Nidovirales";
RL Unpublished.
XX
RN [3]
RP 1-20192
RA Nga P.T., Parida M., Parquet M.D.C., Thuy N.T., Suu P.T., Khan A.H.,
RA Salda L.T.D., Yu F., Inoue S., Ito T., Morita K.;
RT ;
RL Submitted (22-MAR-2006) to the INSDC.
RL Virology, Institute of Tropical Medicine, Sakamoto 1-12-4, Nagasaki
RL 852-8013, Japan
XX
RN [4]
RC Sequence update by submitter
RP 1-20192
RA Nga P.T., Parquet M.D.C., Lauber C., Parida M., Nabeshima T., Yu F.,
RA Thuy N.T., Inoue S., Ito T., Okamoto K., Ichinose A., Snijder E.J.,
RA Morita K., Gorbalenya A.E.;
RT ;
RL Submitted (07-JUL-2011) to the INSDC.
RL Virology, Institute of Tropical Medicine, Sakamoto 1-12-4, Nagasaki
RL 852-8013, Japan
XX
CC On Jul 26, 2011 this sequence version replaced gi:108744356.
XX
FH Key Location/Qualifiers
FH
FT source 1..20192
FT /organism="Nam Dinh virus"
FT /host="mosquito"
FT /isolate="02VN178"
FT /mol_type="genomic RNA"
FT /country="Viet Nam"
FT /collection_date="2002"
FT /db_xref="taxon:325676"
FT gene 361..15638
FT /gene="ORF1ab"
FT CDS join(361..7851,7851..15638)
FT /codon_start=1
FT /ribosomal_slippage
FT /gene="ORF1ab"
FT /product="pp1ab polyprotein"
FT /note="non-structural polyprotein; ORF1b expressed via -1
FT ribosomal frameshifting; contains two transmembrane domains
FT that flank a 3C-like proteinase (ORF1a-encoded), as well as
FT RNA-dependent RNA polymerase, Zm-Hel1 helicase, 3'-5'
FT exoribonuclease, N7-methyltransferase and
FT 2'-O-methyltransferase domains (ORF1b-encoded); cleavage
FT sites not determined"
FT /db_xref="GOA:G1K4K6"
FT /db_xref="InterPro:IPR003593"
FT /db_xref="UniProtKB/TrEMBL:G1K4K6"
FT /protein_id="ABG02430.2"
FT /translation="MTYHDYALKDNVVLKRDQKLALDNFVTEVIQFWTPILTTLLLLAY
FT ALRKIMHNPFVGPISDNPLKRALQWIIFVFTRRNLYYQTPVFARDESRLNIFLHNDFAR
FT LDRNTLNGYCKLCNLYGHNHTDKHNPTIDALVLAKTCKLLRYNDKVTKPLAYTVHNIRA
FT YEKNTKTFADTFGTTTTNIPTKYALAPKKAVSELTTIESNLGPIYVNNTIAYPHLGFIA
FT YDNKQHLQELLANVTVVLDTIMVYTQYELDDATMNIRKSDITLSFVNDFDLTNALTNEL
FT KDPRTPWLLKAKKTSNKASEQDDDTEAEDTQKNKRKGKQLKPQTQLLQHTLAKQTKFAR
FT RQPFLSFGPTYMTLLCLISIMSPTYATVCTTYEPLDQADLYCNNLQNLTIEKYHAYANY
FT EQLNRQCFSIDGAEFKDLIRLSVSNALNLNNVIKPVPRDDYILKAFSNALPLNTHVLSD
FT YNTILDLQILMQFYNLNGSNVLYTETYSESEDYAGKVVQLLAQGTGGICKAPACILFTG
FT LATTVTDVEVKVTERLTKRIKHQEHGKPLHINPSCRKTCYCMYKPKVKPEPVETVKYAP
FT QAEFYTQLRYFQNHELQMYDDFEMGVLRYNNYTLNTFIYSNETCILTRGVHCVYNPEHF
FT AITRVYNNLGNYLECGVNQEFCESLQQEFMFNEPQLVITESMAVEAPTQYHKICDNHYT
FT SLQVKYPLIEKLFWSNFXVSVNRALAVKEPATFIIVHDTVAIIKTVIADIIEIMEKCYD
FT STAIKLTHHDFNKLEYMDDYSQILVKYKPLLEQHKIMLIEDIDLITVPAARALFSIFDT
FT YTPLVNGVFVLGTLNHKRYNETLAFNYYNDQTPTFYVDGILNANWQLLEDHTRQPLITR
FT VADNVHTILARQLVTPPAKIAATMPKVPTLNNTSKVINTLTHYSAHLLQVVGSDSTQAY
FT TYINSSVHNITDYVNDSVHNITDYVYTVYNSTKNHIVTRYNNMLMAAYDIQLNFYNQYP
FT LRQDFYHKGIRAFDLGQVCDFLHHTDTIVLYQDCINQKLDTIYVIKLRYGQNANGYHMY
FT PLKQPHTKQTIYELSDAIGFVYKDSKYNYFRTLFTNPGEYVLTIRENYLEYCKSDFSPT
FT PAFAPDATLQCYAYITGIQVIDNFIAEFGLFLMLYTAALIIILALAITIRDSTMMMFLK
FT LSIIFAYTFGPLLLTPKVFGSYIFVSLYNMLPYTSNTSYGCLLMMGALAITVIDLFAYM
FT TQRYRSEFTKNILQLATLLFEIVAITKYILIPYVCTSYGLVLTIIVSYVAYRYIQSQRP
FT NYLKATVSNATAHADWVAYRNTTREKTDEAAKSNLSKIINTSIADINKDQFLECVYLAA
FT CHRATVAASTYNPKHYLHIPNYNTKIMFARDNELMNYSVLSTDLKNKSAASNPSISHIV
FT LELPVAINPLIKYTTKTSVSSLRGAVVNGYIYIQRHLFGSKKQEFEACYNNGKGLLNCK
FT NLDRSKYDIDSAELIGTLIRIPLHDKQSIPHISLHPDPLSYNGPVTLYLSRYDTELNKD
FT VLCVHTGFMSEGHHDIKTVFGDCGGMLFDPKGRLLGLHCAGSDDVVFMDTTTGKSNIWT
FT SYKLQHPSEIMITLNNEINLPNPTNYDFETTKVVYQHPLRNVCATLETLQHLTNKTNVK
FT LPYDPRLLSDFNITAEQYAQYGYNIDYNNFINNFNRYTTTTIGTKSFETCIKYGLMDNK
FT KVEYYNQTATIFNPPEHSSSGFDNTMDVLYVFVYMFTHTHPAFYIAAACVFCLFFVKMN
FT KYLKMILSSIIFTIPHIYVNYYYGLVYMPLKWRKQITALAIRYNPYTAVALRYNKNLNI
FT AKDVAKELGTPKNLCTHLSTLLKCIKPYAAFNDLSQVINNVDDLMANWANTYNAEELLK
FT QYIDEIYKLYPILFVVFEKIENYEDQIKTILSYISDTGEFDLNGFEIHFDEKEHTTNII
FT DTNVEDIHDKLMAEKASLIALKNMNLEFDIETINNANIGELVRYLIISSTPDTLDRDLL
FT SRTTELLVRRIHQLRDDSEHNENLITLLSEIYKHKDFLTASHLTSNLRDRNYIMNNLVR
FT VIALFNKQINMQVAQKQYEARRIEELRKKESKQIMEQNNRIRKMQRQNQNIASAIVHMV
FT HACFANRFMLQNEAQKIMKALLGSDLELDPTDAEMQYYTAYRNGQVLTNQAIVTNFTTL
FT TTILWTGNGYQTVPSMCGAQEFTCTATHKHGYFNCTMEIKDAWYKHAEECTKCKSYYRT
FT NKHPRCGAIYDTTVKRYPTLSNFIARYRSCPACMPCTQCLSRREPGCESASYHIADTAH
FT YQNQAYLTPINIKPDNLEYNFVDINNGDVNAIYNGRIWLMRRTTAITPPPARYRNITNL
FT KLKQTDPEGYYYISEVCPTDLAILNAMINQIQLKLLDRTVLNNENHVENANTIQFNNPL
FT NDTTLDDLRTKHKHLLVMKLRPDSEHHFIEVLNFVRMNNLPIFIAHVTYADNTVNHATI
FT YINYLQAWRNEILDDVTTTCDILEKIIKHPLDFSRWARTLSRNSNVARYHQLCTTTDAG
FT IRHTIDISCNKTSISYIDEVNNNVNVKIKSHIVKEHKIYEMLINQYPNLFLIEHKLVND
FT IIPHLLRYNMTALSFADLYGLIKEENWHPIYDTLPQVTYHKINDDLLLKIKSHTPSPQH
FT TCCMLCRRFLVEFGLLLHKLNYKVFETTRAMLTHYDFVLTADNVDLNGILDFEDYKLRK
FT CTVAYDVKSQLRIMQPYYHALYSFYEHTGMYFISQPIYNSIVDPSLDLIQQFESAVEAT
FT RNLPLDAKFDDTPLYRPTIQHLAEYLKLNIYAMEPEPLWNCYDTMDCPQIELPGIDNAI
FT TSIITKPTRPLSEYIELNHTTVKNFDGDIYCKVNHNEINNLQDILYCMPTDATIHELYI
FT VDHPYELESHNRMLRTSLNIWLHNLYDANVNLSHFDSINYDKTRKASFPIVGTVPAITL
FT RDCEICQDEIPDDLKDVYDFGSCVHAKAQLSDYTTPRKLNPLIEFDPALLRHGEFLPNN
FT DYAYTMKTKPDHLIDRELKDYIDSTGLTALIPPLDINPAVHDPETTYSSSYYIKTPSET
FT SIRQDLELFNQNTAGSVSPTVFLMAIELLHQLLTEEISASDGKPNCPMVPSEVPVRNKH
FT KSAGTPYRKFGDSEFMRELYGNYRDAIVYHKRHSADQQLTLTINKVAPSKNHRDRTILA
FT ISINKSEPGRSLYRWNLDKIKYTSSLGGPILIGFTAQYGGWDKLYKYLYKNSPADNPDT
FT AEHAVLGGKDYPKWDRRISNMLQLTTTTVLYSLIDPNTQRKLNNATPAQTWHEYMAETT
FT QVLYDYLVFGNELYQKPGGVTSGNSRTADGNSLLHLLIDFYAIISQLIQSTPENVHLEV
FT NLRNALCKTVFTRIPSDYIDSSCVTLRNTDTLHTIRRRVAKGAYLSDDGLIVIDPRIIR
FT YDDFMSVSHLISHYMIAQNKHKYHIDAIQRYAREFLSQDTIKFGDMVYPIPEFGRMYTA
FT MLLSDNKNTLDPQINITRLLALFSYLYIYYFKYEDQPTHPILKFLDALRTYIENKLNTT
FT DEIFLDCIKVPDLQDVEFDLKNCDLYENFDYLWGLDQSSAYMEYLCKYKHRYRNLSLFK
FT RQLIQHHEEAQLHNENKLMNKGKLITYNCYVCGENAYLTCATCERAFCNSADTNHGSHM
FT EQHLQYSGHTCLYLNCKTVKCQHCFTMDINLLYTTGRDHYCESHKPKNAVRILNYNANT
FT KLPPLLYLCVTDTKRVTFYEQCYINYTKAHPTYAISKEQFMSLIQLYLHQDYTLPVNQL
FT ANRIRVSLQLSSYGVVRPYHQLIMQLTKLESKVLDSSVVDIPITLINSQEIGTYYIEIP
FT REHKLDQHSTYSYLLGTREVSFTPNYYRLSSTNTHIWQTDTQIPNYCTFIRQRRLNTLS
FT AILRNTTQHVPEFTRLLLEWNQQLPITAKPFAEFKPSLKIPAQPNVTDNINTLLKELNV
FT KRFKIMFGGPGTGKSHTLSILINHLHEKGLRILVYTPSHQSANALLYKIANLIKRRTIQ
FT NPGLVRIITDGMKEEIKPHPYITYRTNMLDKDRICVTTIQSFSTVQHVKDVDLVILDEF
FT SLTSDNYLLTGLAHLKPSTRVLFSGDPRQLSGVDEIRKPLQSRFHTLINYYTETYPREV
FT HVLKYHFRCHPSIFQYFKDLYYADKDMECATSIADRIIRPLNPINTVQVSEPTFRNQGV
FT ILNQDEADKVLEILVLVNQTLALHSSYEYQPTIAIICSYKSQLQNFISLQQQKILSENV
FT NLSTIDSAQGDEFDIVILCLSQINNFTLNPNRFNVAISRAKSVLFITVPPIDKNPAFLF
FT KDVYATLHKHNLTYFKIYNTSGKAILSLDSPTTLKTKAEKMTYTNVRHLDKNTHTMQRK
FT FPMNIVMDDCICFDAEFLNPRDNVQEPVMLSYGFSSKYGKRRIAGIPVRYIKDKFNRIV
FT PQKYNYKDNNKPLTSTYTCDWMRKQHPEQYKHLLTSVMQGIRNDTTVDLKPLLNFCVDN
FT MHVKPVIVTWSGASDHCFLKAHTLYPDIATVCNITIRCTSQPIYASPQGRHTYYLCQYH
FT AHQLKDHVNITHFVNLEIIDLKVDRNQYTDERTLRVYHNDYLKLTLDLDNVASNSLTDC
FT HTRYCRTVHAPATPHDPLDDAIMTQCIYQSFVLSHLENLAYEPQANLKAFTSMDYRLKN
FT FNPEMCKLRRELQKVWYSQYTNTNKTHCNMGCGKEPLQQALHNIDVLQGKSNPQNNMNT
FT HTCDSEEHIYFDSHWYKAGGFTKPSYIFSDINKEHYYKLGTTGLCLYLNSKYAKYVHEY
FT RTVSGNDVFKSLYSPYCDLGRKPHQAEIEPSCSIPDCIITSNIGERFQTLVCNVHKDQM
FT DIISKISQATKYGYQFIYTGKTLLNNHAALSKAPHNWDHLTLEIPGYNTRKQHSSHMTT
FT KALGILHILQDSMLYTNRKTLNPNLPVILPGSASYLGDTVLANEMAKTLKQTKFIHIDP
FT RLKIDNNTTHHRKTLMEMLDIGYTTELIISDIHDNKNPWIPELMTYTLKYLVDTGTLIM
FT KITSRGATEDVLQQLEDLSKNFTYVRVCNLNAVTFSSELWIVFANKRKPPVQGWTSHEL
FT RAELRKHWYSMTRSIIQPLMRSRQSVFRYSPK"
FT gene 361..7872
FT /gene="ORF1a"
FT CDS 361..7872
FT /codon_start=1
FT /gene="ORF1a"
FT /product="pp1a polyprotein"
FT /note="non-structural polyprotein; contains two
FT transmembrane domains that flank a 3C-like proteinase;
FT cleavage sites not determined"
FT /db_xref="UniProtKB/TrEMBL:G0KUE2"
FT /protein_id="AEK87148.1"
FT /translation="MTYHDYALKDNVVLKRDQKLALDNFVTEVIQFWTPILTTLLLLAY
FT ALRKIMHNPFVGPISDNPLKRALQWIIFVFTRRNLYYQTPVFARDESRLNIFLHNDFAR
FT LDRNTLNGYCKLCNLYGHNHTDKHNPTIDALVLAKTCKLLRYNDKVTKPLAYTVHNIRA
FT YEKNTKTFADTFGTTTTNIPTKYALAPKKAVSELTTIESNLGPIYVNNTIAYPHLGFIA
FT YDNKQHLQELLANVTVVLDTIMVYTQYELDDATMNIRKSDITLSFVNDFDLTNALTNEL
FT KDPRTPWLLKAKKTSNKASEQDDDTEAEDTQKNKRKGKQLKPQTQLLQHTLAKQTKFAR
FT RQPFLSFGPTYMTLLCLISIMSPTYATVCTTYEPLDQADLYCNNLQNLTIEKYHAYANY
FT EQLNRQCFSIDGAEFKDLIRLSVSNALNLNNVIKPVPRDDYILKAFSNALPLNTHVLSD
FT YNTILDLQILMQFYNLNGSNVLYTETYSESEDYAGKVVQLLAQGTGGICKAPACILFTG
FT LATTVTDVEVKVTERLTKRIKHQEHGKPLHINPSCRKTCYCMYKPKVKPEPVETVKYAP
FT QAEFYTQLRYFQNHELQMYDDFEMGVLRYNNYTLNTFIYSNETCILTRGVHCVYNPEHF
FT AITRVYNNLGNYLECGVNQEFCESLQQEFMFNEPQLVITESMAVEAPTQYHKICDNHYT
FT SLQVKYPLIEKLFWSNFXVSVNRALAVKEPATFIIVHDTVAIIKTVIADIIEIMEKCYD
FT STAIKLTHHDFNKLEYMDDYSQILVKYKPLLEQHKIMLIEDIDLITVPAARALFSIFDT
FT YTPLVNGVFVLGTLNHKRYNETLAFNYYNDQTPTFYVDGILNANWQLLEDHTRQPLITR
FT VADNVHTILARQLVTPPAKIAATMPKVPTLNNTSKVINTLTHYSAHLLQVVGSDSTQAY
FT TYINSSVHNITDYVNDSVHNITDYVYTVYNSTKNHIVTRYNNMLMAAYDIQLNFYNQYP
FT LRQDFYHKGIRAFDLGQVCDFLHHTDTIVLYQDCINQKLDTIYVIKLRYGQNANGYHMY
FT PLKQPHTKQTIYELSDAIGFVYKDSKYNYFRTLFTNPGEYVLTIRENYLEYCKSDFSPT
FT PAFAPDATLQCYAYITGIQVIDNFIAEFGLFLMLYTAALIIILALAITIRDSTMMMFLK
FT LSIIFAYTFGPLLLTPKVFGSYIFVSLYNMLPYTSNTSYGCLLMMGALAITVIDLFAYM
FT TQRYRSEFTKNILQLATLLFEIVAITKYILIPYVCTSYGLVLTIIVSYVAYRYIQSQRP
FT NYLKATVSNATAHADWVAYRNTTREKTDEAAKSNLSKIINTSIADINKDQFLECVYLAA
FT CHRATVAASTYNPKHYLHIPNYNTKIMFARDNELMNYSVLSTDLKNKSAASNPSISHIV
FT LELPVAINPLIKYTTKTSVSSLRGAVVNGYIYIQRHLFGSKKQEFEACYNNGKGLLNCK
FT NLDRSKYDIDSAELIGTLIRIPLHDKQSIPHISLHPDPLSYNGPVTLYLSRYDTELNKD
FT VLCVHTGFMSEGHHDIKTVFGDCGGMLFDPKGRLLGLHCAGSDDVVFMDTTTGKSNIWT
FT SYKLQHPSEIMITLNNEINLPNPTNYDFETTKVVYQHPLRNVCATLETLQHLTNKTNVK
FT LPYDPRLLSDFNITAEQYAQYGYNIDYNNFINNFNRYTTTTIGTKSFETCIKYGLMDNK
FT KVEYYNQTATIFNPPEHSSSGFDNTMDVLYVFVYMFTHTHPAFYIAAACVFCLFFVKMN
FT KYLKMILSSIIFTIPHIYVNYYYGLVYMPLKWRKQITALAIRYNPYTAVALRYNKNLNI
FT AKDVAKELGTPKNLCTHLSTLLKCIKPYAAFNDLSQVINNVDDLMANWANTYNAEELLK
FT QYIDEIYKLYPILFVVFEKIENYEDQIKTILSYISDTGEFDLNGFEIHFDEKEHTTNII
FT DTNVEDIHDKLMAEKASLIALKNMNLEFDIETINNANIGELVRYLIISSTPDTLDRDLL
FT SRTTELLVRRIHQLRDDSEHNENLITLLSEIYKHKDFLTASHLTSNLRDRNYIMNNLVR
FT VIALFNKQINMQVAQKQYEARRIEELRKKESKQIMEQNNRIRKMQRQNQNIASAIVHMV
FT HACFANRFMLQNEAQKIMKALLGSDLELDPTDAEMQYYTAYRNGQVLTNQAIVTNFTTL
FT TTILWTGNGYQTVPSMCGAQEFTCTATHKHGYFNCTMEIKDAWYKHAEECTKCKSYYRT
FT NKHPRCGAIYDTTVKRYPTLSNFIARYRSCPACMPCTQCLSRREPGCESASYHIADTAH
FT YQNQAYLTPINIKPDNLEYNFVDINNGDVNAIYNGRIWLMRRTTAITPPPARYRNITNL
FT KLKQTDPEGYYYISEVCPTDLAILNAMINQIQLKLLDRTVLNNENHVENANTIQFNNPL
FT NDTTLDDLRTKHKHLLVMKLRPDSEHHFIEVLNFVRMNNLPIFIAHVTYADNTVNHATI
FT YINYLQAWRNEILDDVTTTCDILEKIIKHPLDFQGGLVL"
FT gene 15660..18359
FT /gene="ORF2a"
FT CDS 15660..18359
FT /codon_start=1
FT /gene="ORF2a"
FT /product="putative spike protein"
FT /note="putative structural protein; S; p2a; contains
FT stretches of hydrophobic residues, potential N-linked
FT glycosylation signals (NXS/T), and cysteine residues at
FT locations flanked by hydrophobic regions"
FT /db_xref="UniProtKB/TrEMBL:G1K4K8"
FT /protein_id="ABG02427.2"
FT /translation="MINSKCPLLFQTTTTSMPNQAHRNRPGPPIILKPETQWHQNHNAN
FT ASSSSKLHRSPLNNHPKDNQNLNATNLMLQHLPSLRSRKQLQQAPTTPKPAVNFTKSEK
FT NSMLETTWDGGKMKRLDQQSSSSLNLKWHPELTKSTIAINLRILTISSILSVLAYLFKT
FT QPLSAMQFTTTKNSLLKKKMSTFVNYLMPSMQSSCVHVKHLTLVPFLLLLLMLPNANCS
FT TRIDLSTHHIVSYNKPLIVVDDFLKTTLKYNFGTDLYNSAINYKTSFEQLLNNFKTPYQ
FT PLVDAFRVLFSYLGIEPVAHPFKDYFNADSPCPLQTTTTTGDVTTIGEHFQEILDDGNL
FT ELEPLASYWLRHTEDIFVYTRSQLWAFICPSEFAQASIFLPNYTEAIYNVSTAFCKTVY
FT YDTPTNAFNAEICNKVNFITPAKAQKRSKRWDSSYVCGWPLVSSAAKVLGGECTTNIDI
FT GTLKSSLNAIQNFSYANTELIHDLQSQLSVVNARTNLHYNQLQQLVTAINDHQAKYVND
FT INNLINHIKNTTNTMENRINVNSIIMSYTNSLFRVYQNIVDYRFAYIETLSSIQQHYHF
FT PSEHLHAFNVPLQAKLREHGFSIPIIDSNIPYSYGKVRYLNVTGINFYDLEFDIYIPVI
FT KLIHEKDSKYYHSTLSALPIGINTTLVTYNTYQGNAICTDTYCLESPINGFCREGESYW
FT YCGQHYIRTLHKITSLYTKPTKFTXSAMFIPPHTMYFVHNTTYSLNYGSSLQALAGSIL
FT MLTCNSTVQIPGYNFNSNDFVSCTDMNVNNVFIHPSLRVNDANFYIPPTRVDLLEKLYK
FT RDIIPILNHIQKANDITIDTTADEELKQQYETLKSDFNAKYDALNIENRRIHALINSMH
FT SIQSEPSYILYMVIAVIVFIVLKFLRII"
FT gene 15674..16312
FT /gene="ORF2b"
FT CDS 15674..16312
FT /codon_start=1
FT /gene="ORF2b"
FT /product="putative nucleocapsid protein"
FT /note="putative structural protein; N; p2b; highly
FT hydrophilic, enriched with proline and acidic residues as
FT well as basic residues relative to other virion proteins"
FT /db_xref="GOA:G1K4K7"
FT /db_xref="UniProtKB/TrEMBL:G1K4K7"
FT /protein_id="ABG02426.1"
FT /translation="MPATISNNNNVNAQPGTSKQTRPTNNSKTGNAMAPKPQRQRKQQQ
FT QASSQSPKQPPKGQPKPKRNQPNAAASAQPKVKKAIATGPNYTETSGKLYKIGKEFDAR
FT NHMGWRKNEKTGSTVQFLFKPKMASRIDQVYYRNQFEDPDHFIHTFGVGVFVQDSTLER
FT NAIYNHQKLTTEEKDEYVRKLSDAFNAILLRTRQAFDSGSLPALTVDAA"
FT gene 18402..18878
FT /gene="ORF3"
FT CDS 18402..18878
FT /codon_start=1
FT /gene="ORF3"
FT /product="putative small glycoprotein"
FT /note="putative structural protein; p3; contains stretches
FT of hydrophobic residues, potential N-linked glycosylation
FT signals (NXS/T), and cysteine residues at locations flanked
FT by hydrophobic regions; similar to membrane (M) protein of
FT Nidoviruses"
FT /db_xref="UniProtKB/TrEMBL:G1K4K9"
FT /protein_id="ABG02428.1"
FT /translation="MIVKITILFSILAVAMAADTTPEVVSPSTKLCEASSTQHCTAMGY
FT DYCKSISGVQSCYCSHVQNFTSVMDVIDKNLKCSITSSKYLDPHYWFRDLLAASVTLLV
FT IFTAITWAYLIPTYAKIDAIYTNSTSKAKQLHYIPLLPRQSDGSYTLLPGRSYK"
FT gene 18754..19104
FT /gene="ORF4"
FT CDS 18754..19104
FT /codon_start=1
FT /gene="ORF4"
FT /product="putative small glycoprotein"
FT /note="putative structural protein; putative p4; contains
FT stretches of hydrophobic residues, potential N-linked
FT glycosylation signals (NXS/T), and cysteine residues at
FT locations flanked by hydrophobic regions; similar to
FT membrane (M) protein of Nidoviruses"
FT /db_xref="UniProtKB/TrEMBL:G1K4L0"
FT /protein_id="ABG02429.1"
FT /translation="MLKSMLFIRTQPLKQNNCTTSPYYHGSLTAVIRSSPDDRINKLQS
FT RIQSENRLRWLFNNFSNCITCSYKLATFLYYIFTAIYYGFCLIMLYILWIYFTQLTNQI
FT KLVYHNFSNPYN"
XX
SQ Sequence 20192 BP; 6933 A; 4432 C; 3151 G; 5674 T; 2 other;
dq458789 Length: 20192 04-MAY-2012 Type: N Check: 5547 ..
1 actaaagaaa cttttgtttt ctcccataat actactacta caagtatcaa
51 ccccgtccgt ctgtcagaga cgctaaactc tgataactaa acctagccac
101 atcagttgct taaagaacct cttgagacac tctcccactt aacatctttt
151 aggaatcttc gatgctacaa caacttggct agtaaacaat aaatccgcat
201 acttcacagt tgtaagaggc cataggtcca aactttgaaa ggtttgtttc
251 tattgtgtca aacacttaga ttaacagagg ctatattagt gctcatcacg
301 ttaacaaagt aatcttgcgc aatagtatga gtttgttgta aaacgtcttg
351 atacgacacc atgacatacc acgattacgc tcttaaggac aatgttgtcc
401 tcaagagaga tcaaaaacta gctttggaca actttgtcac cgaagttatc
451 caattctgga cccccattct gaccacgcta ctcttgcttg cctatgcact
501 caggaaaatc atgcataatc cattcgtcgg acccatttcg gataatcctc
551 tgaaacgtgc cctacaatgg atcatctttg tgttcactcg ccgaaacctg
601 tattatcaaa caccagtttt tgcccgtgac gagtctcgcc taaacatttt
651 tctccacaat gattttgcac gcttggacag aaacaccctt aatggatatt
701 gtaaactatg taatctatat ggacataatc acacagataa acataatcct
751 accatagacg cactagtttt agctaaaact tgtaaattac tccgctataa
801 tgataaggta actaaaccat tggcctacac tgtacataac atacgggctt
851 atgagaagaa taccaaaaca tttgcagata cctttggtac aaccactaca
901 aatatcccaa ctaaatatgc tctagcacct aagaaagcag taagcgaact
951 taccactatt gaatctaacc taggacctat ttacgttaat aacactatag
1001 cttacccaca tcttggcttt attgcttatg ataataaaca acacctccag
1051 gaactcctgg ctaatgtaac tgtagtttta gacacaatta tggtttatac
1101 acaatacgag ctagatgacg ccactatgaa catacgcaaa agtgacataa
1151 cacttagttt tgtaaatgac tttgatttga ctaatgcctt aaccaacgag
1201 ctcaaagatc ctcgtacacc ttggttgcta aaagctaaga aaacatcaaa
1251 caaagcatca gaacaagatg atgatacaga agctgaagac acccagaaaa
1301 ataaacgcaa gggaaaacaa ttaaaaccac aaacacagct actgcaacat
1351 acacttgcta aacaaacaaa attcgctcgt cgccaaccgt ttttatcatt
1401 cggtcccact tacatgacac ttctatgtct tatttctatt atgagcccta
1451 cttacgctac ggtttgcaca acttatgaac cacttgatca agcagatctg
1501 tactgtaaca acttgcaaaa cctaaccatc gaaaaatacc acgcttatgc
1551 aaattatgaa caattgaata gacaatgttt ttccatcgat ggtgctgagt
1601 ttaaggactt gatacgttta tcggtatcca atgctctaaa tttgaacaac
1651 gtgatcaaac ctgtacccag ggatgattac atacttaagg ctttctccaa
1701 tgccctacca ctgaacacac acgtattatc cgactacaac accattttgg
1751 acttgcaaat acttatgcaa ttctacaact taaatggatc aaacgttttg
1801 tatactgaaa cttattcaga gagtgaggat tatgcaggta aggtcgtaca
1851 attgttggcc caaggtacag gaggcatttg taaagcacct gcttgcattc
1901 tattcaccgg tcttgccacc actgttacag atgtggaggt taaggtcact
1951 gagcgcttaa ctaaacgtat aaaacatcag gaacacggaa agccactaca
2001 tatcaacccc tcctgtcgta agacatgtta ttgcatgtat aaacccaaag
2051 ttaaaccaga acccgtagaa acagtcaaat atgcaccaca ggctgaattc
2101 tatacacaac ttcgctattt ccaaaaccat gaactacaga tgtatgatga
2151 ttttgaaatg ggtgtgctac ggtataacaa ttacacactc aatacattca
2201 tctactctaa cgaaacctgc attctaactc gtggtgtaca ttgtgtttac
2251 aaccctgaac actttgccat cacacgtgtt tacaacaatt taggaaatta
2301 cctagagtgc ggtgtaaatc aggaattttg tgaatcattg caacaggaat
2351 ttatgtttaa cgaaccacaa ttagtcatta ctgaaagtat ggctgttgaa
2401 gcccccacac aatatcacaa aatttgtgat aatcactaca catcattgca
2451 agtaaaatac cccttgattg aaaaattatt ttggagtaac ttchatgtta
2501 gtgtaaaccg cgctttagct gttaaagaac ccgccacgtt cataatagtc
2551 cacgatacag tagcaatcat caaaacggta attgctgaca ttattgaaat
2601 tatggaaaaa tgttatgaca gcacagctat aaagctcaca catcacgact
2651 tcaacaaact ggaatatatg gatgattata gccagattct agtaaagtat
2701 aagccacttt tggaacaaca taagattatg cttattgaag acatcgactt
2751 aattacagta ccagctgctc gtgccttatt ttctattttt gatacttaca
2801 cacccttagt gaatggagtc tttgttttgg gcacgcttaa ccataagcgt
2851 tacaatgaaa cattggcttt caattactat aacgatcaaa cacctacatt
2901 ttatgtggat ggtatcttaa atgctaattg gcagctcctg gaagaccaca
2951 ctcgacaacc gttaattaca cgcgtcgctg ataatgtaca caccattctg
3001 gctagacaac tcgtaacacc acctgctaaa attgcagcca ctatgcctaa
3051 agtacccact ttgaataaca cctccaaggt tatcaacacg ctaacacatt
3101 attccgccca cctcttacaa gttgtaggaa gtgacagcac acaagcatat
3151 acctatatta acagttcggt gcataatatc accgattatg ttaatgactc
3201 agtacataac attacagatt atgtatacac ggtgtataac tccacgaaga
3251 accacatagt tacacgctac aacaatatgc ttatggctgc ttatgacata
3301 caactgaatt tttacaatca gtaccctttg cgtcaagact tttaccataa
3351 aggtattcgt gcgtttgatc taggtcaagt ttgcgatttc cttcaccaca
3401 ctgacacaat tgttttgtat caggattgta taaatcaaaa gttagacacg
3451 atttatgtca taaaattgcg ttacggacaa aatgctaatg gttaccacat
3501 gtatcccctt aaacaaccac acactaaaca gaccatctat gaattgagcg
3551 acgccattgg cttcgtttat aaagatagca aatacaacta ctttagaaca
3601 ctgtttacga atccagggga atatgtttta acaattcgcg aaaattactt
3651 ggaatactgc aaatcagact ttagcccaac accagctttc gctccagatg
3701 caactcttca atgctatgca tacatcacag gtatacaggt tattgacaat
3751 tttattgctg aatttggttt gtttctcatg ttatacacag ctgccttaat
3801 aattattttg gcgctagcta ttacaattcg agatagcact atgatgatgt
3851 ttttaaaact gtctataatt tttgcctaca catttggacc actgttgcta
3901 acacctaaag tgttcggctc atatatcttt gtttcactct acaacatgct
3951 accatataca agtaacacca gttatggttg tttacttatg atgggggccc
4001 tcgcaattac agtcattgac ttatttgcat acatgacaca aaggtaccgc
4051 tcagaattta ccaagaacat cttgcaattg gccacattac tttttgaaat
4101 tgtggctatc actaagtata ttttgattcc gtacgtctgc accagctatg
4151 gtctagtcct aacgatcata gttagctacg ttgcataccg ttacatacaa
4201 tcacaacgac caaactacct aaaagctaca gtctctaatg ctactgcaca
4251 tgcagactgg gttgcttaca gaaatacaac gcgtgagaaa acggatgaag
4301 ctgcgaaatc aaatttgagt aagatcatta acactagcat tgccgatata
4351 aataaggacc agttcctaga atgcgtatat ctggctgctt gccaccgtgc
4401 cactgtcgcc gcttcaactt ataaccccaa gcactatttg cacataccaa
4451 actataatac taaaattatg tttgcgcgtg acaatgaatt gatgaactat
4501 tcagtgctat caaccgattt aaagaacaag agcgcagcat caaacccctc
4551 aatttcacat atcgttcttg aactgcctgt tgccattaac cctctaatta
4601 agtacactac taaaacaagt gtatcaagtc tacgaggagc agtagtcaat
4651 ggatatattt atattcagcg ccatctgttc ggtagtaaga aacaagaatt
4701 cgaggcatgt tataataatg gtaaggggct tctaaattgt aagaatctgg
4751 accgctctaa atatgacatt gattcagcag aattaatagg tacattaatt
4801 agaatcccac tacacgacaa acaaagtatc ccacatatca gcttacatcc
4851 agatccatta agttataatg gaccggttac cctctacttg tcacgttacg
4901 acacggaact aaacaaagat gtactttgtg tacatactgg tttcatgtca
4951 gaaggacacc acgatattaa gactgtgttt ggcgattgtg gaggtatgct
5001 atttgacccc aaaggcagat tattaggctt gcattgcgct ggttctgatg
5051 atgttgtctt tatggataca accacaggaa aatctaacat ttggactagt
5101 tacaaattgc aacacccatc tgaaattatg ataactttga ataatgaaat
5151 caatttgccg aatccgacta attatgattt cgagactact aaggttgttt
5201 atcaacaccc tttgcgtaac gtatgtgcca ctctagaaac actccagcat
5251 ttaactaaca agactaacgt taaattgcca tatgacccac gtttgttgtc
5301 agatttcaac attactgctg aacaatatgc ccagtatggc tacaacattg
5351 actataacaa tttcatcaac aactttaatc gctacacaac tacaactata
5401 gggaccaaaa gcttcgaaac ctgtataaag tacggactca tggacaataa
5451 gaaagtcgaa tattacaacc aaactgctac catcttcaat cctccagagc
5501 atagttctag tggttttgat aacactatgg atgtgttgta tgtgtttgtg
5551 tatatgttta cacacacaca cccagccttc tatatagccg ctgcttgtgt
5601 attttgtctt ttctttgtca aaatgaataa gtaccttaaa atgatcctta
5651 gctccatcat ctttacaatc ccccacattt acgtcaatta ttattatggc
5701 ttggtttaca tgccattgaa atggcgtaag caaatcactg ctttagccat
5751 ccgctacaat ccctacacgg ctgtggcact gcgctataat aagaatttga
5801 acattgcgaa ggatgtagca aaggaactcg gtacccctaa aaatttgtgt
5851 acgcatttat caacgctctt gaaatgtatt aaaccctacg ccgcctttaa
5901 cgacctcagt caggtgatca acaatgttga tgatctaatg gctaattggg
5951 ctaatacata taatgccgaa gagcttctta aacaatacat cgatgaaatc
6001 tacaagttgt acccaatact attcgttgtt ttcgagaaga ttgagaatta
6051 tgaagatcag attaaaacta ttttatcata tattagcgat actggtgaat
6101 tcgatctgaa tggctttgaa atccattttg atgagaagga acacacgact
6151 aacatcatag acacaaatgt tgaggacata catgacaagt tgatggctga
6201 aaaagctagt ttgatagctc ttaagaatat gaacctagag ttcgatattg
6251 aaaccattaa caatgccaac attggtgaac tcgtacgcta tttgataatt
6301 agttctactc cagacacact cgatcgcgac ttgctatcca ggaccactga
6351 attactggtt agacgtatac atcagttacg tgatgactcc gaacacaacg
6401 aaaacctgat cacactattg tcggaaattt acaaacataa agacttctta
6451 acagcatctc atttgacctc taaccttcgt gatcgtaatt acatcatgaa
6501 caatctggta cgcgtgatag ctttgtttaa taaacaaata aacatgcaag
6551 tagcccagaa acagtatgag gcgcgccgca ttgaagaatt gcgtaagaaa
6601 gaatctaaac aaattatgga acaaaacaac cgtatccgta aaatgcaacg
6651 tcaaaatcag aacattgcta gtgcaatagt tcatatggtt catgcatgct
6701 tcgctaaccg ctttatgtta caaaatgaag cccagaagat aatgaaagcc
6751 cttctcggct ctgacctgga attggacccc actgacgctg aaatgcaata
6801 ttacacagca tatcgcaacg gtcaagtact aactaatcaa gcaattgtta
6851 ccaatttcac cacactcacg acaatactat ggacaggtaa tggttatcaa
6901 accgtaccca gtatgtgtgg cgctcaagaa tttacttgca ctgctacgca
6951 caaacatggt tattttaatt gcaccatgga gattaaggat gcttggtata
7001 agcatgctga agaatgtaca aaatgtaaga gttactaccg aacaaataaa
7051 catccacgtt gtggtgccat ttatgacact accgtgaaac gttatccaac
7101 actcagtaac ttcattgctc gttaccgtag ctgtccagct tgtatgcctt
7151 gtacacaatg tttgtcacgc cgtgaacctg gttgtgaaag tgccagttat
7201 catattgctg acacagcaca ttatcagaat caagcatatt taacacctat
7251 aaatatcaag ccagacaacc tcgaatacaa cttcgttgat atcaacaatg
7301 gagatgtaaa cgcaatatac aatggtcgca tatggttaat gagacgtaca
7351 acagcaatta caccaccacc agcacgttac cgtaacatca ctaatctcaa
7401 actcaaacag accgatcctg aaggctatta ctacatatct gaagtatgtc
7451 ccactgactt ggctattttg aacgccatga tcaatcaaat tcaactaaag
7501 cttttggatc gtactgtcct aaacaatgaa aaccacgtgg aaaatgccaa
7551 cactatacaa tttaacaacc cactaaatga cacgacacta gacgatttgc
7601 gcactaaaca taagcattta ttggttatga aattgagacc cgactcggaa
7651 catcacttca ttgaggtttt gaactttgtt agaatgaaca atttaccaat
7701 atttattgcc cacgtaactt acgcagacaa caccgttaat catgccacca
7751 tatatataaa ttatttgcag gcatggcgca atgaaattct tgacgatgtg
7801 actacaacat gtgatatctt ggagaaaatc attaagcacc ctttggattt
7851 tcaaggtggg ctcgtactct aagcaggaac agtaatgtag cccgatatca
7901 ccaactatgt accaccactg atgctggtat aagacacacc atcgatattt
7951 cctgcaacaa aacttccatc agttatatcg atgaagtaaa caacaacgtc
8001 aatgtcaaaa taaaatcgca cattgtaaag gaacacaaaa tttatgaaat
8051 gttaattaac caatatccta acctctttct tattgaacac aagctggtaa
8101 acgatatcat cccacattta ttacgctata atatgacagc acttagtttt
8151 gctgacctat atggcttaat taaagaggaa aattggcacc ctatttatga
8201 tactctacca caagtgacat atcataaaat taatgacgat ctactactaa
8251 aaattaaatc gcacactcca tctccacagc acacctgttg tatgctatgc
8301 cgtcgttttc tagttgaatt cggcttgctt ttgcataaac taaattataa
8351 ggtattcgaa acaacacgcg ctatgttaac tcattatgat tttgtattga
8401 ccgcagacaa tgttgatcta aatggtatac tggattttga agattacaaa
8451 ctaagaaaat gtacagtagc atatgatgtt aaatcacaat tgcgcattat
8501 gcaaccttac taccacgcat tgtactcttt ctatgaacat acaggtatgt
8551 acttcattag ccaacccatc tacaactcta tagtggatcc cagtctcgat
8601 ctaattcaac agtttgaatc ggcagttgag gctactcgaa atctaccatt
8651 ggacgctaag tttgatgaca caccattata tagaccaact atacaacatc
8701 tcgccgaata tctaaaattg aacatatatg ctatggagcc ggaacctcta
8751 tggaattgct acgatactat ggattgtccg caaatagaat taccgggaat
8801 agacaacgct attacaagca taattacaaa accaacaaga cctctctcgg
8851 aatacatcga attgaaccac accacagtta aaaattttga cggtgacata
8901 tactgtaaag taaatcacaa cgaaattaac aaccttcagg atatacttta
8951 ctgtatgccc acagacgcta caattcatga actatacata gttgaccacc
9001 cctacgagtt agaatcacac aaccgcatgc tacgcactag tttaaatatt
9051 tggcttcata atctttatga tgctaatgta aacttatcac actttgattc
9101 aataaattat gacaaaacac gcaaagctag tttccccatt gttggtactg
9151 taccagcaat aacattgcgc gactgcgaaa tttgtcaaga cgaaatacca
9201 gatgacctga aggatgtcta tgattttgga tcttgtgtgc atgcaaaagc
9251 ccagttgtct gattatacaa caccacgtaa gttgaaccca ttgatagaat
9301 ttgatccagc actacttcgt catggagaat ttctaccaaa taatgattac
9351 gcatacacta tgaagactaa accagaccac ttaattgatc gagaactaaa
9401 agattacatc gattcaactg gtttaacagc tttaatacca ccactggaca
9451 tcaaccccgc tgtacatgac cctgaaacaa catattcaag ttcgtattat
9501 ataaaaacac catcagaaac atctatacga caagaccttg aattgtttaa
9551 tcagaacacc gctggctcag tttcacctac agtcttttta atggctatag
9601 aattattaca tcaactgtta actgaagaaa tttctgcttc agacggcaag
9651 ccaaactgtc ctatggtgcc ctcagaagta cctgtacgca acaaacataa
9701 atcagccggt actccatacc gaaaatttgg tgattcagaa ttcatgcgcg
9751 aattatatgg taactatcgt gacgctattg tttatcataa gcgccattct
9801 gcagatcaac agctaacact aactatcaat aaggttgccc cttcgaaaaa
9851 tcatcgcgat cgtacaatcc tcgccattag tataaataaa tcagaaccag
9901 gacgctcact ttatcgttgg aatttggata aaataaaata cacctccagt
9951 ttaggtggtc caattctaat cggttttaca gcacaatacg gtggttggga
10001 taaactctat aaatatcttt ataaaaattc ccccgcagac aacccagaca
10051 cagcagaaca tgcagtgctt ggtggaaagg attatccaaa atgggatcgc
10101 cgtatttcta acatgctaca actaacaact acaactgttt tatacagttt
10151 gatagatcca aacactcaga gaaaactcaa taacgctaca cccgcacaaa
10201 cttggcacga atatatggct gaaacaacac aagtcttata tgactatctc
10251 gtctttggca atgaattata tcagaaacct ggaggtgtaa cttcaggtaa
10301 tagtcgcaca gctgatggca attcactact tcacttattg attgactttt
10351 atgctataat tagtcaattg attcaatcaa caccagaaaa cgtacatcta
10401 gaagtgaatt tacgtaacgc tttgtgtaaa acagttttta ccagaatacc
10451 ctcggattac atagattcaa gctgtgtaac acttagaaac actgatacat
10501 tacacacaat tcgccgacgc gtagccaaag gagcttattt aagcgacgac
10551 ggtttaatcg ttatagaccc acgcattata aggtatgacg actttatgtc
10601 tgttagtcac cttattagcc attacatgat agcacaaaac aagcacaaat
10651 atcacatcga cgctatccaa cgctatgcaa gagaattcct atcacaagac
10701 actattaagt ttggtgatat ggtttaccca atacctgagt ttggacgcat
10751 gtacaccgca atgctcctga gtgacaataa aaacacttta gacccacaaa
10801 ttaatatcac gcgtttattg gcactatttt catatttata tatatactat
10851 ttcaagtatg aagatcaacc cactcatcca atattaaaat ttcttgatgc
10901 gctaagaacc tacatagaaa ataaactgaa tacaacggat gaaattttct
10951 tagactgcat caaagttcct gatttacagg atgtagaatt tgaccttaaa
11001 aattgtgatt tatacgaaaa ttttgactac ttatgggggc ttgaccaatc
11051 aagtgcctat atggaatatc tctgtaaata caaacaccgc tatcgtaatt
11101 tatcgctgtt taaacgtcaa cttatacaac accacgaaga agctcaattg
11151 cataatgaaa ataagcttat gaataaagga aaattaatca cgtacaattg
11201 ctatgtttgt ggcgaaaatg cgtatttaac atgtgctaca tgtgaacgcg
11251 cattttgcaa tagtgcagat accaatcatg gctcacatat ggaacaacat
11301 ctacaatatt caggtcatac ctgtttatac ctaaattgta aaactgtaaa
11351 atgtcaacat tgttttacga tggacatcaa cttactatac accactggcc
11401 gtgaccacta ttgtgaatca cataagccta aaaatgccgt acgtatacta
11451 aactacaatg ctaatacaaa attaccaccg ctcctttatc tatgtgtaac
11501 agacacaaag cgtgtgacat tttatgaaca atgttacatt aattacacaa
11551 aagcacaccc gacctatgca ataagtaaag aacaatttat gagccttatt
11601 cagctgtatc tacatcaaga ttacacacta ccagttaatc aattagctaa
11651 ccggattaga gttagtctac aattgagttc atatggtgta gtcagaccat
11701 atcatcaatt aattatgcag cttacaaaat tagaaagcaa agtcttagac
11751 tcaagtgttg ttgatatacc aattacactc atcaattcac aagaaattgg
11801 gacatactac attgaaatac ctcgggaaca caaactcgac caacattcca
11851 cctattccta tctactagga actcgcgagg taagtttcac acccaattac
11901 taccgcctaa gcagcacaaa cactcatata tggcaaactg acacacaaat
11951 tccaaattat tgtactttta tacgacagcg tcgtctaaat accttaagcg
12001 ctattctacg caacacaaca caacatgtgc cagaatttac gcgtttgcta
12051 ttagaatgga atcaacaatt accgattaca gctaaaccct ttgcagaatt
12101 taaaccttca ttgaaaattc cagctcagcc gaatgtgact gacaatatta
12151 atacgctgct aaaggagctg aatgtaaaac gttttaaaat catgtttggc
12201 gggcctggta caggaaaatc tcacacacta tctattctca taaaccattt
12251 acatgagaag ggtctgcgaa ttctagtgta cacaccatca caccaatctg
12301 ccaatgcttt gctatataaa atagcaaact tgattaaaag acgcactata
12351 caaaaccccg gattagtcag aattattaca gatggcatga aagaagaaat
12401 caagccacac ccatatataa cttatcgtac aaatatgcta gacaaagacc
12451 gcatttgcgt gacaactata caaagttttt caactgtaca gcatgttaag
12501 gatgtagatt tagtaattct tgacgaattc agtttaactt cggataatta
12551 cctactaacc ggccttgcac atctaaaacc ttctacacgt gttttgtttt
12601 ctggtgaccc cagacaactt agcggtgtgg atgagattag aaaaccacta
12651 caatcacgtt ttcatacttt gattaattat tacactgaaa cctacccgcg
12701 agaagtgcat gtgttaaaat accactttag atgccaccca agtatattcc
12751 agtatttcaa ggatctgtat tatgcagata aagacatgga atgtgcgaca
12801 tctattgcag atcgtattat acgcccactg aatccaatta atacagtgca
12851 agtcagcgaa cccactttta gaaatcaagg tgtaatatta aatcaagatg
12901 aagccgataa ggtcctagaa attctagtgc ttgtaaatca aacactagca
12951 ctccattcaa gttatgaata ccaacccact attgcaatta tatgtagtta
13001 caaatcacaa cttcaaaatt ttatctcact acagcaacag aaaattcttt
13051 cagagaatgt caatttaagc actatcgatt ctgctcaagg cgacgaattc
13101 gatatagtaa tactatgtct ttcccaaatt aacaacttca cgttaaatcc
13151 taatcgattt aatgtagcaa tttcaagggc taagtcagtg ttgtttataa
13201 cagttccccc tattgacaaa aaccccgcat ttctttttaa agatgtgtac
13251 gcaactttgc ataaacacaa tttgacatac tttaagattt acaacactag
13301 tggtaaagca atactttctt tagattcacc aacaacgtta aagaccaaag
13351 cagagaaaat gacatataca aatgtacgac atctcgacaa gaacacccat
13401 actatgcaac gaaaatttcc aatgaacata gttatggacg actgtatatg
13451 ctttgatgct gaattcttaa accctagaga caacgtacaa gaaccagtaa
13501 tgctttcata tggtttctcc agtaaatatg gcaaacgacg tatagcaggc
13551 attccagtgc gctatataaa agacaaattc aacagaatag tcccacaaaa
13601 gtacaattac aaggataata acaaaccatt aacatctact tacacctgcg
13651 actggatgag gaaacaacac cctgaacaat ataaacacct cctaacatca
13701 gttatgcaag gaatccgtaa tgacaccact gtagatctta agccactgct
13751 taatttttgt gtagataaca tgcatgtgaa acctgttatc gtcacatggt
13801 ctggcgctag tgaccactgt ttcttaaaag cacacacgct atatccagac
13851 attgcaacag tatgtaatat aactatacgc tgcacgtcac aaccaattta
13901 tgcttcacca caaggccgac acacttatta cctctgccaa tatcacgcac
13951 accaacttaa agaccatgtt aatataactc attttgtaaa tctcgagatc
14001 atagatctca aggtagaccg caatcaatat acagatgaga gaaccttaag
14051 agtgtatcac aacgattact taaagctaac attggacctc gataatgtag
14101 cctctaatag tctaactgac tgtcatacta gatactgcag aaccgtacat
14151 gcacccgcaa caccacatga cccacttgat gatgccatca tgacacagtg
14201 tatttatcaa tcttttgtat tgtcacatct tgaaaattta gcatacgaac
14251 cacaagctaa cctcaaggcg tttacatcta tggactaccg ccttaaaaat
14301 ttcaatcctg agatgtgtaa actaagacgc gaattacaaa aagtttggta
14351 ctcacaatac acaaacacta acaaaacaca ctgcaatatg ggctgcggaa
14401 aagaaccatt acaacaagct cttcataaca tcgacgtatt acaaggtaaa
14451 tcaaatccac agaataacat gaacacccac acctgtgatt ctgaagagca
14501 tatatatttt gatagtcact ggtataaagc tggtggtttt acaaagcctt
14551 cgtacatttt tagcgacatt aataaagaac actattacaa actcgggacc
14601 actggcttat gtctatactt aaatagtaaa tacgccaaat acgtacatga
14651 ataccgcact gttagcggta acgacgtttt taaatcacta tattcaccat
14701 attgtgattt aggcagaaaa ccacaccaag cagaaataga acctagctgc
14751 tcaatacccg attgtataat cacatctaac ataggcgaac gttttcaaac
14801 tttagtttgt aacgtacaca aagatcaaat ggacattatt agcaaaatct
14851 cacaagccac taagtatggg tatcaattta tctatactgg taaaactttg
14901 ttaaataacc acgcagcctt atctaaagct ccacataatt gggatcatct
14951 aacattagaa attcctggct ataacacacg taaacagcat tccagtcata
15001 tgactactaa agctttaggt atactacata tactacaaga tagtatgtta
15051 tatacaaacc gtaaaacact gaatcctaac ttaccggtta ttttacctgg
15101 ttcggctagt tatctcggtg atacagtact tgctaatgaa atggctaaaa
15151 ccctcaaaca aacaaaattc atacacattg atccacgcct aaaaatagat
15201 aacaacacaa cacaccacag aaaaacacta atggaaatgc tagacatagg
15251 ttatacaacg gaattaataa tttcagacat ccatgataat aaaaatccat
15301 ggattcccga gttaatgaca tacactttaa agtaccttgt agatactgga
15351 accctcatta tgaaaatcac aagtcgcgga gcgactgaag acgtactaca
15401 acaacttgag gacctttcta aaaattttac atacgtaaga gtgtgtaatt
15451 taaatgctgt aactttttcc tcagaattat ggatagtctt cgcgaataag
15501 cgtaagccac ccgtacaagg ctggacatca catgaactga gggctgagtt
15551 acgtaagcat tggtattcta tgacacgcag tataattcaa cctctaatgc
15601 gttctagaca aagtgtattc agatactctc ccaaataacc cactagttaa
15651 tcccatttta tgattaactc aaaatgcccg ctactatttc aaacaacaac
15701 aacgtcaatg cccaaccagg cacatcgaaa cagacccggc ccaccaataa
15751 ttctaaaacc ggaaacgcaa tggcaccaaa accacaacgc caacgcaagc
15801 agcagcagca agcttcatcg cagtccccta aacaaccacc caaaggacaa
15851 ccaaaaccta aacgcaacca acctaatgct gcagcatctg cccagcctaa
15901 ggtcaagaaa gcaattgcaa caggccccaa ctacaccgaa accagcggta
15951 aactttacaa aatcggaaaa gaattcgatg ctagaaacca catgggatgg
16001 aggaaaaatg aaaagactgg atcaacagtc cagttcctct ttaaacctaa
16051 aatggcatcc cgaattgacc aagtctacta tcgcaatcaa tttgaggatc
16101 ctgaccattt catccatact ttcggtgttg gcgtatttgt tcaagactca
16151 acccttgagc gcaatgcaat ttacaaccac caaaaactca ctactgaaga
16201 aaaagatgag tacgttcgta aactatctga tgccttcaat gcaatcctcc
16251 tgcgtacacg tcaagcattt gactctggtt cccttcctgc tcttactgtt
16301 gatgctgcct aatgctaact gctctacccg tatagatttg agcacacacc
16351 acatagtttc atataacaag ccacttatag tggtagatga ttttctgaag
16401 acaactttaa aatataattt tggcactgat ttgtataata gtgctataaa
16451 ttataaaacc tctttcgagc aacttctgaa taactttaaa acaccttacc
16501 aaccacttgt tgacgccttc cgcgttttat ttagttattt aggtatagaa
16551 cccgtagccc atccattcaa ggattacttc aatgctgatt ccccttgccc
16601 cttgcaaact acaacaacca ctggcgatgt aaccactata ggtgaacatt
16651 tccaagaaat cttggatgat ggtaatttag aattggaacc actagctagt
16701 tattggctta gacatacaga agatattttt gtatacacac gatcacaact
16751 atgggccttt atatgtcctt ctgaatttgc acaagctagt atatttttac
16801 ccaattatac tgaagccatt tataatgtat caaccgcttt ttgtaaaact
16851 gtctattatg acacccctac aaatgccttc aatgctgaaa tttgtaataa
16901 agttaatttc atcacaccag ccaaagcaca aaaacgcagc aaacgttggg
16951 attcttccta cgtttgtggc tggccacttg tatctagcgc cgctaaagtg
17001 ctgggaggtg aatgtacaac taacatcgac attggtactt taaaatcaag
17051 tctaaatgct attcaaaatt tctcttatgc aaatacagaa ttaatccacg
17101 acttacaatc acagcttagt gttgtaaatg ctcgcacaaa tcttcactat
17151 aatcaacttc aacaactcgt cacagctata aatgatcatc aagctaaata
17201 tgtgaatgac ataaacaact taattaacca tattaagaat acaactaaca
17251 ctatggaaaa tcgcattaat gttaactcca ttattatgtc ttatactaat
17301 tcactctttc gtgtatatca aaatatagtt gattatcgct tcgcgtatat
17351 agaaacactt agttccatac agcaacacta tcattttccc tctgaacatc
17401 tacatgcttt taatgtccct cttcaggcta aacttcgaga gcatgggttt
17451 tcaataccta ttatagattc aaatatccct tactcttatg gtaaagttag
17501 atatctaaat gtaactggca taaattttta tgaccttgaa tttgatatct
17551 acatccctgt aataaaactc attcatgaaa aagatagtaa atactaccat
17601 tcaacgcttt cagcgctccc cattggaatt aatactacac tagtaactta
17651 caatacttat cagggtaatg ctatatgcac agatacgtat tgtcttgagt
17701 cacctattaa cggattttgc cgcgaaggtg agagttattg gtattgtggc
17751 caacattata ttagaacact tcataaaata acctctttgt ataccaaacc
17801 cacaaaattc actraaagtg ccatgtttat accaccacat actatgtact
17851 ttgtacataa taccacttat tcattaaact atggatcctc acttcaagca
17901 ctagctggtt ctatactgat gctaacttgc aactctacag tccagatccc
17951 tgggtacaac ttcaactcca acgattttgt ttcgtgcaca gatatgaatg
18001 ttaataatgt ttttatacat ccctccctac gtgtcaatga cgcaaacttc
18051 tatataccgc ccactcgagt agatctactt gaaaaacttt ataaacgcga
18101 tataatccct atactaaatc atatacaaaa agctaatgac ataactatag
18151 acaccacggc tgacgaagaa ttaaaacaac aatatgaaac acttaaaagc
18201 gatttcaatg ctaaatacga cgcattgaat atagaaaata gacgaataca
18251 cgcactaatt aatagtatgc attccataca atcggaaccc tcatatattc
18301 tatacatggt tatcgcagta atcgttttta tagtcctcaa atttcttaga
18351 atcatataat catactacta ctatcaaaat ataaaaaccc cctttttaaa
18401 tatgatagtc aaaattacca tcctttttag catcctcgct gtcgctatgg
18451 cagcagacac gaccccagaa gtggtgtcac cctccactaa gctctgtgaa
18501 gcaagttcta cacaacactg tactgcaatg ggctacgatt actgcaaaag
18551 catctctggt gtccagagtt gttactgctc ccatgtacaa aatttcacaa
18601 gcgttatgga cgtcatcgac aaaaatttga aatgctcaat tacgtctagc
18651 aaatatctag acccacacta ctggtttcgc gacctcctgg cggctagcgt
18701 cacacttttg gtcatattca ctgctattac ttgggcttat cttattccta
18751 cttatgctaa aatcgatgct atttatacga actcaacctc taaagcaaaa
18801 caattgcact acatccccct actaccacgg cagtctgacg gcagttatac
18851 gctcctcccc ggacgatcgt ataaataaac tgcaatcacg tatccagtca
18901 gaaaatcgac ttaggtggtt gtttaataac ttctctaatt gcatcacttg
18951 ttcgtataag ttagctacat ttttatatta catatttacc gcaatatact
19001 atggcttctg ccttattatg ttatatatat tatggattta ctttacccaa
19051 ttaactaacc aaattaaact tgtatatcac aactttagta atccatataa
19101 ctaggtttta aatcacatca ctaaataatg taataatacc cactaataca
19151 aaacacttta ttacatccac ataaaaccgc ctgagtttag ttaaagctgt
19201 atactttacg cctccttgga ggattctaga cagaccattc tagacagcac
19251 taatttaatc acgcgtttct taacacgcat ttaacacagc aaaatacaaa
19301 aatttttacc tatgccaaat gccaaattta ctacacacac ctaaattcac
19351 accacattga taaactaaac accacttaaa attcaaaatc actctataca
19401 ttcttaggaa agtcatgttg gaaagtacgc tagatcttgt tgttggcagg
19451 aagcgtagtg ctcatgttta tgtgtccggc ctcgggccta atcgaagatt
19501 tttatacata cggacacaag gcctggaaca agccgatcat tcaagtaaat
19551 aacatccact gaacaaaagt tgaaccactg aatgagacta atgtatagaa
19601 tagagacgca aacacactac gcgggatcga accggaaacc acacattgta
19651 ccggccattc tttgtacacc acttattata gtttagatgt cagctattat
19701 gaattgtttt gtatttctta ccactatagt ctcgcctgta agagagattg
19751 tacgcaatat acaacacact acaattctag tagacattga acagcagggt
19801 attgcccgcc taacttataa tacgccagtg tactgatcac tttccatggc
19851 ggaataacca ccaacacata ccacactatc taacattaca tacatggaca
19901 aaacacaaca gcagaaatat caacagacaa ttaagaccga acaccactgg
19951 tgagtaggtg tactgaactc cgaggagacg taggtacatg gaattgttat
20001 agactgcaga tatcaataca tatcttgtgc gagaaaatac attgcgagag
20051 acgcattgag tagtgaagca ttaggcaccc gaaaacggtt agggcttagt
20101 agtatggcgc ttggcattac aggtaaaaaa aaaaaaaaaa aaaaaaaaaa
20151 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa