!!NA_SEQUENCE 1.0
ID TMEGDVCG standard; RNA; VRL; 8105 BP.
XX
AC M20562;
XX
SV M20562.1
XX
DT 06-JUL-1989 (Rel. 20, Created)
DT 04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE Theiler murine encephalomyelitis, complete genome.
XX
KW complete genome; polyprotein.
XX
OS Theiler's encephalomyelitis virus
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC Cardiovirus.
XX
RN [1]
RP 1-8105
RX MEDLINE; 88265847.
RA Pevear D.C., Borkowski J.A., Calenoff M., Oh C.K., Ostrawski B.,
RA Lipton H.L.;
RT "Insights into Theiler's virus neurovirulence based on a genomic comparison
RT of the neurovirulent GDVII and less virulent BeAn strains";
RL Virology 165:1-12(1988).
XX
DR SWISS-PROT; P08545; POLG_TMEVG.
XX
CC Draft entry and computer-readable sequence for [1] kindly provided
CC by H.L.Lipton, 08-SEP-1988.
XX
FH Key Location/Qualifiers
FH
FT source 1. .8105
FT /db_xref="taxon:12124"
FT /organism="Theiler's encephalomyelitis virus"
FT sig_peptide 1069. .1296
FT /note="polyprotein signal peptide"
FT CDS 1069. .7980
FT /codon_start=1
FT /db_xref="SWISS-PROT:P08545"
FT /note="viral polyprotein"
FT /protein_id="AAA47929.1"
FT /translation="MACKHGYPDVCPICTAVDATPDFEYLLMADGEWFPTDLLCVDLDD
FT DVFWPSDTSTQPQTMEWTDVPLVCDTVMEPQGNASSSDKSNSQSSGNEGVIINNFYSNQ
FT YQNSIDLSASGGNAGDAPQNNGQLSSILGGAANAFATMAPLLMDQNTEEMENLSDRVAS
FT DKAGNSATNTQSTVGRLCGYGKSHHGEHPTSCADAATDKVLAAERYYTIDLASWTTSQE
FT AFSHIRIPLPHVLAGEDGGVFGATLRRHYLCKTGWRVQVQCNASQFHAGSLLVFMAPEF
FT YTGKGTKSGTMEPSDPFTMDTTWRSPQSAPTGYRYDRQAGFFAMNHQNQWQWTVYPHQI
FT LNLRTNTTVDLEVPYVNVAPSSSWTQHANWTLVVAVLSPLQYATGSSPDVQITASLQPV
FT NPVFNGLRHETVLAQSPIPVTVREHQGCFYSTNPDTTVPIYGKTISTPSDYMCGEFSDL
FT LELCKLPTFLGNPSTDNKRYPYFSATNSVPATSLVDYQVALSCSCTANSMLAAVARNFN
FT QYRGSLNFLFVFTGAAMVKGKFRIAYTPPGAGKPTTRDQAMQATYAIWDLGLNSSFNFT
FT APFISPTHYRQTSYTSPTITSVDGWVTVWQLTPLTYPSGTPTHSDILTLVSAGDDFTLR
FT MPISPTKWVPQGIDNAEKGKVSNDDASVDFVAEPVKLPENQTRVAFFYDRAVPIGMLRP
FT GQNMETTFSYQENDFRLNCLLLTPLPSYCPDSSSGPVRTKAPVQWRWVRSGGANGANFP
FT LMTKQDYAFLCFSPFTYYKCDLEVTVSAMGAGTVSSVLRWAPTGAPADVTDQLIGYTPS
FT LGETRNPHMWIVGSGNSQISFVVPYNSPLSVLPAAWFNGWSDFGNTKDFGVAPTSDFGR
FT IWIQGNSSASVRIRYKKMKVFCPRPTLFFPWPTPTTTKINADNPVPILELENPASLYRI
FT DLFITFTDELITFDYKVHGRPVLTFRIPGFGLTPAGRMLVCMGAKPAHSPFTSSKSLYH
FT VIFTSTCNSFSFTIYKGRYRSWKKPIHDELVDRGYTTFREFFKAVRGYHADYYKQRLIH
FT DVEMNPGPVQSVFQPQGAVLTKSLAPQAGIQNILLRLLGIEGDCSEVSKAITVVTDLVA
FT AWEKAKTTLVSPEFWSELILKTTKFIAASVLYLHNPDFTTTVCLSLMTGVDLLTNDSVF
FT DWLKSKLSSFFRTPPPACPNVMQPQGPLREANEGFTFAKNIEWATKTIQSIVNWLTSWF
FT KQEEDHPQSKLDKLLMEFPDHCRNIMDMRNGRKAYCECTASFKYFDDLYNLAVTCKRIP
FT LASLCEKFKNRHDHSVTRPEPVVAVLRGAAGQGKSVTSQIIAQSVSKMAFGRQSVYSMP
FT PDSEYFDGYENQFSVIMDDLGQNPDGEDFTVFCQMVSSTNFLPNMAHLERKGTPFTSSF
FT IVATTNLPKFRPVTVAHYPAVDRRITFDFTVTAGPHCKTPAGMLDIEKAFDEIPGSKPQ
FT LACFSADCPLLHKRGVMFTCNRTKTVYNLQQVVKMVNDTITRKTENVKKMNSLVAQSPP
FT DWQHFENILTCLRQNNAALQDQVDELQEAFTQARERSDFLSDWLKVSAIIFAGIVSLSA
FT VIKLASKFKESIWPTPVRVELSEGEQAAYAGRARAQKQALQVLDIQGGGKVLAQAGNPV
FT MDFELFCAKNMVSPITFYYPDKAEVTQSCLLLRAHLFVVNRHVAETEWTAFKLRDVRHE
FT RDTVVMRSVNRSGAETDLTFVKVTKGPLFKDNVNKFCSNKDDFPARNDTVTGIMNTGLA
FT FVYSGNFLIGNQPVNTTTGACFNHCLHYRAQTRRGWCGSAIICNVNGKKAVYGMHSAGG
FT GGLAAATIITRELIEAAEKSMLALEPQGAIVDISTGSVVHVPRKTKLRRTVAHDVFQPK
FT FEPAVLSRYDPRTDKDVDVVAFSKHTTNMESLPPIFDIVCGEYANRVFTILGKDNGLLT
FT VEQAVLGLSGMDPMEKDTSPGLPYTQQGLRRTDLLDFNTAKMTPQLDYAHSKLVLGVYD
FT DVVYQSFLKDEIRPLEKIHEAKTRIVDVPPFAHCIWGRQLLGRFASKFQTKPGFELGSA
FT IGTDPDVDWTRYAAELSGFNYVYDVDYSNFDASHSTAMFECLINNFFTEQNGFDRRIAE
FT YLRSLAVSRHAYEDRRVLIRGGLPSGCAATSMLNTIMNNVIIRAALYLTYSNFEFDDIK
FT VLSYGDDLLIGTNYQIDFNLVKERLAPFGYKITPANKTTTFPLTSHLQDVTFLKRRFVR
FT FNSYLFRPQMDAVNLKAMVSYCKPGTLKEKLMSIALLAVHSGPDIYDEIFLPFRNVGIV
FT VPTYDSMLYRWLSLFR"
FT mat_peptide 1297. .1509
FT /note="protein 1A"
FT mat_peptide 1510. .2310
FT /note="protein 1B"
FT mat_peptide 2311. .3006
FT /note="protein 1C"
FT mat_peptide 3007. .3834
FT /note="protein 1D"
FT mat_peptide 3835. .4260
FT /note="protein 2A"
FT mat_peptide 4261. .4641
FT /note="protein 2B"
FT mat_peptide 4642. .5619
FT /note="protein 2C"
FT mat_peptide 5620. .5883
FT /note="protein 3A"
FT mat_peptide 5884. .5943
FT /note="protein 3B"
FT mat_peptide 5944. .6594
FT /note="protein 3C"
FT mat_peptide 6595. .7977
FT /note="protein 3D"
XX
SQ Sequence 8105 BP; 1944 A; 2226 C; 1758 G; 2177 T; 0 other;
M20562 Length: 8105 May 20, 2002 10:15 Type: N Check: 9306 ..
1 ttgaaagggg gcccggggga tctcccccgc ggtaactggt cacagttgcc
51 gcggacggag atcatccccc ggttaccccc tttcgacgcg ggtactgcga
101 tagtgccacc ccagtccttc ctactcccga ctcccgaccc caacccaggt
151 tccttggaac aggaacacca atttattcat cccttggatg ctgactaatc
201 agaggaacgt cagcattttc cggcccaggc taagagaagt agataagtta
251 gaatctaaat tatttatcat ccccttgacg aattcgcgtt ggaaaagcac
301 ctctcacttg ccgctcttca cacccatcat tctaattcgg cccctgtgtt
351 gagccccttg ttgaagtgtt tccctccatc gcgacgtggt tggagatcta
401 agttaaccga ctccgacgaa actaccatca tgcctccccg attatgtgat
451 gctttctgcc ctgctgggtg gagcatcctc gggttgagaa atctttcttc
501 cttttacctt ggactccggt cccccggtct aagccgcttg gaataagaca
551 gggttatctt cactcctctt cttttctact tcacagtgtt ctatgctgtg
601 aaagggtatg tgtcgcccct tccttcttcg gagaacacgc gtggcggttt
651 tttccgtctc tcgacaagcg cgtgtgcgac atgcagagtc tcgcgaagaa
701 agcagttctc ggtctagctt tagtgcccac aagaaaacag ctgtagcgac
751 cacacaaagg cagcggaacc cccctcctgg taacaggagc ctctgcggcc
801 aaaagccacg tggataagat ccacctttgt gtgcggtgca accccagcac
851 cctgttttct tggtgacact ctagtgaacc cctgaatggc gatcttaagc
901 gcctctgtag ggaagccagg aatgtccagg aggtacccct tccgctcgga
951 agggatctga cctggagaca catcacatgt gctttacacc tgtgcttgtg
1001 tttaaaaaat tgtcgcagct tccccaaacc aagtggtctt ggttttctct
1051 ttttattata ttgtcaatat ggcttgcaaa cacggatacc cagacgtgtg
1101 ccctatttgc acagccgttg acgctactcc cgactttgaa tatttgctca
1151 tggcagacgg agaatggttc cctacggacc ttctttgtgt ggacttggac
1201 gatgacgtct tctggccttc ggacacgagc actcaacctc aaacaatgga
1251 atggactgat gtaccgctcg tatgcgatac tgtcatggaa ccccagggaa
1301 atgcctcgtc atctgataag agtaactccc agtcctcagg aaatgagggg
1351 gttatcatta ataacttcta ttccaatcaa taccagaact caattgattt
1401 gtctgccagt ggtggcaacg ctggcgatgc tccccagaac aatggacaac
1451 tgtccagcat tctgggtgga gctgcaaatg cttttgctac tatggcacct
1501 ctcctcatgg accagaacac agaggagatg gaaaacctct ctgacagagt
1551 agcttctgac aaagcaggga attcggccac aaacacacag tctactgttg
1601 gccggctctg tggctatgga aagtcccacc acggagaaca cccaacctct
1651 tgtgccgatg ccgcgactga caaggtcctc gcggctgaac gttactacac
1701 tatcgatctg gctagttgga ccacttccca agaagctttc tctcacatca
1751 ggattcctct ccctcacgtc cttgctggcg aggacggagg ggtttttgga
1801 gctaccttaa ggagacacta cctctgcaag actggatggc gcgtacaagt
1851 ccaatgcaac gcctcccagt ttcatgctgg ctcccttctt gtcttcatgg
1901 ctccagaatt ctatactggt aaaggaacaa aatcaggcac tatggagcct
1951 tcagacccat ttaccatgga caccacctgg cgcagcccgc aaagtgcgcc
2001 cacaggctac cgctatgaca gacaagccgg ctttttcgcc atgaaccacc
2051 agaaccaatg gcaatggact gtctaccctc accagatttt gaatttgcgc
2101 acaaacacca ccgttgactt ggaagttccc tatgtcaatg tggcaccctc
2151 cagctcttgg actcaacatg caaactggac tctcgttgtt gctgtgctca
2201 gccctcttca gtacgccacc ggttcttcac cggacgttca aatcacagcc
2251 tccctgcaac ctgttaatcc cgtgtttaat ggtttgagac acgaaactgt
2301 gcttgcgcaa agtcctattc cagtcacggt gcgtgagcac cagggctgtt
2351 tctactccac taaccctgac accactgttc ccatctatgg gaaaaccatt
2401 tccaccccga gtgactacat gtgtggtgag ttttctgatc ttcttgaatt
2451 gtgcaagctc cccacattcc ttggcaaccc cagcaccgac aacaagcgtt
2501 acccttattt ctctgccacc aactccgtgc cagccacatc cctggttgac
2551 taccaagttg ctctctcatg ctcttgtacg gccaactcaa tgcttgctgc
2601 tgttgctcgt aactttaatc agtaccgtgg ttcactgaat tttcttttcg
2651 ttttcactgg tgctgcaatg gttaagggca agtttcgcat agcctacacc
2701 ccgcctggtg cgggaaagcc caccacccgg gaccaagcta tgcaggctac
2751 ctacgccatt tgggacttgg gcttgaattc cagcttcaac ttcactgcgc
2801 cttttatatc tccaactcat taccgtcaga ctagctatac tagccccacc
2851 atcacatctg ttgacggttg ggtcactgtt tggcagctga cccccctgac
2901 ctacccttct ggaaccccca cccattctga tattctcacc cttgtctccg
2951 ccggcgatga cttcacgctc aggatgccaa tttcacccac caaatgggtt
3001 ccacagggaa ttgacaatgc tgagaaggga aaggtctcca acgatgacgc
3051 ttcggtcgat ttcgttgccg agccagtcaa gctacccgag aaccaaaccc
3101 gggtggcctt cttttacgac agagctgtcc ccataggaat gttgagaccc
3151 ggccaaaata tggaaaccac ctttagctac caagagaatg atttccgcct
3201 caattgtctt ctgttgaccc ctcttccttc ttattgtccc gacagttcct
3251 ccggtcctgt cagaacgaag gctcccgtcc agtggcgatg ggtgcggtct
3301 ggtggcgcca atggtgccaa cttcccactc atgaccaaac aggactacgc
3351 cttcctctgc ttttcccctt tcacctacta caagtgtgac cttgaagtta
3401 ccgttagtgc tatgggagca ggcaccgttt cttctgttct gcgctgggca
3451 cccaccgggg cgcccgcgga tgtcactgac cagctgatcg gctatactcc
3501 tagtcttggt gaaacacgta acccccacat gtggatcgtt ggctctggaa
3551 attctcaaat ttcttttgtc gtaccttaca attcccctct gtccgtctta
3601 cccgctgctt ggttcaatgg atggtccgac tttggaaaca ccaaggattt
3651 tggagttgct cctacgtcgg attttgggcg catttggata cagggtaaca
3701 gctctgcctc agttcgaatc aggtacaaga agatgaaggt cttctgcccc
3751 cgcccgaccc tctttttccc ctggccaacg cccaccacca ccaagatcaa
3801 tgctgacaat ccagtcccca ttcttgagct tgagaatccc gcttctctct
3851 accgcattga tcttttcatc acctttactg atgagctcat aacttttgac
3901 tacaaggtcc acggacgtcc tgtgctcacc ttccggattc caggcttcgg
3951 tctgacaccg gcaggcagaa tgctcgtgtg catgggcgcg aagcccgcac
4001 acagtccgtt cacctcgtct aaatctctat accatgttat cttcacttcc
4051 acttgcaatt ccttcagctt taccatctat aaaggacgct accgctcctg
4101 gaagaagccc atccacgatg agcttgtgga tcgtggttac accactttcc
4151 gcgagttctt caaggctgtg cgcggatacc atgctgacta ctacaaacag
4201 agactcatac atgatgtaga aatgaaccca ggccctgtgc agtcggtttt
4251 tcagccacaa ggtgcggtgc taactaaatc cctagcaccc caggcaggaa
4301 ttcaaaacat ccttctacgc ctccttggca tagaaggtga ctgttcagaa
4351 gttagtaaag caatcacagt tgtcactgac ttggttgctg catgggaaaa
4401 agcaaaaacc actctggtct ctcctgaatt ttggtcagaa cttatattaa
4451 aaactaccaa gttcattgct gcttccgtgc tctaccttca caaccctgac
4501 ttcactacca ctgtttgtct ctcattgatg actggtgtag acctcctcac
4551 caatgattct gtttttgatt ggcttaagag caaattgtcc tccttctttc
4601 gtactcctcc cccagcttgc cccaatgtca tgcaacctca ggggcctcta
4651 cgcgaggcca atgagggttt tacctttgcc aagaatattg aatgggccac
4701 gaaaaccatc cagtccattg tcaattggct tactagctgg ttcaagcagg
4751 aagaggacca cccccaatca aaattagaca aattgcttat ggaattccct
4801 gatcattgta ggaacattat ggatatgagg aacggtcgaa aggcctattg
4851 tgaatgcact gcttccttta agtattttga tgatctttac aatcttgctg
4901 ttacttgcaa aagaattcca ttagcctccc tttgtgagaa atttaagaat
4951 agacacgacc actctgtcac cagacccgag ccggtggttg ctgtcctgcg
5001 cggcgccgct gggcaaggca aatctgtgac cagccaaatc attgcccaat
5051 ctgtttctaa gatggccttt ggccgtcagt ctgtttattc tatgcccccc
5101 gattcggaat attttgatgg ctatgaaaat caattttctg tgattatgga
5151 tgatctagga caaaatcccg atggtgaaga tttcaccgtc ttttgtcaaa
5201 tggtttctag cacaaatttt ctcccgaata tggctcacct ggaaagaaaa
5251 ggcacccctt tcacctctag ctttattgtt gccacaacaa atttgcccaa
5301 attccgcccc gttacggtag cccattaccc tgctgttgat aggcgaatca
5351 cctttgactt caccgttact gctggacccc actgtaaaac acccgctgga
5401 atgttggaca ttgagaaggc ttttgatgaa atacctggct ccaaacctca
5451 gcttgcctgc tttagtgctg attgtcccct cctacacaag agaggagtta
5501 tgttcacctg caatcgcacc aaaactgtct acaaccttca acaggttgtg
5551 aaaatggtca acgacactat cactcgcaag actgaaaacg ttaagaaaat
5601 gaacagcttg gttgcccagt ccccaccaga ctggcaacac tttgagaata
5651 tcctcacttg cctccgtcag aataatgctg ctcttcagga tcaagttgat
5701 gaattgcaag aagcgttcac ccaagcccgc gagcgttctg attttctttc
5751 tgattggttg aaggtttctg ctatcatttt tgctggtatt gtctcacttt
5801 ctgctgtcat aaaactagcc tccaaattta aagaatcaat ttggcccacg
5851 cccgtgagag ttgagctctc tgagggcgaa caggccgcgt atgctggtcg
5901 tgcgcgcgct caaaaacaag cccttcaggt cctagatatt caaggaggcg
5951 ggaaggttct agcccaggcc ggtaaccccg tcatggactt tgagcttttc
6001 tgtgccaaaa acatggtttc cccgattacc ttctactacc ctgacaaggc
6051 tgaagtgacc cagagctgct tgctgctccg tgcccacctc ttcgtggtca
6101 accgccacgt cgctgaaacg gaatggacag ctttcaagct tagggatgtg
6151 aggcacgaac gtgacactgt tgtcatgcgt tccgttaacc gctcaggagc
6201 tgaaacggac cttacattcg tgaaggttac taaaggacca ctcttcaagg
6251 acaatgtgaa caagttttgc tcaaacaagg acgattttcc tgctaggaat
6301 gacactgtta ccgggataat gaacactgga ttggccttcg tgtattccgg
6351 taactttctg attggcaatc aacctgtgaa cacaacaact ggagcctgct
6401 tcaaccactg cctccactat cgagctcaaa ctcgacgtgg ttggtgtggt
6451 tctgccatca tctgcaatgt taacggcaaa aaagctgttt acggaatgca
6501 ctctgctgga ggcggaggcc ttgccgccgc taccatcatc accagagagt
6551 tgattgaagc agctgagaag tctatgttgg cgctggaacc gcaaggtgcc
6601 atcgtggaca tttccacagg atctgtcgta catgtcccca gaaagaccaa
6651 actgaggaga acagtcgctc atgacgtttt ccaacccaaa ttcgaacctg
6701 cagttctgtc ccgttatgac cctcggaccg acaaggatgt agatgttgta
6751 gccttctcca aacacactac taacatggaa agcttgcctc caatctttga
6801 cattgtctgc ggtgaatacg ctaaccgtgt tttcaccatc cttggtaaag
6851 acaacggtct cttaaccgtt gaacaggctg tgcttggctt gtcgggcatg
6901 gaccccatgg agaaggacac ctcccctgga ttgccctaca ctcaacaagg
6951 actcagacga actgaccttc tggatttcaa cactgccaaa atgacacccc
7001 aattggacta tgcccattcc aaactggtac tcggcgttta tgacgacgtt
7051 gtctaccaat catttttgaa agatgaaatt cggcccttgg agaagatcca
7101 cgaagcaaaa acccggattg ttgatgtgcc cccgtttgcc cactgcattt
7151 ggggaagaca gcttttggga cgcttcgctt ccaaatttca aactaaacct
7201 ggatttgaac ttggatctgc aattggaact gacccggatg ttgattggac
7251 gcgctatgcc gccgagctga gcgggttcaa ctacgtctat gatgtagatt
7301 actccaactt tgatgcttcc cattctactg caatgtttga atgtttgatt
7351 aacaatttct ttacagagca aaatggattt gacagacgca ttgccgagta
7401 ccttagatct ctggctgtgt cgcgacatgc ttatgaggac cgccgtgtcc
7451 tcatacgtgg gggcctgcct tcgggctgtg ctgctaccag catgttaaac
7501 accatcatga acaatgtcat aattcgtgct gccctgtacc ttacttattc
7551 aaattttgaa tttgatgata ttaaggtcct ttcctacgga gacgaccttt
7601 taattggaac taattaccaa attgatttta atcttgttaa agaaagatta
7651 gcccccttcg gttataagat tactcctgcc aacaagacca ctacttttcc
7701 tctgacctcc catttgcaag atgttacctt tctaaagaga agatttgtga
7751 gatttaattc ttacctgttc agacctcaaa tggatgctgt caatttgaaa
7801 gcaatggtta gctactgtaa accaggaaca cttaaggaga aactaatgtc
7851 cattgctctt ctggccgttc attctggacc agatatttat gatgagattt
7901 tccttccttt taggaatgtt ggaatagttg tccccaccta tgattctatg
7951 ctttacagat ggcttagctt atttagatga acatcctctc gatcggatcg
8001 caacgcttta ccctagaagc cactagggtg tacgcggccg ttctgacgtt
8051 ggaattcttt taggcaaaag ttgtgtagat gcttataatt ggaaatgaga
8101 acaac