!!NA_SEQUENCE 1.0
ID   TMEGDVCG   standard; RNA; VRL; 8105 BP.
XX
AC   M20562;
XX
SV   M20562.1
XX
DT   06-JUL-1989 (Rel. 20, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE   Theiler murine encephalomyelitis, complete genome.
XX
KW   complete genome; polyprotein.
XX
OS   Theiler's encephalomyelitis virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC   Cardiovirus.
XX
RN   [1]
RP   1-8105
RX   MEDLINE; 88265847.
RA   Pevear D.C., Borkowski J.A., Calenoff M., Oh C.K., Ostrawski B.,
RA   Lipton H.L.;
RT   "Insights into Theiler's virus neurovirulence based on a genomic comparison
RT   of the neurovirulent GDVII and less virulent BeAn strains";
RL   Virology 165:1-12(1988).
XX
DR   SWISS-PROT; P08545; POLG_TMEVG.
XX
CC   Draft entry and computer-readable sequence for [1] kindly provided
CC   by H.L.Lipton, 08-SEP-1988.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .8105
FT                   /db_xref="taxon:12124"
FT                   /organism="Theiler's encephalomyelitis virus"
FT   sig_peptide     1069. .1296
FT                   /note="polyprotein signal peptide"
FT   CDS             1069. .7980
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P08545"
FT                   /note="viral polyprotein"
FT                   /protein_id="AAA47929.1"
FT                   /translation="MACKHGYPDVCPICTAVDATPDFEYLLMADGEWFPTDLLCVDLDD
FT                   DVFWPSDTSTQPQTMEWTDVPLVCDTVMEPQGNASSSDKSNSQSSGNEGVIINNFYSNQ
FT                   YQNSIDLSASGGNAGDAPQNNGQLSSILGGAANAFATMAPLLMDQNTEEMENLSDRVAS
FT                   DKAGNSATNTQSTVGRLCGYGKSHHGEHPTSCADAATDKVLAAERYYTIDLASWTTSQE
FT                   AFSHIRIPLPHVLAGEDGGVFGATLRRHYLCKTGWRVQVQCNASQFHAGSLLVFMAPEF
FT                   YTGKGTKSGTMEPSDPFTMDTTWRSPQSAPTGYRYDRQAGFFAMNHQNQWQWTVYPHQI
FT                   LNLRTNTTVDLEVPYVNVAPSSSWTQHANWTLVVAVLSPLQYATGSSPDVQITASLQPV
FT                   NPVFNGLRHETVLAQSPIPVTVREHQGCFYSTNPDTTVPIYGKTISTPSDYMCGEFSDL
FT                   LELCKLPTFLGNPSTDNKRYPYFSATNSVPATSLVDYQVALSCSCTANSMLAAVARNFN
FT                   QYRGSLNFLFVFTGAAMVKGKFRIAYTPPGAGKPTTRDQAMQATYAIWDLGLNSSFNFT
FT                   APFISPTHYRQTSYTSPTITSVDGWVTVWQLTPLTYPSGTPTHSDILTLVSAGDDFTLR
FT                   MPISPTKWVPQGIDNAEKGKVSNDDASVDFVAEPVKLPENQTRVAFFYDRAVPIGMLRP
FT                   GQNMETTFSYQENDFRLNCLLLTPLPSYCPDSSSGPVRTKAPVQWRWVRSGGANGANFP
FT                   LMTKQDYAFLCFSPFTYYKCDLEVTVSAMGAGTVSSVLRWAPTGAPADVTDQLIGYTPS
FT                   LGETRNPHMWIVGSGNSQISFVVPYNSPLSVLPAAWFNGWSDFGNTKDFGVAPTSDFGR
FT                   IWIQGNSSASVRIRYKKMKVFCPRPTLFFPWPTPTTTKINADNPVPILELENPASLYRI
FT                   DLFITFTDELITFDYKVHGRPVLTFRIPGFGLTPAGRMLVCMGAKPAHSPFTSSKSLYH
FT                   VIFTSTCNSFSFTIYKGRYRSWKKPIHDELVDRGYTTFREFFKAVRGYHADYYKQRLIH
FT                   DVEMNPGPVQSVFQPQGAVLTKSLAPQAGIQNILLRLLGIEGDCSEVSKAITVVTDLVA
FT                   AWEKAKTTLVSPEFWSELILKTTKFIAASVLYLHNPDFTTTVCLSLMTGVDLLTNDSVF
FT                   DWLKSKLSSFFRTPPPACPNVMQPQGPLREANEGFTFAKNIEWATKTIQSIVNWLTSWF
FT                   KQEEDHPQSKLDKLLMEFPDHCRNIMDMRNGRKAYCECTASFKYFDDLYNLAVTCKRIP
FT                   LASLCEKFKNRHDHSVTRPEPVVAVLRGAAGQGKSVTSQIIAQSVSKMAFGRQSVYSMP
FT                   PDSEYFDGYENQFSVIMDDLGQNPDGEDFTVFCQMVSSTNFLPNMAHLERKGTPFTSSF
FT                   IVATTNLPKFRPVTVAHYPAVDRRITFDFTVTAGPHCKTPAGMLDIEKAFDEIPGSKPQ
FT                   LACFSADCPLLHKRGVMFTCNRTKTVYNLQQVVKMVNDTITRKTENVKKMNSLVAQSPP
FT                   DWQHFENILTCLRQNNAALQDQVDELQEAFTQARERSDFLSDWLKVSAIIFAGIVSLSA
FT                   VIKLASKFKESIWPTPVRVELSEGEQAAYAGRARAQKQALQVLDIQGGGKVLAQAGNPV
FT                   MDFELFCAKNMVSPITFYYPDKAEVTQSCLLLRAHLFVVNRHVAETEWTAFKLRDVRHE
FT                   RDTVVMRSVNRSGAETDLTFVKVTKGPLFKDNVNKFCSNKDDFPARNDTVTGIMNTGLA
FT                   FVYSGNFLIGNQPVNTTTGACFNHCLHYRAQTRRGWCGSAIICNVNGKKAVYGMHSAGG
FT                   GGLAAATIITRELIEAAEKSMLALEPQGAIVDISTGSVVHVPRKTKLRRTVAHDVFQPK
FT                   FEPAVLSRYDPRTDKDVDVVAFSKHTTNMESLPPIFDIVCGEYANRVFTILGKDNGLLT
FT                   VEQAVLGLSGMDPMEKDTSPGLPYTQQGLRRTDLLDFNTAKMTPQLDYAHSKLVLGVYD
FT                   DVVYQSFLKDEIRPLEKIHEAKTRIVDVPPFAHCIWGRQLLGRFASKFQTKPGFELGSA
FT                   IGTDPDVDWTRYAAELSGFNYVYDVDYSNFDASHSTAMFECLINNFFTEQNGFDRRIAE
FT                   YLRSLAVSRHAYEDRRVLIRGGLPSGCAATSMLNTIMNNVIIRAALYLTYSNFEFDDIK
FT                   VLSYGDDLLIGTNYQIDFNLVKERLAPFGYKITPANKTTTFPLTSHLQDVTFLKRRFVR
FT                   FNSYLFRPQMDAVNLKAMVSYCKPGTLKEKLMSIALLAVHSGPDIYDEIFLPFRNVGIV
FT                   VPTYDSMLYRWLSLFR"
FT   mat_peptide     1297. .1509
FT                   /note="protein 1A"
FT   mat_peptide     1510. .2310
FT                   /note="protein 1B"
FT   mat_peptide     2311. .3006
FT                   /note="protein 1C"
FT   mat_peptide     3007. .3834
FT                   /note="protein 1D"
FT   mat_peptide     3835. .4260
FT                   /note="protein 2A"
FT   mat_peptide     4261. .4641
FT                   /note="protein 2B"
FT   mat_peptide     4642. .5619
FT                   /note="protein 2C"
FT   mat_peptide     5620. .5883
FT                   /note="protein 3A"
FT   mat_peptide     5884. .5943
FT                   /note="protein 3B"
FT   mat_peptide     5944. .6594
FT                   /note="protein 3C"
FT   mat_peptide     6595. .7977
FT                   /note="protein 3D"
XX
SQ   Sequence 8105 BP; 1944 A; 2226 C; 1758 G; 2177 T; 0 other;

   M20562  Length: 8105  May 20, 2002 10:15  Type: N  Check: 9306  ..
       1  ttgaaagggg gcccggggga tctcccccgc ggtaactggt cacagttgcc
      51  gcggacggag atcatccccc ggttaccccc tttcgacgcg ggtactgcga
     101  tagtgccacc ccagtccttc ctactcccga ctcccgaccc caacccaggt
     151  tccttggaac aggaacacca atttattcat cccttggatg ctgactaatc
     201  agaggaacgt cagcattttc cggcccaggc taagagaagt agataagtta
     251  gaatctaaat tatttatcat ccccttgacg aattcgcgtt ggaaaagcac
     301  ctctcacttg ccgctcttca cacccatcat tctaattcgg cccctgtgtt
     351  gagccccttg ttgaagtgtt tccctccatc gcgacgtggt tggagatcta
     401  agttaaccga ctccgacgaa actaccatca tgcctccccg attatgtgat
     451  gctttctgcc ctgctgggtg gagcatcctc gggttgagaa atctttcttc
     501  cttttacctt ggactccggt cccccggtct aagccgcttg gaataagaca
     551  gggttatctt cactcctctt cttttctact tcacagtgtt ctatgctgtg
     601  aaagggtatg tgtcgcccct tccttcttcg gagaacacgc gtggcggttt
     651  tttccgtctc tcgacaagcg cgtgtgcgac atgcagagtc tcgcgaagaa
     701  agcagttctc ggtctagctt tagtgcccac aagaaaacag ctgtagcgac
     751  cacacaaagg cagcggaacc cccctcctgg taacaggagc ctctgcggcc
     801  aaaagccacg tggataagat ccacctttgt gtgcggtgca accccagcac
     851  cctgttttct tggtgacact ctagtgaacc cctgaatggc gatcttaagc
     901  gcctctgtag ggaagccagg aatgtccagg aggtacccct tccgctcgga
     951  agggatctga cctggagaca catcacatgt gctttacacc tgtgcttgtg
    1001  tttaaaaaat tgtcgcagct tccccaaacc aagtggtctt ggttttctct
    1051  ttttattata ttgtcaatat ggcttgcaaa cacggatacc cagacgtgtg
    1101  ccctatttgc acagccgttg acgctactcc cgactttgaa tatttgctca
    1151  tggcagacgg agaatggttc cctacggacc ttctttgtgt ggacttggac
    1201  gatgacgtct tctggccttc ggacacgagc actcaacctc aaacaatgga
    1251  atggactgat gtaccgctcg tatgcgatac tgtcatggaa ccccagggaa
    1301  atgcctcgtc atctgataag agtaactccc agtcctcagg aaatgagggg
    1351  gttatcatta ataacttcta ttccaatcaa taccagaact caattgattt
    1401  gtctgccagt ggtggcaacg ctggcgatgc tccccagaac aatggacaac
    1451  tgtccagcat tctgggtgga gctgcaaatg cttttgctac tatggcacct
    1501  ctcctcatgg accagaacac agaggagatg gaaaacctct ctgacagagt
    1551  agcttctgac aaagcaggga attcggccac aaacacacag tctactgttg
    1601  gccggctctg tggctatgga aagtcccacc acggagaaca cccaacctct
    1651  tgtgccgatg ccgcgactga caaggtcctc gcggctgaac gttactacac
    1701  tatcgatctg gctagttgga ccacttccca agaagctttc tctcacatca
    1751  ggattcctct ccctcacgtc cttgctggcg aggacggagg ggtttttgga
    1801  gctaccttaa ggagacacta cctctgcaag actggatggc gcgtacaagt
    1851  ccaatgcaac gcctcccagt ttcatgctgg ctcccttctt gtcttcatgg
    1901  ctccagaatt ctatactggt aaaggaacaa aatcaggcac tatggagcct
    1951  tcagacccat ttaccatgga caccacctgg cgcagcccgc aaagtgcgcc
    2001  cacaggctac cgctatgaca gacaagccgg ctttttcgcc atgaaccacc
    2051  agaaccaatg gcaatggact gtctaccctc accagatttt gaatttgcgc
    2101  acaaacacca ccgttgactt ggaagttccc tatgtcaatg tggcaccctc
    2151  cagctcttgg actcaacatg caaactggac tctcgttgtt gctgtgctca
    2201  gccctcttca gtacgccacc ggttcttcac cggacgttca aatcacagcc
    2251  tccctgcaac ctgttaatcc cgtgtttaat ggtttgagac acgaaactgt
    2301  gcttgcgcaa agtcctattc cagtcacggt gcgtgagcac cagggctgtt
    2351  tctactccac taaccctgac accactgttc ccatctatgg gaaaaccatt
    2401  tccaccccga gtgactacat gtgtggtgag ttttctgatc ttcttgaatt
    2451  gtgcaagctc cccacattcc ttggcaaccc cagcaccgac aacaagcgtt
    2501  acccttattt ctctgccacc aactccgtgc cagccacatc cctggttgac
    2551  taccaagttg ctctctcatg ctcttgtacg gccaactcaa tgcttgctgc
    2601  tgttgctcgt aactttaatc agtaccgtgg ttcactgaat tttcttttcg
    2651  ttttcactgg tgctgcaatg gttaagggca agtttcgcat agcctacacc
    2701  ccgcctggtg cgggaaagcc caccacccgg gaccaagcta tgcaggctac
    2751  ctacgccatt tgggacttgg gcttgaattc cagcttcaac ttcactgcgc
    2801  cttttatatc tccaactcat taccgtcaga ctagctatac tagccccacc
    2851  atcacatctg ttgacggttg ggtcactgtt tggcagctga cccccctgac
    2901  ctacccttct ggaaccccca cccattctga tattctcacc cttgtctccg
    2951  ccggcgatga cttcacgctc aggatgccaa tttcacccac caaatgggtt
    3001  ccacagggaa ttgacaatgc tgagaaggga aaggtctcca acgatgacgc
    3051  ttcggtcgat ttcgttgccg agccagtcaa gctacccgag aaccaaaccc
    3101  gggtggcctt cttttacgac agagctgtcc ccataggaat gttgagaccc
    3151  ggccaaaata tggaaaccac ctttagctac caagagaatg atttccgcct
    3201  caattgtctt ctgttgaccc ctcttccttc ttattgtccc gacagttcct
    3251  ccggtcctgt cagaacgaag gctcccgtcc agtggcgatg ggtgcggtct
    3301  ggtggcgcca atggtgccaa cttcccactc atgaccaaac aggactacgc
    3351  cttcctctgc ttttcccctt tcacctacta caagtgtgac cttgaagtta
    3401  ccgttagtgc tatgggagca ggcaccgttt cttctgttct gcgctgggca
    3451  cccaccgggg cgcccgcgga tgtcactgac cagctgatcg gctatactcc
    3501  tagtcttggt gaaacacgta acccccacat gtggatcgtt ggctctggaa
    3551  attctcaaat ttcttttgtc gtaccttaca attcccctct gtccgtctta
    3601  cccgctgctt ggttcaatgg atggtccgac tttggaaaca ccaaggattt
    3651  tggagttgct cctacgtcgg attttgggcg catttggata cagggtaaca
    3701  gctctgcctc agttcgaatc aggtacaaga agatgaaggt cttctgcccc
    3751  cgcccgaccc tctttttccc ctggccaacg cccaccacca ccaagatcaa
    3801  tgctgacaat ccagtcccca ttcttgagct tgagaatccc gcttctctct
    3851  accgcattga tcttttcatc acctttactg atgagctcat aacttttgac
    3901  tacaaggtcc acggacgtcc tgtgctcacc ttccggattc caggcttcgg
    3951  tctgacaccg gcaggcagaa tgctcgtgtg catgggcgcg aagcccgcac
    4001  acagtccgtt cacctcgtct aaatctctat accatgttat cttcacttcc
    4051  acttgcaatt ccttcagctt taccatctat aaaggacgct accgctcctg
    4101  gaagaagccc atccacgatg agcttgtgga tcgtggttac accactttcc
    4151  gcgagttctt caaggctgtg cgcggatacc atgctgacta ctacaaacag
    4201  agactcatac atgatgtaga aatgaaccca ggccctgtgc agtcggtttt
    4251  tcagccacaa ggtgcggtgc taactaaatc cctagcaccc caggcaggaa
    4301  ttcaaaacat ccttctacgc ctccttggca tagaaggtga ctgttcagaa
    4351  gttagtaaag caatcacagt tgtcactgac ttggttgctg catgggaaaa
    4401  agcaaaaacc actctggtct ctcctgaatt ttggtcagaa cttatattaa
    4451  aaactaccaa gttcattgct gcttccgtgc tctaccttca caaccctgac
    4501  ttcactacca ctgtttgtct ctcattgatg actggtgtag acctcctcac
    4551  caatgattct gtttttgatt ggcttaagag caaattgtcc tccttctttc
    4601  gtactcctcc cccagcttgc cccaatgtca tgcaacctca ggggcctcta
    4651  cgcgaggcca atgagggttt tacctttgcc aagaatattg aatgggccac
    4701  gaaaaccatc cagtccattg tcaattggct tactagctgg ttcaagcagg
    4751  aagaggacca cccccaatca aaattagaca aattgcttat ggaattccct
    4801  gatcattgta ggaacattat ggatatgagg aacggtcgaa aggcctattg
    4851  tgaatgcact gcttccttta agtattttga tgatctttac aatcttgctg
    4901  ttacttgcaa aagaattcca ttagcctccc tttgtgagaa atttaagaat
    4951  agacacgacc actctgtcac cagacccgag ccggtggttg ctgtcctgcg
    5001  cggcgccgct gggcaaggca aatctgtgac cagccaaatc attgcccaat
    5051  ctgtttctaa gatggccttt ggccgtcagt ctgtttattc tatgcccccc
    5101  gattcggaat attttgatgg ctatgaaaat caattttctg tgattatgga
    5151  tgatctagga caaaatcccg atggtgaaga tttcaccgtc ttttgtcaaa
    5201  tggtttctag cacaaatttt ctcccgaata tggctcacct ggaaagaaaa
    5251  ggcacccctt tcacctctag ctttattgtt gccacaacaa atttgcccaa
    5301  attccgcccc gttacggtag cccattaccc tgctgttgat aggcgaatca
    5351  cctttgactt caccgttact gctggacccc actgtaaaac acccgctgga
    5401  atgttggaca ttgagaaggc ttttgatgaa atacctggct ccaaacctca
    5451  gcttgcctgc tttagtgctg attgtcccct cctacacaag agaggagtta
    5501  tgttcacctg caatcgcacc aaaactgtct acaaccttca acaggttgtg
    5551  aaaatggtca acgacactat cactcgcaag actgaaaacg ttaagaaaat
    5601  gaacagcttg gttgcccagt ccccaccaga ctggcaacac tttgagaata
    5651  tcctcacttg cctccgtcag aataatgctg ctcttcagga tcaagttgat
    5701  gaattgcaag aagcgttcac ccaagcccgc gagcgttctg attttctttc
    5751  tgattggttg aaggtttctg ctatcatttt tgctggtatt gtctcacttt
    5801  ctgctgtcat aaaactagcc tccaaattta aagaatcaat ttggcccacg
    5851  cccgtgagag ttgagctctc tgagggcgaa caggccgcgt atgctggtcg
    5901  tgcgcgcgct caaaaacaag cccttcaggt cctagatatt caaggaggcg
    5951  ggaaggttct agcccaggcc ggtaaccccg tcatggactt tgagcttttc
    6001  tgtgccaaaa acatggtttc cccgattacc ttctactacc ctgacaaggc
    6051  tgaagtgacc cagagctgct tgctgctccg tgcccacctc ttcgtggtca
    6101  accgccacgt cgctgaaacg gaatggacag ctttcaagct tagggatgtg
    6151  aggcacgaac gtgacactgt tgtcatgcgt tccgttaacc gctcaggagc
    6201  tgaaacggac cttacattcg tgaaggttac taaaggacca ctcttcaagg
    6251  acaatgtgaa caagttttgc tcaaacaagg acgattttcc tgctaggaat
    6301  gacactgtta ccgggataat gaacactgga ttggccttcg tgtattccgg
    6351  taactttctg attggcaatc aacctgtgaa cacaacaact ggagcctgct
    6401  tcaaccactg cctccactat cgagctcaaa ctcgacgtgg ttggtgtggt
    6451  tctgccatca tctgcaatgt taacggcaaa aaagctgttt acggaatgca
    6501  ctctgctgga ggcggaggcc ttgccgccgc taccatcatc accagagagt
    6551  tgattgaagc agctgagaag tctatgttgg cgctggaacc gcaaggtgcc
    6601  atcgtggaca tttccacagg atctgtcgta catgtcccca gaaagaccaa
    6651  actgaggaga acagtcgctc atgacgtttt ccaacccaaa ttcgaacctg
    6701  cagttctgtc ccgttatgac cctcggaccg acaaggatgt agatgttgta
    6751  gccttctcca aacacactac taacatggaa agcttgcctc caatctttga
    6801  cattgtctgc ggtgaatacg ctaaccgtgt tttcaccatc cttggtaaag
    6851  acaacggtct cttaaccgtt gaacaggctg tgcttggctt gtcgggcatg
    6901  gaccccatgg agaaggacac ctcccctgga ttgccctaca ctcaacaagg
    6951  actcagacga actgaccttc tggatttcaa cactgccaaa atgacacccc
    7001  aattggacta tgcccattcc aaactggtac tcggcgttta tgacgacgtt
    7051  gtctaccaat catttttgaa agatgaaatt cggcccttgg agaagatcca
    7101  cgaagcaaaa acccggattg ttgatgtgcc cccgtttgcc cactgcattt
    7151  ggggaagaca gcttttggga cgcttcgctt ccaaatttca aactaaacct
    7201  ggatttgaac ttggatctgc aattggaact gacccggatg ttgattggac
    7251  gcgctatgcc gccgagctga gcgggttcaa ctacgtctat gatgtagatt
    7301  actccaactt tgatgcttcc cattctactg caatgtttga atgtttgatt
    7351  aacaatttct ttacagagca aaatggattt gacagacgca ttgccgagta
    7401  ccttagatct ctggctgtgt cgcgacatgc ttatgaggac cgccgtgtcc
    7451  tcatacgtgg gggcctgcct tcgggctgtg ctgctaccag catgttaaac
    7501  accatcatga acaatgtcat aattcgtgct gccctgtacc ttacttattc
    7551  aaattttgaa tttgatgata ttaaggtcct ttcctacgga gacgaccttt
    7601  taattggaac taattaccaa attgatttta atcttgttaa agaaagatta
    7651  gcccccttcg gttataagat tactcctgcc aacaagacca ctacttttcc
    7701  tctgacctcc catttgcaag atgttacctt tctaaagaga agatttgtga
    7751  gatttaattc ttacctgttc agacctcaaa tggatgctgt caatttgaaa
    7801  gcaatggtta gctactgtaa accaggaaca cttaaggaga aactaatgtc
    7851  cattgctctt ctggccgttc attctggacc agatatttat gatgagattt
    7901  tccttccttt taggaatgtt ggaatagttg tccccaccta tgattctatg
    7951  ctttacagat ggcttagctt atttagatga acatcctctc gatcggatcg
    8001  caacgcttta ccctagaagc cactagggtg tacgcggccg ttctgacgtt
    8051  ggaattcttt taggcaaaag ttgtgtagat gcttataatt ggaaatgaga
    8101  acaac