Sequence of DPV Sorghum mosaic virus

Sorghum mosaic virus strain M polyprotein mRNA, partial cds.

ACC No: U57360

Dated: 2006-11-14 | Length: 2053 | CRC: 249179188

                
ID   U57360; SV 1; linear; mRNA; STD; VRL; 2053 BP.
XX
AC   U57360;
XX
DT   06-FEB-1997 (Rel. 50, Created)
DT   14-NOV-2006 (Rel. 89, Last updated, Version 4)
XX
DE   Sorghum mosaic virus strain M polyprotein mRNA, partial cds.
XX
KW   .
XX
OS   Sorghum mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC   Potyvirus.
XX
RN   [1]
RP   1-2053
RA   Mirkov T.E., Yang Z.N.;
RT   "Sequence and relationships of Sugarcane Mosaic and Sorghum Mosaic Virus
RT   strains, and development of RT-PCR based RFLPs for strain discrimination";
RL   Phytopathology 87:932-939(1997).
XX
RN   [2]
RP   1-2053
RA   Mirkov T.E.;
RT   ;
RL   Submitted (02-MAY-1996) to the EMBL/GenBank/DDBJ databases.
RL   T. Erik Mirkov, Plant Pathology, Texas A&M, 2415 E. Hwy 83, Weslaco, TX
RL   78596, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2053
FT                   /organism="Sorghum mosaic virus"
FT                   /strain="M"
FT                   /mol_type="mRNA"
FT                   /db_xref="taxon:32619"
FT   mat_peptide     <1..828
FT                   /product="nuclear inclusion II protein"
FT   CDS             <1..1818
FT                   /codon_start=1
FT                   /product="polyprotein"
FT                   /db_xref="GOA:P89210"
FT                   /db_xref="InterPro:IPR001205"
FT                   /db_xref="InterPro:IPR001592"
FT                   /db_xref="InterPro:IPR005121"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="UniProtKB/TrEMBL:P89210"
FT                   /protein_id="AAB70864.1"
FT                   /translation="CDADGSQFDSSLTPYLINAVLDIRLHFMEDWSIGEKMLRNLYTEI
FT                   VYTPIATPDGSVIKKFKGNNSGQPSTVVDNTLMVIIAFNYTMFSCGIEADMIDEICKMY
FT                   ANGDDLLLAIRPDYEHLLDNFSKHFADLGLNFDFTSRTRDRTELWFMSTRGIKIDNMYI
FT                   PKLEQERIVAILEWDRSLLPQYRLEAICAAMVESWGYPQLLHEIRKFYAWILEMQPFAT
FT                   LAKEGLAPYIAETALRNLYTGEGIKEGELDVYYTQFLKDLPEYIEDELIDVRHQAGGGT
FT                   VDAGAATAEATAQAQRDAAAKAQRDADAKKKADDEAAERQRQEAAAKKKADDDAKAKAD
FT                   ADAKAKSDADAKKKADDEAARKAQNQKDKDVDVGTSGTVAVPKLKAMSKKMKLPQAKGK
FT                   NILHLDFLLGYKPQQQDISNTRATRDEFDRWYDALQKEYELDDTQMTVVASGLMVWVIE
FT                   NGCSPNINGVWTMMDGDEQRKFPLKPVIEYASPTFRQIMHHFSDAAEAYIEYRNSTERY
FT                   MPRYGLQRNLTDYNLARYAFDFYEITSRTPARAREAHMQMKAAAVRGSNTRMFGLDGNV
FT                   GESQENTERHTAGDVSRNMHSLLGVQQHH"
FT   variation       9
FT                   /replace="a"
FT                   /note="nucleotide difference between two independent clones
FT                   for this strain"
FT   variation       264
FT                   /replace="g"
FT                   /note="nucleotide difference between two independent clones
FT                   for this strain; results in amino acid change from F to L"
FT   mat_peptide     829..1815
FT                   /product="coat protein"
FT   3'UTR           1819..2053
XX
SQ   Sequence 2053 BP; 681 A; 385 C; 491 G; 496 T; 0 other;

u57360 Length: 2053  14-NOV-2006  Type: N  Check: 3345  ..

       1  tgtgatgccg atggttcgca attcgatagt tcactaacac cttatctcat
      51  caatgcagtg ttggacatca gattgcattt tatggaagat tggagtatcg
     101  gagagaaaat gctcaggaac ctttatacag aaattgttta tactcctata
     151  gcaacaccag atggatccgt cataaagaaa ttcaaaggaa ataatagcgg
     201  acaaccatca accgtcgttg ataacacact aatggtgatc atagcgttta
     251  actatacaat gttttcatgt gggatcgaag ctgatatgat agatgaaata
     301  tgcaaaatgt atgcaaatgg ggacgatctt ttgttagcaa tacggccaga
     351  ttacgaacat ttattggata atttctcaaa acactttgct gatctaggtc
     401  ttaacttcga ttttacatca cgcacaagag ataggacgga attgtggttt
     451  atgtcgacac gaggcattaa aattgacaat atgtacatcc caaaattgga
     501  acaggaaaga atcgttgcta ttttagaatg ggatagatca ttattaccac
     551  aatatagact ggaggcgata tgtgctgcaa tggtggaatc atggggatat
     601  ccacaattat tacatgagat taggaaattt tatgcttgga ttctcgaaat
     651  gcagccattc gccactctag cgaaagaagg acttgccccg tacatagcag
     701  aaacggcttt gcgtaatctt tatacagggg aaggaataaa agaaggggag
     751  ttggatgttt attacacaca attcctcaaa gatttgcctg aatacataga
     801  ggatgaacta attgacgtgc gccatcaggc aggaggcggt acagtagatg
     851  caggagcagc cacagcagaa gcaacagcac aagcacagcg tgatgcagca
     901  gcgaaagctc aacgagatgc tgatgcaaag aagaaggcgg atgatgaagc
     951  ggcagagagg cagagacaag aagccgcggc aaagaagaaa gctgatgatg
    1001  atgcaaaagc taaagctgat gcggatgcta aagcaaaatc agatgctgat
    1051  gcgaaaaaga aagcagacga cgaagcagca agaaaagcac aaaatcaaaa
    1101  agacaaggat gtggatgttg gcacatctgg cacggtggca gtgcctaagc
    1151  tcaaagcaat gtccaagaaa atgaaattac cacaagcaaa agggaaaaac
    1201  attttacact tggattttct tttgggatat aagccacaac aacaagacat
    1251  ttcaaacacc agagccacac gggatgagtt cgataggtgg tatgatgcat
    1301  tgcagaagga atatgaacta gatgatacgc agatgacagt agtcgcaagc
    1351  ggactcatgg tttgggtcat agagaacgga tgctcaccca atattaatgg
    1401  tgtttggaca atgatggatg gagatgagca aaggaaattt ccactcaagc
    1451  ccgttattga atatgcatct ccaacattca gacagataat gcaccacttt
    1501  agtgatgcag ctgaagcgta catagagtat cggaactcga cagagcgtta
    1551  catgccaaga tacggacttc agcgaaactt aaccgactat aacctagccc
    1601  gatacgcatt cgatttctat gaaataactt cgcgtacacc agcgagagct
    1651  agagaggccc acatgcagat gaaagcagca gcagtgcgtg gatcaaacac
    1701  gcgcatgttt ggcttggatg ggaatgtcgg tgagagtcag gagaatacag
    1751  aacgtcacac agctggtgat gtgagtcgca atatgcactc ccttcttgga
    1801  gtgcaacagc atcactgatg tactgagatc ttcattgcag ttttaagagt
    1851  attttatata tttactattt cagtgagggt ctccctcctt agtattatat
    1901  atgtacttta gaaatagtag tcattctgca ggggagtgag gtttacctcc
    1951  aaccctatgg ttactatttc ctactagcgt cgaactacat tacggacacc
    2001  ctgttgtgtg gttctaccac gagtcaggag ctgcgagtat tgtagcaaga
    2051  gac