Sequence of DPV Arabis mosaic virus

Arabis mosaic virus RNA for coat protein, partial cds.

ACC No: D10086

Dated: 2003-05-21 | Length: 2406 | CRC: -1121204961

                !!NA_SEQUENCE 1.0
ID   AMVCP      standard; RNA; VRL; 2406 BP.
XX
AC   D10086;
XX
SV   D10086.1
XX
DT   15-MAY-1992 (Rel. 31, Created)
DT   21-MAY-2003 (Rel. 75, Last updated, Version 4)
XX
DE   Arabis mosaic virus RNA for coat protein, partial cds.
XX
KW   .
XX
OS   Arabis mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Comoviridae;
OC   Nepovirus; Subgroup A.
XX
RN   [1]
RP   1-2406
RX   MEDLINE; 91341466.
RX   PUBMED; 1875193.
RA   Bertioli D.J., Harris R.D., Edwards M.L., Cooper J.I., Hawes W.S.;
RT   "Transgenic plants and insect cells expressing the coat protein of arabis
RT   mosaic virus produce empty virus-like particles";
RL   J. Gen. Virol. 72:1801-1809(1991).
XX
DR   GOA; Q65028; Q65028.
DR   SPTREMBL; Q65028; Q65028.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2406
FT                   /db_xref="taxon:12271"
FT                   /mol_type="genomic RNA"
FT                   /organism="Arabis mosaic virus"
FT                   /isolate="lilac"
FT                   /segment="2"
FT   CDS             <1. .2214
FT                   /codon_start=1
FT                   /db_xref="GOA:Q65028"
FT                   /db_xref="SPTREMBL:Q65028"
FT                   /product="polyprotein"
FT                   /protein_id="BAA00982.1"
FT                   /translation="TGQNTLEILYNPVADEEMDDYRDRGMSAVVIDALEIAINPFGMPG
FT                   NPTDLTVVATYGHERNMERAFIGSSSTFLGNGLARRIFFPGLQYSQEEPRRESLIRCMS
FT                   LLPTPTVDADSILAAISVGTLRHDIGSLHNRTVASSVHAAQVQGTTLRATMMGNAVVVS
FT                   PEGSLVTGTPEANVQIGGGSSMRMVGPLAWENVEEPGQTFTIRNRSRSMRVDRNADAGV
FT                   AFPRMRTTTRGLAGRGSVQVPKDCQAGKYLKTLDLRDMVSGFSGIQYEKWITAGIVMPD
FT                   FKVVIRYPANAFTGITWVMSFDAYNRITSSISTTASPAYTLSVPHWLLHHRNGTTSCDL
FT                   DYGELCGHAMWFGATTFESPKLHFTCLTGNNKELAADWEFVVELYAEFEAAKSFLGKPN
FT                   FIYSADAFNGSLKYLTIPPLEYDLSATSAYKSVSLLLGQTLVDGTHKVYNFNNTLLSYY
FT                   LGIGGVVKGKVHICSPCTYGIVLRVVSEWNGVTNNWNQLFKYPGCYIDEDGNFEIEIRS
FT                   PYHRTPLRLLDAQSASSFTSTLNFYAISGPIAPSGETAKMPVVVQIDEIALPDLSVPSF
FT                   PNDYFLWVDFSSFTVDVEEYVIGSRFFDISSTTSTVALGDNPFAHMIACHGLHHGILDL
FT                   KLMWDLEGEFGKSSGGVTITKLCGDKATGMDGASRVCALQSMGCETELYIGNYAGANPN
FT                   TSLSLYSRWLAIKLDKAKSMKMLRILCKPRGNFEFYGRTCFKV"
FT   mat_peptide     697. .2211
FT                   /product="coat protein"
XX
SQ   Sequence 2406 BP; 599 A; 467 C; 587 G; 753 T; 0 other;

    D10086  Length: 2406  May 28, 2003 14:38  Type: N  Check: 34  ..

       1  accggacaaa atactttaga aatattgtat aatccggtgg cagacgagga
      51  aatggatgac taccgtgaca ggggtatgtc agcagtcgtg attgatgcgc
     101  tggaaattgc tatcaatcct tttggaatgc caggcaatcc gactgacctt
     151  actgtcgtgg caacatacgg gcatgaacgt aatatggaac gtgcctttat
     201  tggctcttct tcaaccttcc ttggaaatgg gttggcgaga cgtattttct
     251  tccctggttt gcaatatagc caggaagaac ctaggcgcga atctttgatt
     301  cgctgtatgt cgcttctacc aacgcccact gttgatgctg attctatatt
     351  ggcagccatc agtgttggca ctctgcgcca cgacattggt tcgcttcaca
     401  atagaacagt ggcgagttct gtccatgctg ctcaagtgca gggcactact
     451  ttgagggcta caatgatggg taatgccgtt gttgtttctc ctgagggaag
     501  tcttgtcact ggaacccctg aggccaatgt tcaaattgga ggtggttcga
     551  gtatgcgaat ggttggtccc ctggcatggg aaaatgttga agaaccaggg
     601  cagaccttta ctataagaaa ccgctcccgg tctatgcgag tagatcggaa
     651  tgctgatgcg ggtgttgctt ttcctagaat gaggacaacc acgcggggac
     701  ttgctggtag aggctctgtc caggtgccta aagattgcca ggcgggaaaa
     751  tatctgaaaa ccctcgattt gagagacatg gtaagtgggt tttccggtat
     801  tcaatatgaa aagtggatca ccgctggaat tgtaatgcca gattttaagg
     851  tggtgattcg ctatccggcc aatgccttta caggtattac gtgggttatg
     901  agttttgatg cttacaaccg aattactagc agtatttcaa ctactgctag
     951  tcccgcgtat acactgtctg tccctcattg gcttttgcat catagaaatg
    1001  ggaccacctc atgtgacctt gactatggag aactctgtgg tcatgctatg
    1051  tggtttgggg ctactacttt tgaaagtcct aaattgcact tcacatgcct
    1101  cactggaaac aataaagaat tggcagcgga ttgggagttc gttgtcgaat
    1151  tatatgctga gtttgaggca gcaaaaagct ttcttgggaa acccaacttc
    1201  atttatagcg ctgatgcttt taatgggtct ctcaagtatc tgacaatccc
    1251  gccactggaa tatgacctaa gtgcgaccag tgcctataag agtgtgtccc
    1301  tattgttggg tcaaactctt gttgatggta ctcataaggt gtataatttc
    1351  aataatacac ttttgagtta ctatcttggt attggtggtg tggtcaaggg
    1401  taaagtgcat atatgtagcc cttgtactta tggcatagtc ctaagagttg
    1451  ttagtgaatg gaacggggtc actaacaact ggaaccaatt gtttaaatac
    1501  cccgggtgtt acatcgatga ggatgggaac tttgaaattg aaatacgttc
    1551  tccctatcat cgaactccac tgcgtttact tgatgcacaa tcagcgagct
    1601  cgtttacgag tacgctgaat ttttacgcta tatctgggcc aattgccccc
    1651  agtggagaaa cagctaagat gccagtggta gtgcagattg acgaaattgc
    1701  attaccggat ctttctgtgc catccttccc caatgattat ttcctatggg
    1751  ttgatttctc ttcattcact gtggatgttg aggagtatgt gatagggtcg
    1801  aggttttttg atatttcctc tactactagc actgtagccc ttggagacaa
    1851  tccttttgct cacatgatag cttgtcatgg gctccatcac ggaatccttg
    1901  atcttaagtt aatgtgggat ttggagggtg agtttggaaa gagttctggt
    1951  ggtgttacca tcacaaaatt atgtggtgat aaggctactg gcatggatgg
    2001  agcatctcgt gtttgtgctt tgcaaagcat gggatgcgaa actgaactgt
    2051  acataggaaa ttatgcgggt gctaacccaa atacttcatt gtctttgtac
    2101  agccgatggc ttgctattaa gctcgataaa gctaagagta tgaaaatgct
    2151  cagaattttg tgcaagccca gggggaactt tgaattttac ggcagaacat
    2201  gttttaaagt ttaggtttcc tctcacgtct tgagaatctg acgttaaaag
    2251  acttactttg tattatattt attttagctt gtttactgct tttgtgtgtt
    2301  taatttcatg catttagtgg cgacagtgtg ttgtttgtcc tttggacaca
    2351  cttgccttgt tggacgcaaa aagattttat tttcttttta ctgcttttat
    2401  aaattt