ID   D14995; SV 2; linear; genomic RNA; STD; VRL; 6495 BP.
XX
AC   D14995; S47260;
XX
DT   21-DEC-1993 (Rel. 38, Created)
DT   26-FEB-2008 (Rel. 94, Last updated, Version 7)
XX
DE   Apple stem grooving virus genome, complete sequence.
XX
KW   NTP-binding helicase; RNA-dependent RNA polymerase; serine protease.
XX
OS   Apple stem grooving virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Flexiviridae;
OC   Capillovirus.
XX
RN   [1]
RP   1-6495
RA   Yoshikawa N.;
RT   ;
RL   Submitted (14-APR-1993) to the EMBL/GenBank/DDBJ databases.
RL   Contact:Nobuyuki Yoshikawa Iwate University, Bioscience and Technology;
RL   Ueda 3-18-8, Morioka, Iwate 020, Japan
XX
RN   [2]
RA   Yoshikawa N., Sasaki E., Kato M., Takahashi T.;
RT   "The nucleotide sequence of apple stem grooving capillovirus genome";
RL   Virology 191:98-105(1992).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6495
FT                   /organism="Apple stem grooving virus"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:28347"
FT   CDS             36..6353
FT                   /codon_start=1
FT                   /product="241k polyprotein"
FT                   /note="contains two consensus sequences associated with
FT                   RNA-dependent RNA polymerase and NTP-binding helicase."
FT                   /db_xref="GOA:P36309"
FT                   /db_xref="InterPro:IPR000606"
FT                   /db_xref="InterPro:IPR001788"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="InterPro:IPR008745"
FT                   /db_xref="InterPro:IPR008879"
FT                   /db_xref="UniProtKB/Swiss-Prot:P36309"
FT                   /protein_id="BAA03639.1"
FT                   /translation="MAFTYRNPLEIAINKLPSKQSDQLLSLTTDEIEKTLEVTNRFFSF
FT                   SITPEDQELLTKHGLTLAPIGFKSHSHPISKMIENHLLYICVPSLLSSFKSVAFFSLRE
FT                   NKVDSFLKMHSVFSHGKIKSLGMYNAIIDGKDKYRYGDVEFSSFRDRVIGLRDQCLTRN
FT                   KFPKVLFLHDELHFLSPFDMAFLFETIPEIDRVVATTVFPIELLFGDKVSKEPRVYTYK
FT                   VHGSSFSFYPDGVASECYEQNLANSKWPFTCSGIQWANRKIRVTKLQSLFAHHVFSFDR
FT                   GRACNEFNHFDKPSCLLAEEMRLLTKRFDKAVINRSTVSSLSTYMACLKTANAASAVAK
FT                   LRQLEKRDLYPDELNFVYSFGEHFKNFGMRDDFDVSVLQWVKDKFCQVMPHFIAASFFE
FT                   PTEFHLNMRKLLNDLATKGIEVPLSVIILDKVNFIETRFHARMFDIAQAIGVNLDLLGK
FT                   RFDYEAESEEYFSENGYIFMPSKSNPERNWILNSGSLKIDYSRLVRARRFRLRRDFLDP
FT                   ISKGKSPRKQLFLESTGNIKSNPNAEKNSESGEIKIEGSAENDQPHEVSHTSMETEDGQ
FT                   GFEGSIPVDLINCFEPEEIKLPKRRRKNDCVFKAISAHLGIDSQDLLNFLVNEDISDEL
FT                   LDCIEEDKGLSHEMIEEVLITKGLSMVYTSDFKEMAVLNRKYGVNGKMYCTIKGNHCEL
FT                   SSKECFIRLLKEGGEAQMSNENLNADSLFDLGRFVHNRDRAVKLAKSMARGTTGLLNEF
FT                   DLEFCKNMVTLSELFPENFSSVVGLRLGFAGSGKTHKVLQWINYTPSVKRMFISPRRML
FT                   ADEVEPQLKGTACQVHTWETALKKIDGTFMEVFVDEIGLYPPGYLTLLQMCAFRKIVKG
FT                   QSENFLKGKLLELSKTCLNIRCFGDPLQLRYYSAEDTNLLDKTHDIDLMIKTIKHKYLF
FT                   QGYRFGQWFQELVNMPTRVDESKFSRKFFADISSVKTEDYGLILVAKREDKGVFAGRVP
FT                   VATVSESQGMTISKRVLICLDQNLFAGGANAAIVAITRSKVGFDFILKGNSLKEVQRMA
FT                   QKTIWQFIIEGKSIPMERIVNMNPGASFYESPLDVGNSSIQDKASNDLFIMPFINLAEE
FT                   EVDPEEVVGDVIQPVEWFKCHVPVFDTDPTLAEIFDKVAAKEKREFQSVLGLSNQFLDM
FT                   EKNGCKIDILPFARQNVFPHHQASDDVTFWAGVQKRIRKSNWRREKSKFEEFESQGKEL
FT                   LQEFISMLPFEFKVNIKEIEDGEKSFLEKRKLKSEKMWANHSERSDIDWKLDHAFLFMK
FT                   SQYCTKEGKMFTEAKAGQTLACFQHIVLFRFGPMLRAIESAFLRSCGDSYYIHSGKNFF
FT                   CLDSFVTKNASVFDGFSIESDYTAFDSSQDHVILAFEMALLQYLGVSKEFQLDYLRLKL
FT                   TLGCRLGSLAIMRFTGEFCTFLFNTFANMLFTQLKYKIDPRRHRILFAGDDMCSLSSLK
FT                   RRRGERATRLMKSFSLTAVEEVRKFPMFCGWYLSPYGIIKSPKLLWARIKMMSERQLLK
FT                   ECVDNYLFEAIFAYRLGERLYTILKEEDFEYHYLVIRFFVRNSKLLTGLSKSLIFEIGE
FT                   GIGSKWLSSTSTASSRRSNLQTSKLMLSRPQSFTRMQPFSNQTCLIASKGLNQTSRFPL
FT                   DLVTASSCLISNCLMTPKLIQSGRKATSTNTYTMESSWLGSKQCCQTLEAWKGESLYMM
FT                   EPAWIRKEATFARIFSSLSLTVATLVSGQSTVCLPQTQIWPKGLDFVWTLIVHNMNRTL
FT                   SCLLLTLELHTDASTLQGFWKPKLAIQDGLHRQSAAVKHLNSMRKSRWPSWIADPRCFW
FT                   KKVHQTCTLKRDCSEVTRLEGHAQFPLKGGQTQGCKKREDLGPSRLELKDLEKMSLEDV
FT                   LQQARRHRVGVYLWKTHIDPAKELLTVPPPEGFKEGESFEGKELYLLLCNHYCKYLFGN
FT                   IAVFGSSDKTQFPAVGFDTPPVHYNLTTTPKEGETDEGRKARAGSSGEKTKIWRIDLSN
FT                   VVPELKTFAATSRQNSLNECTFRKLCEPFADLAREFLHERWSKGLATNIYKKWPKAFEK
FT                   SPWVAFDFATGLKMNRLTPDEKQVIDRMTKRLFRTEGQKGVFEAGSESNLELEG"
FT   CDS             4787..5749
FT                   /codon_start=1
FT                   /product="36K protein"
FT                   /note="contains consensus sequence found in the active site
FT                   of several cellular and viral serine proteases."
FT                   /db_xref="GOA:P36698"
FT                   /db_xref="InterPro:IPR001022"
FT                   /db_xref="InterPro:IPR001792"
FT                   /db_xref="InterPro:IPR001815"
FT                   /db_xref="UniProtKB/Swiss-Prot:P36698"
FT                   /protein_id="BAA03640.1"
FT                   /translation="MAIVNVNRFLKEVESTDLKIDAISSSELYKDATFFKPDVLNCIKR
FT                   FESNVKVSSRSGDGLVLSDFKLLDDTEIDSIRKKSNKYKYLHYGVILVGIKAMLPNFRG
FT                   MEGRVIVYDGACLDPKRGHICSYLFKFESDCCYFGLRPEHCLSTTDANLAKRFRFRVDF
FT                   DCPQYEQDTELFALDIGVAYRCVNSARFLETKTGDSGWASQAISGCEALKFNEEIKMAI
FT                   LDRRSPLFLEEGAPNVHIEKRLFRGDKVRRSRSISAKRGPNSRVQEKRGFRSLSARIER
FT                   FGKNEFGRRASASEAPPGRSISMEDSHRPGKGTSDGSSP"
FT   polyA_site      6495
XX
SQ   Sequence 6495 BP; 1985 A; 1196 C; 1496 G; 1818 T; 0 other;

d14995 Length: 6495  26-FEB-2008  Type: N  Check: 2820  ..
       1  aaatttaaca ggcttaattt ccgcgcttta cgtcaatggc tttcacttac
      51  agaaaccccc tcgaaattgc aatcaacaaa cttcctagta agcagtctga
     101  tcaactgctt tccttgacca ccgacgagat tgaaaagacc ttagaagtga
     151  ccaaccgctt cttctctttt tcaatcacac cagaagatca agaattgttg
     201  actaagcatg gtctaacact tgcacctata gggtttaagt cacactccca
     251  tccaatatcc aaaatgatag aaaatcatct cctgtatata tgtgttccga
     301  gtcttttatc ctcctttaag tcagttgcct ttttttcact tagggaaaat
     351  aaagtagaca gttttcttaa gatgcattca gtcttttccc atggaaaaat
     401  taaatctttg gggatgtaca atgctataat tgatgggaaa gataaatata
     451  ggtatggtga tgtagagttt tcatctttta gggatagagt gattggtctt
     501  agagatcaat gccttacacg taacaaattt ccaaaagttc tgtttcttca
     551  cgacgagttg cactttctaa gtccatttga catggctttc ctatttgaga
     601  caatcccaga aattgataga gttgttgcaa ccacagtttt tccaatagaa
     651  cttttattcg gggacaaggt ctctaaggaa cccagggttt atacctacaa
     701  ggtccatggc tcttcatttt cattttatcc ggatggtgtt gcctctgagt
     751  gttacgaaca gaatttggca aattctaaat ggcccttcac ctgcagcggc
     801  atacaatggg ctaacaggaa aattagggta accaagctac agagtctctt
     851  cgcccatcat gttttctcat ttgacagggg gagggcttgt aatgaattta
     901  atcatttcga caaacctagc tgtctacttg cggaagaaat gcgccttttg
     951  accaaaaggt ttgataaagc agttattaac agaagcacag tctcttccct
    1001  cagtacatac atggcttgtc ttaaaactgc aaatgcggct tcagctgttg
    1051  ccaagctgag gcagttggag aagagggatc tttacccaga tgagttgaac
    1101  ttcgtctatt cctttggaga gcatttcaaa aattttggga tgagagatga
    1151  ctttgatgtg tcagttctac aatgggtcaa agacaaattt tgccaggtca
    1201  tgcctcactt catcgccgcc agtttctttg aaccaacaga atttcattta
    1251  aacatgcgca aattgttgaa tgatctggct actaaaggaa tagaggttcc
    1301  cctttctgtg atcatcctgg acaaagtcaa cttcatagag accagatttc
    1351  atgccaggat gttcgacata gcacaggcaa tcggggtgaa cctagattta
    1401  ctggggaaaa gatttgatta tgaggctgag agtgaagagt acttttcaga
    1451  gaacggttac atctttatgc cctctaaatc aaatccagag agaaattgga
    1501  ttctaaattc cggttcgctg aaaattgact attcaagatt ggtaagagcc
    1551  aggagattta gattgagaag agatttccta gatcccatat ctaaaggaaa
    1601  atcccctaga aaacaactct tcttggagtc aacgggaaac attaaatcaa
    1651  atcccaatgc tgaaaaaaat agcgagagcg gcgaaataaa gattgaaggc
    1701  agtgccgaaa atgatcagcc acatgaggta tcacatactt caatggaaac
    1751  cgaggatgga cagggttttg aaggttcaat accagttgat ttaatcaatt
    1801  gctttgaacc agaagaaatc aagcttccaa agagaagaag gaaaaatgat
    1851  tgcgtcttca aggccatctc tgcacacttg gggattgact ctcaagattt
    1901  gttgaatttt ttggtaaatg aagacatatc agatgaatta cttgattgca
    1951  ttgaagagga caaaggactg tcacatgaaa tgattgaaga agttttgatc
    2001  acaaagggtc tttcaatggt ttatacttct gacttcaaag aaatggcagt
    2051  tcttaataga aagtatggag tgaatggcaa gatgtactgc acaattaaag
    2101  gcaatcactg cgagctgagt tccaaagagt gcttcatcag attattgaaa
    2151  gaaggtggtg aagcgcagat gtcaaatgaa aatctaaatg ctgattcctt
    2201  gttcgacctt ggaagatttg tgcataatag agacagggct gtcaagctag
    2251  caaaatcaat ggcaagaggc acaacaggcc tcctgaatga attcgaccta
    2301  gaattctgca agaacatggt gaccctttcg gagttgtttc ctgaaaactt
    2351  ttcttctgtt gtcgggctaa ggcttgggtt tgcgggttct ggtaaaacgc
    2401  ataaggtgct tcaatggatt aattacactc caagtgtcaa aagaatgttt
    2451  ataagtccaa ggagaatgct ggcggatgaa gttgaacctc aactcaaggg
    2501  aacggcctgt caggtgcata catgggagac tgcacttaaa aaaatcgacg
    2551  gaacttttat ggaagttttt gttgatgaaa taggtttgta cccacctgga
    2601  taccttacac tgctacagat gtgtgctttc agaaagattg ttaagggaca
    2651  aagtgaaaat ttcttgaaag gcaaactgtt ggaattgtca aagacttgct
    2701  taaacataag atgttttggt gatccattgc aattaaggta ttactcagct
    2751  gaagacacca atctattgga caaaacacat gatattgacc tcatgatcaa
    2801  gacgatcaag cacaaatatc ttttccaagg gtacaggttc ggtcagtggt
    2851  ttcaagaact ggtgaacatg cccactagag tggatgagtc gaaattctca
    2901  aggaagttct ttgcagacat ttcaagtgta aaaactgaag attacggact
    2951  catcctagtt gccaagagag aagataaagg tgtcttcgct ggaagagttc
    3001  ctgtagcaac agtgagtgaa tctcagggaa tgaccattag caaaagggtg
    3051  ttgatatgtt tggaccaaaa tctttttgcc gggggagcca atgcagccat
    3101  tgttgcaata acaagatcaa aggtcggctt tgactttatc cttaaaggga
    3151  attcattgaa agaggtacag aggatggcac aaaagacaat ttggcagttc
    3201  atcattgaag ggaagtctat tccgatggag aggatagtga acatgaatcc
    3251  tggagccagc ttttatgaga gtcctttgga tgttggaaat tcatcaattc
    3301  aagacaaagc ttctaatgac ctgttcataa tgccttttat aaatttggct
    3351  gaggaagaag ttgacccaga ggaagttgtt ggggacgtaa ttcaacctgt
    3401  tgagtggttc aaatgtcatg tgcctgtctt cgacacagat ccgacgcttg
    3451  cggagatttt tgataaggtt gcagcaaaag aaaaaaggga attccagtct
    3501  gtgctgggtc tttcaaatca atttcttgac atggaaaaga atggatgcaa
    3551  aatagacatc ttgccctttg cgcgacaaaa tgtttttcca catcatcaag
    3601  cgtctgatga tgttactttc tgggcaggtg ttcaaaaaag aattagaaag
    3651  tcgaactgga gaagggagaa atcgaagttt gaggaatttg aaagccaagg
    3701  gaaagaactt cttcaagaat tcatctcaat gctaccgttt gaattcaaag
    3751  tgaatatcaa ggagattgaa gatggagaga agagcttttt agaaaaaaga
    3801  aagctaaaat ctgagaaaat gtgggcaaat cattcggaga gatcagacat
    3851  tgactggaaa cttgaccacg cctttctctt catgaaatca caatattgca
    3901  cgaaggaagg gaagatgttc accgaagcta aagctggcca aactttggcc
    3951  tgtttccaac atatagtcct atttagattt ggacccatgt tgagagcaat
    4001  tgaaagtgcc tttttgagaa gctgtggaga ctcatactac atacactccg
    4051  ggaaaaactt cttctgcctg gatagctttg tgacaaagaa tgcaagtgtc
    4101  tttgatggat tttcaattga gtcagactac acggcctttg actcatctca
    4151  ggaccacgtc atattggcct ttgaaatggc actgttacaa tacctgggcg
    4201  tgtcaaagga gtttcagcta gattacctta gactgaaatt aactctcgga
    4251  tgccgtctcg gatcactagc aataatgagg ttcacaggag aattttgcac
    4301  tttcttattc aacacatttg ccaatatgct gtttactcaa ttgaagtaca
    4351  agatagaccc aaggaggcat aggattttat ttgctgggga cgatatgtgt
    4401  tccttgagct ctctcaaaag aaggagaggg gagagagcga caagattgat
    4451  gaagagcttt tccctaactg cagtagaaga ggtgagaaaa ttcccaatgt
    4501  tttgtggatg gtacttaagt ccatatggta tcattaaatc tccaaaattg
    4551  ctgtgggcca ggatcaagat gatgagtgag agacagcttt tgaaggaatg
    4601  tgttgataat tacctatttg aggcgatatt tgcctacaga ttaggtgaga
    4651  ggctttacac aattttgaaa gaagaggatt ttgaatacca ttatcttgtc
    4701  ataagatttt ttgttagaaa ttcaaaattg ttaacagggt tgagcaaaag
    4751  cttgatattt gaaattgggg agggcatcgg gtccaaatgg ctatcgtcaa
    4801  cgtcaaccgc ttcctcaagg aggtcgaatc tacagacctc aaaattgatg
    4851  ctatctcgtc ctcagagctt tacaaggatg caaccttttt caaaccagac
    4901  gtgcttaatt gcatcaaaag gtttgaatca aacgtcaagg tttcctctcg
    4951  atctggtgac ggcctcgtcc tgtctgattt caaactgctt gatgacaccg
    5001  aaattgattc aatcaggaag aaaagcaaca agtacaaata cttacactat
    5051  ggagtcatcc tggttgggat caaagcaatg ttgccaaact ttagaggcat
    5101  ggaagggaga gtcattgtat atgatggagc ctgcctggat ccgaaaagag
    5151  gccacatttg ctcgtatctt ttcaagtttg agtctgactg ttgctacttt
    5201  ggtctcaggc cagagcactg tttgtctacc acagacgcaa atttggccaa
    5251  aaggtttaga tttcgtgtgg actttgattg tccacaatat gaacaggaca
    5301  ctgagttgtt tgctcttgac attggagttg catacagatg cgtcaactct
    5351  gcaaggtttt tggaaaccaa aactggcgat tcaggatggg cttcacaggc
    5401  aatcagcggc tgtgaagcac ttaaattcaa tgaggaaatc aagatggcca
    5451  tcctggatcg cagatccccg ctgtttctgg aagaaggtgc accaaacgtg
    5501  cacattgaaa agagattgtt cagaggtgac aaggttagaa ggtcacgctc
    5551  aatttccgct aaaagggggc caaactcaag ggtgcaagaa aagagaggat
    5601  ttaggtccct ctcggctaga attgaaagat ttggaaaaaa tgagtttgga
    5651  agacgtgctt cagcaagcga ggcgccaccg ggtaggagta tatctatgga
    5701  agactcacat agacccggca aaggaacttc tgacggttcc tccccctgaa
    5751  ggatttaagg aaggtgaaag ctttgagggc aaagagcttt accttcttct
    5801  ttgcaaccat tactgtaaat acttgttcgg taatattgct gtctttgggt
    5851  catctgataa gacccagttt cccgctgttg gatttgatac acctccggtt
    5901  cattataatt tgacaacgac cccaaaggaa ggggagactg acgaaggaag
    5951  gaaggccaga gcgggttcgt ctggcgaaaa aacaaaaatt tggaggatcg
    6001  atttgtcaaa tgttgttcct gaattgaaaa cctttgctgc cacttccagg
    6051  cagaactctt tgaacgaatg tacgttcaga aagctttgcg agccatttgc
    6101  cgatttggct cgagaatttc tacatgaaag gtggtctaag ggattggcca
    6151  ccaatattta caagaaatgg cccaaagctt tcgaaaaaag tccatgggtg
    6201  gcctttgatt ttgccactgg tctgaaaatg aatcgtctaa cacctgatga
    6251  gaaacaggtg attgatagaa tgaccaaaag actttttcgt actgaaggac
    6301  aaaaaggggt tttcgaggca ggttcggaaa gtaacctgga actggagggt
    6351  taggagtcgt gtgaaattcc gcaaacttgg tcgcggtctt gcaggttgac
    6401  atgcctgcct ttatacttaa ttaaagggtt cccccggttt tctgagcatt
    6451  tccgggttag tgtggttttt ctagagtcta gagtttgtcc actct