Sequence of DPV Sweet potato mild mottle virus
Sweet potato mild mottle virus isolate ARU60 polyprotein gene, partial cds.
ACC No: FJ999764
Dated: 2010-03-19 | Length: 7781 | CRC: -1727173875
ID FJ999764; SV 1; linear; genomic RNA; STD; VRL; 7781 BP.
XX
AC FJ999764;
XX
DT 25-JAN-2010 (Rel. 103, Created)
DT 19-MAR-2010 (Rel. 104, Last updated, Version 2)
XX
DE Sweet potato mild mottle virus isolate ARU60 polyprotein gene, partial cds.
XX
KW .
XX
OS Sweet potato mild mottle virus
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC Ipomovirus.
XX
RN [1]
RP 1-7781
RX PUBMED; 19923261.
RA Tugume A.K., Mukasa S.B., Kalkkinen N., Valkonen J.P.;
RT "Recombination and selection pressure in the ipomovirus sweet potato mild
RT mottle virus (Potyviridae) in wild species and cultivated sweetpotato in
RT the centre of evolution in East Africa";
RL J. Gen. Virol. 91(PT 4):1092-1108(2010).
XX
RN [2]
RP 1-7781
RA Tugume A.K., Mukasa S.B., Valkonen J.P.T.;
RT ;
RL Submitted (30-APR-2009) to the EMBL/GenBank/DDBJ databases.
RL Department of Applied Biology, University of Helsinki, PO Box 27
RL (Latokartanonkaari 7), FIN-00014, Helsinki, Finland
XX
FH Key Location/Qualifiers
FH
FT source 1..7781
FT /organism="Sweet potato mild mottle virus"
FT /host="Ipomoea batatas"
FT /isolate="ARU60"
FT /mol_type="genomic RNA"
FT /country="Uganda:Arua"
FT /collection_date="01-Apr-2007"
FT /db_xref="taxon:41459"
FT 5'UTR 1..139
FT CDS 140..>5867
FT /codon_start=1
FT /product="polyprotein"
FT /note="coding region disrupted by sequencing gap"
FT /db_xref="UniProtKB/TrEMBL:D2T0C2"
FT /protein_id="ACS29485.1"
FT /translation="MGKSKLTYKQCIAKWGKAALEAHNNGSARNVSVGTHQIAANIFAF
FT YDAKDYHLFAMGKRGGLTPAAEQLRIAIERGTIYKVQYDCHFCPDCDAIVDSEEGWFCE
FT DCGEQFNKRDDNVLGNKNDVARALGGWTEYEDATWASFEVAKADMFEVAPTVGQLEKEI
FT RAIEKSAGKRLTAYEEEMIEELAYKLDVAKMQEEEQEKVLEETDFSISNDEFPTLGGPQ
FT DGVVNETTEEHDKESVVEVAREVEQAGEFKTTHERTNEPISDVAETHVVATPVATLGVT
FT DFGAAISKEKLVEKPKTVMWVAKSKAPAAVPATPSKTAVWISKPKLADVLSTVEPVAKP
FT PLRTYSDVISIGSMVCPIMIAAIESHSHQNEKLNGFPNAQVIDGPKEEEPVIKYNITFG
FT SFNYEVSAKGEQVQAAVKLDEMLEGPDTEPILICQTGSSHKHETKKAAKGLLVQDRFSV
FT IGNKVLCKSFPAFKNFMNETRLGGIYRTRKGSYKNAALRLLKATKVQVFYDGIKDIFEC
FT PYCHVSSNELEGLNGDNCEKCKDLFYKHIDDPRKVEEEYLMVPLVPIDQHVHEEHSVIS
FT KAKWEAHESICEGEANIVKIFNGKSTASKKSFKTKQALNVANIPLDDFMQELVEICLER
FT NTPIEIIGGVKSFNVVKLRHTTRDISKSGEDDMYPTEREWFCHTHKLCLCGRIDRGKKI
FT RSFEVRPGWSGVILHRNQVAECDWNKFVFIDDICVVQGRNLITDKIENALEKKGANRLK
FT QMQFYASFIIPNFKDEFDRASRLKADHEPDESSNNELIGRLAKLVAAVIPKGHLYCKTC
FT CFRVIKSKRIDIVNALNKAKQRGERDEFIYDELIKLFELQAPPPYKIASITDEGDVLRT
FT LGLSGDLYNGRLSLIMQHLQGLHTSISMLHQSLAGAQNDQQIDRQALHNQVRILHQRNE
FT EHMPFLKKAVDEIQLLNATDQVANARELYLDTRATSTGDFDILRKYQSIHEFFPNIMSR
FT ANKAGMAVIKSETSLSKAFALMNSAKSMDAIHTLIGEDVMEHISGACLLKNDKTLFSIG
FT CKQGVNGSKMYGPLCPTKQHVRIHRVESNMQIPMPTFHDATVWEFNEGYCYANQFAIMV
FT GFINEDEMEFYKNQMNQIVLNLGAWPTFEQYLIELRAISLDYPKVRGCPAAIHLVSHVN
FT KLIHVMGQFGTINQGWHALEVATVGELVDLCHKKVEGEMLTYKVGGIYDWVTKKNAFID
FT LFEQHPENIFKICTSPSVLWLFARSCEKHDFINYIMARDHSLVGLFIKLEYVGKHLHIF
FT QSVDDVCVEYAASMREIIEEHADIHGLRDSVVDRMVHAYHNEVREANKYELVDRILEKN
FT IGSIAKEISSRKLITMYHQDIFSWHEWQRLKLSAHSLNAQKLFDEANERAYGKQSWNLR
FT VIWGACKEVLYAVTRGVCVRVKGTTVRCADAVVYGFYGRTRAMVTSWASEAWGAIFTSC
FT LRALVVMVVTAYISTWIPKIRKMIKREKKQFEELGGGELYVEQHGKREEAFLFKICPIF
FT ALIAGIVDYEWGAAACATMNKVKSICTVMGSVGIESHANKPDDKVEQDLKESLKFTSFE
FT IEVPNEVLSQDDMTFERGFQHQIQYGNVCADPIYSGPLRMLAITENYARELAMNIRTSG
FT ETDLRVYSGVGGGKSTRLPKELSMFGHVLICVPTRVLAESLLTSFMVLFNMDVNVAYRG
FT RIHTGNAPITIMTYGYAFNFLVHKPLELNRYDYVLLDEIHTNPVEFAPLFSFIKTTDPK
FT KKIVKLSATHAGMDCECETRHKIKVETLSEMPIESWVSMQGSGVVGDATSVGDVILVFV
FT ASFKDVDTCANGLRSKGFKVLKVDSRNFRRDADVDKQIQSLGEGKKFIVATNIIENGVT
FT FNIDVVVDFGEKISPNLSSDERCITLGRQRISRAE"
FT mat_peptide 140..2413
FT /product="P1 proteinase"
FT mat_peptide 2414..3775
FT /product="helper component proteinase; HC-Pro"
FT mat_peptide 3776..4657
FT /product="P3 protein"
FT mat_peptide 4658..4813
FT /product="6K1 protein"
FT mat_peptide 4814..>5867
FT /product="cytoplasmic inclusion protein; CI"
FT gap 5868..5967
FT /estimated_length=unknown
FT CDS <5968..7473
FT /codon_start=1
FT /product="polyprotein"
FT /note="coding region disrupted by sequencing gap"
FT /db_xref="UniProtKB/TrEMBL:D2T0C3"
FT /protein_id="ACS29486.1"
FT /translation="GNNSGQPSTVVDNTLILMIAMEYAISKVFVTRPEIKYVCNGDDLL
FT INCPRSTANAISEYFKDAFADLSLNYDFDHICDEITSVDFMSHSFMWLDDEQMYIPKLD
FT KERIVAILEWERSDEQFRTRSALNAAYIESFGYEDLMNEIENFASFWAKEHGLENVLME
FT REKVRDLYINEDFDASAFEKFYPETFSPFDVYVEPHASTSKTIEELQQEMEDLDADTTI
FT TVVQRETQKARIRDQIETLRAQQIVRPSEAQLQPDVTPTQIVTFEPPRVTGFGALWIPR
FT QQRSYMTPAYVEKIKAYVPHSNLIESGLASEAQLTSWIESTCRDYQVSMDVFMTTVLPA
FT WIVNCIINGTSQERTNEHTWRAVVMANMEDQEVLYYPIKPIIVNAQPTLRQVMRHFGEQ
FT AVAQYMNSLQVGKPFTVKGAVTAGYANVQDAWLGIDFLRDTMPLTTKQMEVKHQIIAAN
FT VTRRKIRVFALAAPGDGDELDTERHVVDDVARGRHSLRGAQLD"
FT mat_peptide <5968..6564
FT /product="nuclear inclusion body protein; NIb"
FT mat_peptide 6565..7470
FT /product="coat protein; CP"
FT 3'UTR 7474..7781
XX
SQ Sequence 7781 BP; 2365 A; 1364 C; 1905 G; 2047 T; 100 other;
fj999764 Length: 7781 19-MAR-2010 Type: N Check: 7074 ..
1 taaataaaaa tgatatagag aaaatttgaa atacaagcac gaattcagcg
51 aacttatttg caattcagct ttactttaaa cactttattt taattttatc
101 gaactgtggt gtccgcttag aaacggatca gacgcagaaa tggggaaatc
151 caaactcact tacaaacaat gcattgctaa gtggggaaaa gctgcactgg
201 aagcgcacaa caatggatca gcaaggaatg tgtctgttgg cactcatcaa
251 attgcggcaa atatttttgc gttctatgac gcaaaagact accacctatt
301 tgccatgggc aaaagaggtg gactgactcc agcagccgag caattgagaa
351 tcgcaattga acgaggaacg atttacaagg tgcagtatga ctgccacttc
401 tgtcccgatt gtgatgcaat tgttgactct gaggaaggat ggttctgcga
451 ggattgtggt gagcaattca ataagcgcga cgataacgta cttggtaaca
501 agaatgatgt tgcacgcgct cttggcgggt ggactgaata cgaggacgct
551 acctgggcgt cgtttgaggt tgctaaggct gatatgttcg aagtggctcc
601 aactgtgggt caactggaga aagaaatcag agccatagaa aaatcagctg
651 gtaagagact caccgcttat gaagaagaaa tgattgagga gctagcctat
701 aaacttgatg tggccaaaat gcaagaggaa gaacaggaga aagtcctcga
751 ggagactgat ttctcaatct ccaatgatga atttccaact ctcggaggac
801 cgcaagatgg agtagttaat gaaactactg aggaacacga taaggaaagc
851 gtggttgaag tcgcgagaga ggttgagcag gctggagagt ttaaaactac
901 acacgagagg actaatgaac caattagcga cgttgctgaa acgcacgtgg
951 ttgcgacacc tgttgcgact ttaggagtga ccgattttgg agctgcaatt
1001 agcaaggaga agctagttga gaaaccaaaa acagttatgt gggttgcgaa
1051 gtcgaaagct cctgctgcag ttcctgcaac gccatctaag acggctgtct
1101 ggatctctaa gcctaagcta gcggacgtgc tttcaaccgt tgaaccagtc
1151 gctaagccac cattgagaac ttacagtgat gtgatcagta taggctctat
1201 ggtgtgtcct atcatgatag cagctattga atcacatagc catcaaaacg
1251 aaaagcttaa tgggtttcca aacgcacaag ttatcgacgg accgaaagaa
1301 gaagagccag tgattaagta caatataact tttggctcat ttaattatga
1351 agtgagtgcg aaaggggaac aagtgcaggc tgctgtcaaa cttgacgaga
1401 tgctcgaagg acctgacacc gaacctattt tgatttgcca aacaggaagc
1451 tcccataaac atgaaactaa gaaggcggcg aaagggcttc ttgttcaaga
1501 tagattttca gtgattggaa ataaggtttt gtgcaagtca ttccctgctt
1551 tcaagaattt catgaatgaa acaaggttgg gaggaattta cagaacaaga
1601 aaaggaagtt acaaaaacgc tgcattgcgt ttattgaagg cgactaaagt
1651 tcaagtgttt tatgatggaa taaaggatat tttcgaatgt ccgtattgcc
1701 acgtgagcag caacgaacta gagggcttaa atggagacaa ttgtgaaaag
1751 tgtaaggacc tattttacaa acacattgac gacccgagaa aagttgagga
1801 ggaatatctt atggttcctc tcgtgcccat tgatcagcat gttcatgagg
1851 aacatagcgt catcagtaag gctaaatggg aagcgcatga atccatctgt
1901 gagggagaag cgaatattgt caagattttc aatggcaaat caactgcaag
1951 caaaaagagc tttaaaacta agcaagccct gaatgtcgca aatataccac
2001 ttgacgactt tatgcaggaa ttagttgaaa tctgtcttga gaggaacacc
2051 cctattgaaa tcattggggg tgtgaaatct tttaatgttg ttaagttgag
2101 gcacaccacg agagacatta gcaagtctgg tgaggatgat atgtatccaa
2151 ctgaaagaga gtggttctgc cacacgcata agctatgttt atgtggaaga
2201 attgatcgtg gcaagaaaat caggagtttc gaggtgcgtc caggttggag
2251 tggagtgatc ttgcatagga atcaagtggc ggagtgtgat tggaataagt
2301 ttgttttcat tgatgacata tgcgtggttc aaggtagaaa cttgatcacg
2351 gataaaattg agaatgcact tgaaaagaaa ggggccaata ggcttaagca
2401 gatgcaattc tatgcaagct ttattatacc taatttcaag gacgaatttg
2451 acagggctag ccggctaaaa gctgatcatg aaccggatga gtcgtctaac
2501 aatgaactca ttggcaggct agctaaactt gttgcagcag ttattccaaa
2551 aggccattta tactgcaaaa catgctgttt tagagtcatt aagagcaagc
2601 gcatagacat agtcaacgca ttgaataagg ctaagcagcg tggtgaacgg
2651 gatgagttca tttatgatga gcttatcaaa ctgttcgagc tacaagcacc
2701 accaccgtat aagattgcat cgataaccga cgaaggtgac gtgttgcgca
2751 cgttaggatt gagtggagac ctgtataatg gacgcctgag tcttataatg
2801 caacatttgc aggggttgca tacgtcgatt tcaatgcttc accaatcact
2851 tgcaggagcc caaaatgacc aacaaattga caggcaagcg cttcataacc
2901 aagtgcggat tctacatcag aggaatgagg agcacatgcc tttcctaaag
2951 aaagcagttg atgaaatcca gcttttaaat gcaacagacc aagttgccaa
3001 cgctcgggaa ttgtacctgg atactagggc aactagtact ggagattttg
3051 acatcttacg aaaataccag agcatacatg agttctttcc caacattatg
3101 agccgggcca ataaagcagg gatggcagtc atcaaatctg aaacttcatt
3151 gagcaaagct tttgccctta tgaacagtgc taagtcgatg gatgctattc
3201 atacgctcat aggggaggac gtaatggaac acattagtgg tgcttgcttg
3251 ctgaagaatg acaaaacact tttctcaatt ggttgtaagc aaggtgttaa
3301 tggtagcaag atgtacggtc ctctttgtcc gaccaagcag catgtgagaa
3351 ttcaccgagt tgaatctaac atgcaaattc ctatgccaac attccatgat
3401 gcgactgtct gggagtttaa tgaaggttat tgctacgcca atcaatttgc
3451 cataatggtt ggttttatta atgaggatga gatggagttt tataagaatc
3501 aaatgaatca aattgtgttg aacttagggg cttggccaac ttttgagcaa
3551 tacttaattg agctacgagc aatctcgctt gattatccaa aagttagagg
3601 gtgtccagct gctatccatt tagtttccca tgtgaacaag cttatccatg
3651 ttatgggaca atttggaaca ataaatcaag gttggcatgc acttgaagta
3701 gcaactgtag gtgagcttgt agacttgtgc cataagaaag ttgaaggtga
3751 aatgttaact tataaagttg gtggcatata tgactgggtg actaagaaaa
3801 atgctttcat tgatctattt gaacaacatc cagaaaacat attcaagatc
3851 tgtacatcac cgtcagtgtt gtggttattt gctcgcagct gcgaaaagca
3901 tgattttatt aactacatca tggcaaggga ccattcacta gtgggtctgt
3951 tcataaaatt ggagtacgtc ggcaagcatc ttcacatttt ccaaagtgtt
4001 gacgatgttt gtgttgaata cgcagcgtct atgagggaaa tcattgaaga
4051 gcacgctgat attcatgggc tgcgggattc agtggtggat agaatggtcc
4101 atgcatatca caatgaggtg agagaagcaa ataaatatga gctagttgat
4151 aggatccttg aaaaaaacat tggatcaatt gcgaaagaaa tttcatcgcg
4201 aaagctcata acaatgtatc atcaagacat attctcttgg catgaatggc
4251 agcgtttgaa attaagcgca cactctttga atgcgcagaa acttttcgac
4301 gaggcaaacg aacgcgcata cgggaaacaa tcatggaatt tacgcgtgat
4351 ttggggtgcg tgcaaggaag tgttatatgc agtcacacgc ggagtttgtg
4401 taagagtcaa aggaacaact gtgcgctgtg ccgacgccgt agtatatggc
4451 ttctatggta gaacacgagc aatggtaaca agctgggcta gcgaggcctg
4501 gggagctatt ttcacgtcgt gtttgagagc attagttgtt atggtggtca
4551 cggcatatat ctcaacttgg attcctaaga ttaggaagat gattaaacga
4601 gaaaagaaac agtttgaaga acttgggggt ggagagttat atgtagagca
4651 acatggcaaa agagaggagg cgtttttatt caagatttgc cccattttcg
4701 cacttatagc aggcatagtt gactacgaat ggggagcagc tgcatgcgca
4751 acaatgaata aagttaagag tatatgcact gttatgggat ctgttggaat
4801 tgaaagccac gccaataagc ctgatgacaa agttgagcaa gatttgaagg
4851 aatctctcaa attcacatct ttcgagattg aagtgccaaa cgaagtttta
4901 tcacaggatg atatgacttt cgaaaggggg ttccaacatc agatccaata
4951 tggcaatgtt tgtgcggatc ccatttatag tggtccacta cgcatgttag
5001 caattacgga aaactatgca agagagttgg caatgaacat tcgcacgtca
5051 ggagaaaccg atttgcgtgt ttatagtggt gttggtgggg gaaaatcaac
5101 aaggttaccc aaggaactaa gcatgtttgg acacgtgcta atctgtgttc
5151 cgacgagagt tttggcggaa agcttattga catcatttat ggtgttgttc
5201 aatatggatg ttaatgtggc ttaccgagga aggatacata ctggtaacgc
5251 gccaatcaca ataatgactt atggttatgc attcaatttc ttggtgcaca
5301 aaccactgga attaaaccga tatgattatg tgttgctaga tgaaattcac
5351 accaatcctg ttgagttcgc gcctttgttt tcttttatca aaaccacaga
5401 tccaaagaag aaaattgtta agttgtcagc aactcacgct ggtatggatt
5451 gcgagtgcga aactaggcac aagattaagg ttgaaacttt gagcgaaatg
5501 ccaatcgaaa gctgggtctc catgcaaggc tctggggttg tcggtgatgc
5551 aacatctgtt ggggatgtaa ttctagtctt tgttgcatct ttcaaggatg
5601 tcgacacatg cgctaatgga ttgcgatcta aaggcttcaa agttttgaaa
5651 gttgatagta gaaactttag acgtgatgca gatgtggata aacaaatcca
5701 atcacttggt gaagggaaga aattcatagt cgccacaaat ataattgaga
5751 atggagtgac attcaacatt gatgttgttg tcgattttgg cgagaaaatt
5801 agccctaatc tatcgagcga tgagcggtgt atcacactgg ggaggcagag
5851 aatatcgcga gcagaaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
5901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
5951 nnnnnnnnnn nnnnnnnggt aataatagtg gtcagccttc aacggttgtt
6001 gacaacacat taatcttgat gattgccatg gagtatgcaa tttctaaagt
6051 ttttgtcaca cgccctgaaa tcaagtatgt ttgtaatggg gatgatcttt
6101 taataaattg cccaaggagc accgcgaatg ccattagtga gtatttcaag
6151 gatgcatttg cagacttaag tttgaattat gattttgatc acatttgtga
6201 tgaaattaca agtgttgatt ttatgagtca cagtttcatg tggttagatg
6251 atgagcagat gtatatcccg aagttggaca aggagcgaat tgtggcaatt
6301 ttggagtggg aaagaagcga cgagcaattc aggacaagga gcgccttaaa
6351 cgcagcttac attgagagtt ttggatatga agatctgatg aatgagattg
6401 agaatttcgc ttctttttgg gcgaaagagc atggtctcga aaatgtgctg
6451 atggaacgag aaaaagttag agatttgtac atcaatgagg atttcgatgc
6501 gtcagccttt gagaagttct atccggagac attttcgcca tttgatgttt
6551 atgtcgaacc acacgcatcg acatctaaaa caatcgaaga actgcagcaa
6601 gaaatggagg acttggatgc ggacacaaca atcactgtgg ttcaaaggga
6651 gacacagaag gcacggataa gagaccaaat tgagacactt agggcacaac
6701 aaattgtgag accttctgaa gcgcagcttc aacctgatgt gactcccacg
6751 caaattgtca cgtttgagcc accgagagtc actggttttg gtgctttatg
6801 gattccgcgc caacaaagga gttacatgac gccagcttac gtcgagaaga
6851 taaaggctta tgttccacac tcaaacttga ttgaatctgg actagcaagt
6901 gaagctcaat tgactagttg gatcgaaagc acgtgcaggg actatcaagt
6951 tagtatggac gtcttcatga ctacagtact gccagcatgg atagttaatt
7001 gcataattaa tggaacgtct caggagcgta cgaatgagca cacatggaga
7051 gctgtggtta tggcaaatat ggaagatcag gaggtgcttt attaccctat
7101 caagcccata attgtaaatg ctcagccaac tttgaggcaa gtgatgcgcc
7151 attttggcga gcaagccgtt gctcaataca tgaacagtct tcaagttgga
7201 aaacccttca cagtgaaggg tgccgtaact gctggttatg ctaatgttca
7251 agatgcttgg ttgggtattg actttctccg agacacgatg ccgctgacaa
7301 caaaacagat ggaagtcaag caccaaatta tcgcagcgaa cgttacaagg
7351 cggaaaattc gtgtttttgc tcttgcagca ccgggagatg gcgatgaatt
7401 agacacagaa aggcatgttg tcgacgatgt tgctagaggt cgtcatagtt
7451 tgagaggagc tcaactcgat taaatgagca tgttatcttt aatttcaact
7501 gtttgctttc attttaatta cgtttcatgc tttgtgtttg cctgttgtgg
7551 cacttgaacc aggtacagct ggcaggtgtt tcggcatggt gtggttagac
7601 aattggtttt caccggtagt ctaagaagcg ctatgtatca cgtggttggt
7651 taattcatag tctatatggg ttaatcaaga agcgtctatc gcccaaaagg
7701 gtaccagaaa atagttgcgt cattcgtggc gcaattagtt tttggagttt
7751 ggctaggtaa cttgtcgcct tcccaaaagc c