Sequence of DPV Rice tungro bacilliform virus
Rice tungro bacilliform virus for partial polyprotein, genomic DNA
ACC No: AM087225
Dated: 2005-12-16 | Length: 2744 | CRC: 935617604
ID AM087225 standard; genomic DNA; VRL; 2744 BP.
XX
AC AM087225;
XX
SV AM087225.1
XX
DT 16-DEC-2005 (Rel. 86, Created)
DT 16-DEC-2005 (Rel. 86, Last updated, Version 1)
XX
DE Rice tungro bacilliform virus for partial polyprotein, genomic DNA
XX
KW coat protein; IR region; movement protein; P194; polyprotein; protease.
XX
OS Rice tungro bacilliform virus
OC Viruses; Retro-transcribing viruses; Caulimoviridae; Tungrovirus.
XX
RN [1]
RP 1-2744
RA Tandon V.;
RT ;
RL Submitted (21-SEP-2005) to the EMBL/GenBank/DDBJ databases.
RL Tandon V., Department of Plant Molecular Biology, Delhi University, South
RL Campus, University of Delhi, Benito Juarez Road, New Delhi, 110021, INDIA.
XX
RN [2]
RA Tandon V.;
RT "Analysis of the complete nucleotide sequence of a representative Indian
RT isolate of rice tungro bacilliform virus";
RL Unpublished.
XX
FH Key Location/Qualifiers
FH
FT source 1. .2744
FT /country="India:Orissa"
FT /db_xref="taxon:10654"
FT /mol_type="genomic DNA"
FT /virion
FT /organism="Rice tungro bacilliform virus"
FT /specific_host="Oryza sativa"
FT CDS <1. .>2744
FT /codon_start=1
FT /note="P194 (194KDa) polyprotein"
FT /note="ORFIII"
FT /product="polyprotein"
FT /protein_id="CAJ32141.1"
FT /translation="SIAIKTIGRLTTNIQARYKMNVKDIVEQISSQGITMVAPMEIDSS
FT HLDGNEWNLSKFMIQEGTSRVPSKALIYQNLHGGESLRFSNYTQTKMHDPTEINSDEDE
FT DLKILGEQLNAKMATFQEETLEQKLERIKEEKKQLLAKLEAKKKEIESKSLKMAVIEDD
FT FNPNNEYLDDSYSELEDLEFEQLGLTGWEDLDQESLETEEITEWENPNQVLTREIKAFK
FT SVSEQIEDIFGELLKEHGNYDMALKNLEERYDLKNLEGAKNLEEIAKASTSKMMDVKPV
FT KRPKEEQTAYEDDMRDDWRRKELTANPEVSSKDRNFERIGGSYKKNFYPSKSEILNLDH
FT VPPQFYYDQIITWEGIVKNEWEARKKDGMDMWSWMDGRITGMVLYLIQDWISKNQAAYN
FT DIKSRGDRPENFVKMVKDRFLIEDPTDERRTALQRLALRELEALNCEDPVKIQPFMAEY
FT LKKAAEAKKGFDVIYVERLFDRLPEAVGKLIKKEFLDAGNSYEAGIGVAVSYISTWMRL
FT KCIKETEAKTQKKASLAFCRSIYTIGDYKKRKTLKRVTNYNRNKRKNYVRKPNIKRKCR
FT CYICQDENHLANRCPRRYVNQARASMIEGLEEDIVSIASDDEDYENFFEIIELEEFLSK
FT SGQQDHEHTWETGGKKEKTCDICDYYTDFNKTIVCKTCEIQYCTTCANQLGIEVEKTYK
FT KSREEELYEELRKTVVNLDLRLTIVEHKLEMNKLQEQFDSLQLSKEPSSSETIKALAMQ
FT AKESNFIKTNINRTAGCYVEVKLTFLNNSKVTTALIDSGSTHNIICPLLVPEIWIKSLN
FT LDIVMTTIDNSKYSLNRGLHDEVKIQFKEVDESFGIKYNLGQTYVAPKPTGTFIIGHRF
FT MTSEHGSITIHKDYVTIQKTTGIYPTARHELKSEFARKHGGQRP"
FT mat_peptide <1. .306
FT /product="movement protein"
FT misc_feature 307. .921
FT /note="IR region"
FT mat_peptide 922. .1449
FT /note="37 kDa protein"
FT /product="coat protein"
FT misc_feature 1500. .2163
FT /note="IR region"
FT mat_peptide 2164. .>2744
FT /product="protease"
XX
SQ Sequence 2744 BP; 1199 A; 393 C; 490 G; 662 T; 0 other;
am087225 Length: 2744 16-DEC-2005 Type: N Check: 9041 ..
1 tccatagcca taaaaactat aggaagatta acaacaaata tccaggccag
51 atataaaatg aatgttaaag acatcgtaga acaaatatca tcccaaggaa
101 taactatggt agcacccatg gaaatagatt catcacattt agatgggaat
151 gaatggaatt taagtaaatt catgattcaa gaaggaacaa gtagagttcc
201 tagtaaagcc ctaatatatc aaaatctgca tggaggagaa tcacttagat
251 tttcaaacta tacacaaacc aaaatgcatg atccaacaga aataaattct
301 gatgaagacg aagatttaaa aattttagga gaacaattaa atgccaaaat
351 ggcaaccttt caagaagaaa ccctagaaca aaaattagaa cgcataaaag
401 aagaaaagaa acaacttcta gcaaaactag aagctaaaaa gaaggaaatt
451 gaatcaaaat ccttaaaaat ggcggtcata gaagatgact ttaacccaaa
501 caacgaatat ttagacgatt catattctga attagaagat ctagaattcg
551 aacaattagg attaactggt tgggaagatc tagatcaaga aagtctagaa
601 acagaagaaa ttaccgaatg ggaaaaccca aaccaagtcc taactagaga
651 aataaaagcc tttaaatcag tatctgaaca aatagaagat atatttggag
701 aattattaaa agaacatgga aattatgaca tggcccttaa aaatttagaa
751 gaaagatatg atctaaaaaa cttagaagga gccaaaaacc tagaagaaat
801 agctaaagca tccacatcaa aaatgatgga tgtaaaacca gtaaaaagac
851 caaaagaaga acaaacggca tatgaagatg atatgagaga tgattggaga
901 agaaaagaat taacagccaa tccagaagta tcctcaaaag atagaaactt
951 tgaaagaata ggaggatcat ataagaaaaa tttttaccct agcaaaagtg
1001 aaatcctaaa cctagaccat gtaccccccc agttctacta tgatcaaatc
1051 ataacttggg aaggaatagt aaaaaatgaa tgggaagcaa gaaaaaagga
1101 tggtatggac atgtggtctt ggatggatgg aagaataaca ggaatggttt
1151 tatatctaat acaagattgg atatctaaaa accaagccgc ctacaatgat
1201 ataaaatctc gaggagatag acccgaaaat ttcgtaaaaa tggtaaaaga
1251 taggttctta atagaagatc ctacagatga aaggagaaca gccttacaaa
1301 gattagctct aagagaatta gaagctttaa actgtgagga tccagttaaa
1351 attcagccat ttatggcaga ataccttaag aaagctgctg aagctaaaaa
1401 aggatttgat gtcatatatg tcgaaagact atttgacaga cttcctgagg
1451 cagtaggaaa attaataaaa aaggaatttc tagatgcagg aaattcatat
1501 gaagcaggca taggagtagc tgtttcatat atatccacat ggatgagact
1551 aaaatgcata aaagaaacag aagctaaaac acaaaagaaa gcatcattag
1601 cattctgtcg atctatatat actataggtg attataagaa aagaaagaca
1651 ctaaaacgtg ttacgaatta caatagaaat aaaagaaaaa attatgttag
1701 aaaacctaac ataaaaagaa agtgtagatg ttatatctgc caggatgaaa
1751 accacctagc aaatagatgt cctagaagat atgtcaatca agctagagct
1801 agcatgattg aaggactaga agaagatata gtgtccatag cttcagatga
1851 tgaagactat gagaacttct ttgaaataat tgaattagaa gagtttttga
1901 gtaaatcagg acaacaagat cacgagcaca cctgggaaac aggaggtaag
1951 aaagaaaaaa cttgtgatat ttgtgattat tatactgatt tcaataaaac
2001 catagtatgc aaaacatgtg aaatacaata ttgtactact tgtgcaaatc
2051 aacttggtat agaagtagaa aaaacatata agaaatctag agaagaagaa
2101 ttatatgagg aattaagaaa aacagttgtc aacctagatc taagattgac
2151 aatagttgaa cataaactag aaatgaataa attacaagaa caatttgatt
2201 ccttgcaatt atctaaagaa cctagttctt cagagaccat aaaagcctta
2251 gccatgcaag caaaagaatc aaatttcata aaaaccaata ttaatagaac
2301 agcaggatgt tatgtagaag taaagttaac atttttaaat aactctaaag
2351 taaccactgc attaatagat tctggttcca cacataatat catatgtcct
2401 ttattagtac cagaaatatg gattaaaagt ttgaatttag atattgttat
2451 gaccacaata gataatagta aatatagcct caacagaggt ctacatgatg
2501 aagtaaaaat acaatttaaa gaagtagatg aaagttttgg gataaaatac
2551 aacttaggac aaacttatgt tgctcctaaa cctacaggaa cttttataat
2601 tggacatagg tttatgacca gtgaacatgg gagtattaca atccataaag
2651 actatgttac aatacaaaaa accacgggaa tttaccccac agctcgtcat
2701 gaactcaaat cagagtttgc gcgaaagcat ggtggacaaa gacc