Sequence of DPV Cacao swollen shoot virus

Cacao swollen shoot virus complete genome, isolate Peki

ACC No: AJ609019

Dated: 2005-04-15 | Length: 7141 | CRC: 182545047

                !!NA_SEQUENCE 1.0
ID   AJ609019   standard; circular genomic DNA; VRL; 7141 BP.
XX
AC   AJ609019;
XX
SV   AJ609019.1
XX
DT   25-NOV-2004 (Rel. 81, Created)
DT   15-APR-2005 (Rel. 83, Last updated, Version 3)
XX
DE   Cacao swollen shoot virus complete genome, isolate Peki
XX
KW   aspartyl protease; capsid protein; complete genome;
KW   nucleic acid-binding protein; ORF1; ORF2; ORF3; ORF4; ORFX; ORFY;
KW   polyprotein; reverse transcriptase; ribonuclease H.
XX
OS   Cacao swollen shoot virus
OC   Viruses; Retroid viruses; Caulimoviridae; Badnavirus.
XX
RN   [1]
RP   1-7141
RA   Muller E.;
RT   ;
RL   Submitted (25-NOV-2003) to the EMBL/GenBank/DDBJ databases.
RL   Muller E., CIRAD, UMR BGPI TA 41/K, Campus International de Baillarguet,
RL   34398 Montpellier cedex 5, FRANCE.
XX
RN   [2]
RA   Muller E., Sackey S.;
RT   "Four new full sequences of Cacao swollen shoot virus: a PCR full length
RT   cloning strategy and a variability analysis";
RL   Unpublished.
XX
RN   [3]
RA   Muller E., Sackey S.;
RT   "Molecular variability analysis of five new complete cacao swollen
RT   shootvirus genomic sequences";
RL   Arch. Virol. 50(1):53-66(2005).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .7141
FT                   /country="Ghana:Peki"
FT                   /db_xref="taxon:31559"
FT                   /mol_type="genomic DNA"
FT                   /virion
FT                   /organism="Cacao swollen shoot virus"
FT                   /specific_host="Theobroma cacao"
FT                   /isolate="Peki"
FT   CDS             296. .727
FT                   /db_xref="InterPro:IPR005479"
FT                   /db_xref="InterPro:IPR010746"
FT                   /db_xref="UniProt/TrEMBL:Q5TJI7"
FT                   /note="ORF1"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAE81277.1"
FT                   /translation="MSSRWEDSIQEWYEKSHTANLEYLDLASTSKVTNNQLAHNLAVTF
FT                   DRVNLGNRVFIKNLKQIQESILELNTRIDTVEVALRRLTKQFRENKPLSESEVKKLVEE
FT                   IAQQPKIVEKQALEISQQLELKLEKVEKLLHKLDQWVGQ"
FT   CDS             724. .1161
FT                   /db_xref="UniProt/TrEMBL:Q5TJI6"
FT                   /note="ORF2"
FT                   /product="nucleic acid-binding protein"
FT                   /protein_id="CAE81278.1"
FT                   /translation="MSESPSYQEALKEAEKIDPPAIGLTTSSGVTAVQGFRTVIKQNNV
FT                   QICLLATIADKLEELVQDQKKARKDKAKEIAIPEDLITKLQGLSIREKGEAKVTRKPEP
FT                   KGTLFGFKDPYKILAAEKAKITPKPVKEKKDESSKTATSSS"
FT   CDS             1127. .6577
FT                   /db_xref="GOA:Q5TJI5"
FT                   /db_xref="InterPro:IPR000477"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="InterPro:IPR001969"
FT                   /db_xref="UniProt/TrEMBL:Q5TJI5"
FT                   /note="ORF3"
FT                   /note="putative capsid protein, aspartyl protease,
FT                   ribonuclease H and reverse transcriptase"
FT                   /product="polyprotein"
FT                   /protein_id="CAE81279.1"
FT                   /translation="MSRARPQPPVPSVTSTTSEQNREGPLYEDQIRDYRRSQRRIFNLR
FT                   RRARRLRRSMMGSRYQETLEQEIDPQTTLRLSMQERARLVPAEVLYRSRRDTVHHRVYT
FT                   HRSEESVLCVGGNQVDRAFIQPESLEQLQRTGMSFIHIGILQVRIQILHRQEEGTMALV
FT                   VFRDNRWSGDQSIFAQMEIDLTKGSQLVFVIPDTMMTIGDFARNVQLSILTRGYENWQN
FT                   GEANLLITRGMTGRLSNTPNVAFAYQIASATDYLASHGVKAIAGKKMNLQHLRNQQWIL
FT                   RPPQADITPMQPRSVETRNLVDGSISIRFHDYEAATSTSRPHYNEEDEEVESETESEIR
FT                   EHTVAVWIGEEEVPDQTGRKKVWEESSNGNGRFFRYYTPPPTSDEQIIATGWGSDDDYD
FT                   EIPPKWDESPDEEGSSETTWDQEEKEEEDEYDPNIYMAYLQKEENEWQEIAASLQEEME
FT                   MEYPRRRPRTETVFSETVDYTPPGDTLMTPVGYPPASSSRSTVTTPSRPPLFEGRVTHV
FT                   PRFLKRDDYTEWWQLPSSQGTTGALFVMPKQMGLFHEVFSRWESITKNYVAAQGFTDPT
FT                   EKMEFMENLLGETEKLTWIQWRMNYEAEYQQLLTQADGRQGTQNILSQIKRVFSLEDPA
FT                   SGSTRIQDAAYRDLERLTCHNIKDIVQFLNDYGRLAAKSGRLFLGTELSEKLWMKMPPE
FT                   LGNRMKEAFQKEYSGNEVGVFPRILFAYRYLEQECKDAAFKRSLKSLSFCKDMPLTGYY
FT                   DKTPKYGMRKSKTYKGKPHASHARIEKRKHLIRNKKCKCYLCGDEGHFARECPNNKRDV
FT                   KRVAIFEGINLPEGFDIVSVEEGEDDSDAIYSISENENGEELDAEVVQEKVFMMREEDQ
FT                   SYWLGKTNHWTAMVRVSSQQYHCLHQWEHNKEITVVAHINCHFCKQPTQLRSRIHCSTC
FT                   KLTSCFMCAPIYCNITVQQQPKPPTPFNTNTLLQQQAAYIQWLEGENQRLTEAVEFYKK
FT                   EAADLRLEKELEKDRKDLEPKIQDRGKKVQILDPEAGPSDDEQTAYLEEDTVSRIIGHT
FT                   VEEQQEVKKPVKRGNMLYNLDVVLLIPEVGRPIKVKAILDTGATTCCININSVPKTAIE
FT                   QNTFLVQFRGINSTQSVDKKLKYGRMTISNHQFRIPYCYAFPLSLGDGIEMILGCNFIR
FT                   GMYGGLRIEGHTITFYKNVTTIQTRLAAVMVGGTTTSELGEEGTEPIFEIEEETEEFDS
FT                   EVHQQIMSHVAAQAQQQKLDPKLQQLMERLKDQGFIGENPMQHWAKNKILCRLDIKNPD
FT                   LIIEDKPIKHLTPAMEKQFQKHVKALLDIGVIRPSKSKHRTTAFIVESGTVIDPVTKKT
FT                   IHGKERMVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKIFSKFDLKSGFHQVAMAKES
FT                   IPWTAFWVPQGLYEWLVMPFGLKNAPAVFQRKMDQCFKGTEEFIAVYIDDILVFSETMA
FT                   EHTKHIGIMLTICQENGLVLSPNKICLAQREIEFLGTIISQGQMKLQPHIIKKIVNKAD
FT                   MELETTKGLRSFLGLLNYARIYIPNLGKKLSPLYAKTSPTGEKKFNRQDWHLIKEIKNM
FT                   VQKLPNLAIPPARCCIIIESDGCMEGWGAVCKWKLAKEDSRTTEKICAYASGKFGIIKS
FT                   TIDAEIFALIKALESFKIFYLDKKHLVARTDCQAIVTFYNKTSTHKPSRIRWITFSDYI
FT                   TGLGVQVTIEHINGKENQLADTLSRLVYTTWNQSQAHLSEEEEPEKSPHLSLAVLAIPI
FT                   AWPMTAFYSRRRTPLLKGGSPWQQNKPSQHSCIASKSKQPEKHSWPYETYRTYCTPSET
FT                   T"
FT   CDS             2310. .2582
FT                   /db_xref="UniProt/TrEMBL:Q5TJI4"
FT                   /note="ORFX"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAE81280.1"
FT                   /translation="MMIMMKSLQNGMKVLMKKDQVRQLGIRKKKKKKTNMIPTSIWPIY
FT                   KRKKMSGKKSPPVYKKKWKWNIHGGGHGRRQYSLKQLTIHHLVTH"
FT   CDS             4212. .4499
FT                   /db_xref="GOA:Q5TJI3"
FT                   /db_xref="InterPro:IPR000169"
FT                   /db_xref="InterPro:IPR001356"
FT                   /db_xref="UniProt/TrEMBL:Q5TJI3"
FT                   /note="ORF4"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAE81281.1"
FT                   /translation="MMNKQRILRKIPLAELSAILWKSNKRSKNQSKGGTCYTTSTWFYL
FT                   SLKLEDLSRLKLSLIQAQLPVASTSTLFPRQQLNRTLFWFNSEALIPRNQ"
FT   CDS             6307. .6702
FT                   /db_xref="UniProt/TrEMBL:Q5TJI2"
FT                   /note="ORFY"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAE81282.1"
FT                   /translation="MEPVPSSPVGGRRAGEISTSQLSGVSYPYSLAYDGLLQQKKNAIT
FT                   QGRLTLATEQAISAQLYRIEEQAARKALMALRDLQDVLHSKRDYLTATATRDNWASDRL
FT                   PAAQQDSAALDQHADVINAIIERAVQP"
XX
SQ   Sequence 7141 BP; 2414 A; 1542 C; 1598 G; 1587 T; 0 other;

 AJ609019  Length: 7141  April 19, 2005 08:52  Type: N  Check: 5349  ..

       1  tggtatcaaa gcttggtttt agcaatggtc atgtccggct aagttagtca
      51  gtctaggttc agggaaagag tgaggtaacc gttaggcgtc aaaatactgt
     101  tccaaaatac tattcaccaa atcactaggg gtatcctgtt tatgtgaaaa
     151  agacgtaacc cagacgaaaa gtacccacga agggggaaaa cttggggaaa
     201  aggtgatcag taaacagaaa acaaactatt cctcgtatgc tctacggttg
     251  cagccacggc tgatcctccg tatcagaaaa ggaaaagttt ggtgtatgtc
     301  tagccgatgg gaagatagta tccaggaatg gtatgagaag tctcacactg
     351  ccaaccttga gtaccttgac ctggcctcta ccagtaaagt gaccaacaac
     401  caactagctc ataacctcgc agtaaccttt gatagagtca acttaggaaa
     451  ccgagtcttc attaaaaacc ttaaacaaat tcaagagtct atcctagaac
     501  taaataccag aattgacact gtagaagtag ccttgagaag gctaaccaag
     551  cagttccgag aaaacaaacc actgtctgaa tctgaagtaa agaaactagt
     601  ggaagagata gcccagcaac ccaaaattgt tgagaaacag gcactggaaa
     651  tttctcaaca gttagaactc aaactagaaa aagtggaaaa gcttctacac
     701  aagcttgacc agtgggttgg tcaatgagtg aaagcccctc ttaccaagaa
     751  gcacttaagg aagctgaaaa gatcgaccct ccagcgattg gactaaccac
     801  atccagtgga gtaactgcgg tccaaggctt caggactgtc atcaaacaaa
     851  acaacgtcca gatttgccta cttgcgacta tagcagacaa acttgaagaa
     901  ctggtgcaag atcagaagaa agcaagaaaa gacaaggcca aggagattgc
     951  tattcctgag gatcttatca caaaactcca aggattatcc atccgggaga
    1001  aaggtgaagc aaaggtcaca agaaaaccag agccaaaagg aacactgttt
    1051  ggattcaaag atccttacaa aattttggca gcagaaaagg ctaagatcac
    1101  accaaagcct gtaaaagaaa agaaagatga gtcgagcaag accgcaacct
    1151  ccagttccta gtgtgacatc taccactagt gaacagaaca gagaagggcc
    1201  tctttatgag gatcaaatca gagattacag aaggagtcag aggaggatct
    1251  tcaacctaag aaggagagcc agaaggttga gaagatcaat gatggggtct
    1301  agataccagg agaccctaga acaagaaatt gatccacaga caacactgag
    1351  gttgtccatg caagaaagag cgcgattagt accagctgaa gtactgtaca
    1401  gatcacgacg agacactgtt caccacaggg tctacaccca tcgctctgaa
    1451  gaatccgtcc tatgtgttgg cggaaatcaa gttgacaggg ccttcattca
    1501  gcccgaaagt ttagagcaac ttcagaggac tggaatgtcc ttcattcaca
    1551  taggaatcct gcaagttagg attcaaatcc tgcatcgtca agaagaaggt
    1601  accatggctt tagttgtctt ccgtgacaac agatggtctg gggaccagtc
    1651  tatcttcgca caaatggaga tagatctaac taaaggcagc cagttggtat
    1701  ttgtcatacc agataccatg atgacgatcg gagattttgc ccggaacgta
    1751  caactatcaa tcctcacacg aggatatgag aattggcaaa atggagaagc
    1801  caacctttta atcacacgtg gcatgacggg acgactatcc aacacaccta
    1851  atgtcgcctt tgcctaccaa attgccagcg cgacagatta tctggcaagc
    1901  cacggagtaa aagccatcgc agggaaaaag atgaacttac aacacctgcg
    1951  aaatcaacag tggattctac gaccaccgca agctgacatc acaccgatgc
    2001  aaccaagatc ggtagaaacc aggaacctgg tagatggcag catttccatc
    2051  agattccatg attatgaggc agccacctca acatcaagac cccattacaa
    2101  cgaagaagat gaagaagtgg agtctgaaac agaatctgaa ataagagaac
    2151  acactgtagc agtctggatt ggggaagaag aagttccaga ccaaacagga
    2201  agaaagaaag tatgggaaga atctagtaat ggaaatggaa ggttcttccg
    2251  gtactacact cctccaccaa catctgatga acaaatcata gccactggtt
    2301  ggggaagtga tgatgattat gatgaaatcc ctccaaaatg ggatgaaagt
    2351  cctgatgaag aaggatcaag tgagacaact tgggatcagg aagaaaaaga
    2401  agaagaagac gaatatgatc ccaacatcta tatggcctat ttacaaaagg
    2451  aagaaaatga gtggcaagaa atcgccgcca gtttacaaga agaaatggaa
    2501  atggaatatc cacggcggag gccacggacg gagacagtat tctctgaaac
    2551  agttgactat acaccacctg gtgacacact gatgacacct gtcggatatc
    2601  caccggcctc gtcatcaaga tcaacagtca caacaccaag taggccccct
    2651  ttatttgaag gaagggttac acacgtgcca agattcttaa aacgggatga
    2701  ctacacagaa tggtggcaac taccatcatc ccaaggcaca actggggcac
    2751  tatttgtgat gcccaaacaa atgggcctat ttcatgaggt cttctccaga
    2801  tgggagtcca tcaccaaaaa ctatgttgcg gcccaaggtt tcacggaccc
    2851  aacagaaaag atggagttca tggaaaattt acttggagaa acagaaaaac
    2901  taacctggat ccaatggaga atgaattatg aggctgagta ccagcagctg
    2951  ttaacccaag ctgatggacg gcaagggacc cagaatatct tgtcccaaat
    3001  taagagagtc ttctctctag aagaccccgc ctcaggatct acgaggatac
    3051  aagatgctgc atacagagac cttgaaagat taacctgcca caacataaaa
    3101  gatatcgttc agttcctgaa tgattatggg cggttagcag caaaaagtgg
    3151  gcgactgttt ctaggaacag agctcagtga aaaattatgg atgaagatgc
    3201  caccagaact agggaatcgc atgaaggaag catttcaaaa ggaatactca
    3251  ggcaatgaag taggagtctt cccgcgtatc ttgttcgcgt acagatactt
    3301  agaacaagaa tgcaaagatg cagcttttaa gcgcagcctg aaatcgttga
    3351  gtttctgtaa ggacatgccg ttgacaggtt actatgataa aacacccaaa
    3401  tacggcatga ggaagtcaaa aacttacaaa ggaaagccac acgcatcaca
    3451  tgcaagaata gaaaagagaa agcacttaat caggaataaa aagtgcaagt
    3501  gctatctgtg tggagatgaa ggacattttg ccagagaatg ccctaataac
    3551  aaaagagatg tcaagagagt agccattttt gaaggcatca atcttcctga
    3601  gggtttcgac attgtctcag tagaagaagg ggaagatgac tcagatgcta
    3651  tttatagcat atctgagaat gaaaatgggg aagaacttga cgcagaagta
    3701  gtccaagaga aggtcttcat gatgcgagaa gaggaccaat cctactggct
    3751  aggaaaaaca aatcactgga cagcaatggt acgagtcagc agccaacagt
    3801  atcattgctt gcaccaatgg gagcacaaca aggagatcac ggtggtggcc
    3851  cacatcaact gccacttctg taagcagcct actcaactga ggagtcgaat
    3901  acactgttcc acgtgtaaac tcaccagctg cttcatgtgt gcccctatct
    3951  actgcaatat aacggtccaa cagcagccta aaccgcctac gccattcaac
    4001  accaacacct tgctccaaca acaagcggcg tatatccaat ggttagaagg
    4051  agaaaaccag cggttaacag aggcagttga attttataaa aaggaagctg
    4101  cagatctaag gctcgaaaaa gaattggaaa aagatagaaa agatttagag
    4151  ccaaagatac aagacagggg gaagaaggtt caaattcttg atccagaagc
    4201  aggaccctct gatgatgaac aaacagcgta tcttgaggaa gataccgtta
    4251  gccgaattat cggccatact gtggaagagc aacaagaggt caaaaaacca
    4301  gtcaaaaggg ggaacatgtt atacaacctc gacgtggttt tacttatccc
    4351  tgaagttgga agacctatca aggttaaagc tatccttgat acaggcgcaa
    4401  ctacctgttg catcaacatc aactctgttc ccaagacagc aattgaacag
    4451  aacacttttt tggttcaatt cagaggcatt aattccacgc aatcagtaga
    4501  taaaaagcta aaatacgggc ggatgactat tagcaaccac cagttccgga
    4551  tcccgtactg ttatgccttt cctctatctc ttggagacgg aatagaaatg
    4601  atcctcgggt gtaacttcat ccgtgggatg tatggcggtt tgaggattga
    4651  aggtcacaca atcaccttct acaaaaacgt caccactatt caaacccgcc
    4701  ttgctgctgt aatggttggt ggtacaacca cttctgagtt gggggaggaa
    4751  ggtactgaac ccatttttga aattgaagaa gaaacagaag agtttgactc
    4801  agaagtccat caacaaatta tgagtcatgt tgcagcccaa gcccaacaac
    4851  aaaaattaga cccaaaactc caacaactaa tggaacggtt aaaggatcag
    4901  ggctttattg gggaaaatcc gatgcaacat tgggctaaaa acaagatcct
    4951  gtgtagattg gatatcaaga atccagacct tatcatagaa gacaagccca
    5001  ttaaacactt aacaccggct atggagaagc agttccagaa gcatgtcaaa
    5051  gctctcctgg acattggtgt tatcaggcct agtaagtcaa aacacaggac
    5101  tacggccttc atagtagaat caggcactgt tattgatcca gtaacaaaga
    5151  agaccataca cggcaaagaa agaatggtct ttaactacaa acgcctgaat
    5201  gacaatacgg agaaggatca atactcgcta cctggtatac agaccatcct
    5251  gaagcgagta ggcaacaaaa agattttcag caagttcgat ttaaaatcgg
    5301  gcttccatca ggttgccatg gcaaaagagt ccatcccttg gactgctttt
    5351  tgggtaccgc agggcctata cgagtggtta gttatgccct ttgggctcaa
    5401  aaacgctcct gcagtatttc aaagaaaaat ggaccaatgt ttcaaaggca
    5451  cagaagaatt catagctgtg tatattgatg acatcttggt cttcagcgag
    5501  actatggcgg aacacaccaa gcatattgga atcatgctaa caatctgcca
    5551  agaaaatggg ctggtcctaa gcccaaataa aatatgtctt gctcaacgag
    5601  agattgaatt tttgggcaca atcatctctc aaggtcaaat gaagcttcag
    5651  cctcatatca taaagaagat agtcaacaag gcagatatgg agctcgaaac
    5701  aactaaaggc ctaagatcat ttttgggcct cctgaactat gcccgaatct
    5751  acatacccaa tctggggaag aagctaagtc cactatatgc caaaaccagt
    5801  cccaccggag aaaagaagtt taatcgacag gattggcatt tgataaagga
    5851  gattaaaaat atggtccaaa agctcccaaa cctcgctatc ccaccagcaa
    5901  gatgctgtat tatcatagaa agcgatggtt gcatggaagg atggggggcc
    5951  gtatgcaagt ggaaattagc aaaagaagat tcccgcacta ctgaaaagat
    6001  atgtgcctac gctagtggaa aattcggcat aatcaagtcc acaattgacg
    6051  ccgagatttt cgcactcata aaagcattag aatcttttaa aatcttctat
    6101  ctggacaaaa aacatttggt ggcgcgtact gactgtcagg cgatagtgac
    6151  gttttataac aagacaagca ctcacaagcc ttctcgtata cgttggatca
    6201  ccttttccga ctatataacg gggttaggag ttcaagttac tatcgaacac
    6251  ataaacggaa aggagaacca gttagcagat acactaagca gactagtgta
    6301  caccacatgg aaccagtccc aagctcacct gtcggaggaa gaagagccgg
    6351  agaaatctcc acatctcagc ttagcggtgt tagctatccc tatagcttgg
    6401  cctatgacgg ccttctacag cagaagaaga acgccattac tcaagggagg
    6451  ctcaccttgg caacagaaca agccatctca gcacagctgt atcgcatcga
    6501  agagcaagca gccagaaaag cactcatggc cctacgagac ctacaggacg
    6551  tactgcactc caagcgagac tacttgactg cgactgccac acgtgacaac
    6601  tgggccagtg atagactgcc agctgcccaa caagattcag ccgccctcga
    6651  ccaacatgct gacgtgatta acgccattat cgaaagggct gtccaaccct
    6701  agtttggacg atagtgtgta ataattaagt gtgctttact ttccagctgt
    6751  ccaaccctag tttggacgat agtgtgtaat aattaagtgt gctttacttt
    6801  ccagctgtcc aactctagtt tggacgatag tgtgtaataa ttaagtatgc
    6851  tttactttcc agctgtaact cattatagag tagacatagt gattgacgat
    6901  ggggcccgaa gagcacccgg attactactc taccatctat aaatgggagt
    6951  gtgtaaggct tagccatcaa ggaggaagag atatccatcc tgattctaag
    7001  cattctagag cttgtaagtt tttaagagaa ataaagaatc ttcttagtga
    7051  ttatttcttt gtttcccctg ggaaaatttt aaacagtttc tgttattctg
    7101  tttaatcctg ttccacgttt catttcctct gcaagcctac t