Sequence of DPV Merremia mosaic virus
Merremia mosaic virus isolate PR80-H3 segment DNA-A, complete sequence.
ACC No: DQ644557
Dated: 2009-06-03 | Length: 2556 | CRC: 1991145737
ID DQ644557; SV 1; circular; genomic DNA; STD; VRL; 2556 BP.
XX
AC DQ644557;
XX
DT 14-MAY-2008 (Rel. 95, Created)
DT 03-JUN-2009 (Rel. 101, Last updated, Version 3)
XX
DE Merremia mosaic virus isolate PR80-H3 segment DNA-A, complete sequence.
XX
KW .
XX
OS Merremia mosaic virus
OC Viruses; ssDNA viruses; Geminiviridae; Begomovirus;
OC unclassified Begomovirus.
XX
RN [1]
RP 1-2556
RA Idris A., Brown J.;
RT "Molecular and biological characterization of Merremia mosaic virus: a
RT bipartite begomovirus from Puerto Rico";
RL Unpublished.
XX
RN [2]
RP 1-2556
RA Idris A., Brown J.;
RT ;
RL Submitted (18-MAY-2006) to the EMBL/GenBank/DDBJ databases.
RL Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
XX
FH Key Location/Qualifiers
FH
FT source 1..2556
FT /organism="Merremia mosaic virus"
FT /segment="DNA-A"
FT /isolate="PR80-H3"
FT /mol_type="genomic DNA"
FT /country="Puerto Rico"
FT /db_xref="taxon:77813"
FT CDS 141..887
FT /codon_start=1
FT /product="coat protein"
FT /note="AV1"
FT /protein_id="ABG90892.1"
FT /translation="MVKRDAPWRLMAGTTKVSRNANFSPRGGMGPKAAAWVNRPMYRKP
FT RIYRTLRGPDVPKGCEGPCKVQSFEQRHDISHVGKVICISDVTRGNGITHRVGKRFCVK
FT SVYILGKIWMDENIKLKNHTNSVMFWLIRDRRPYGTPMDFGQVFNMYDNEPSTATVKND
FT LRDRFQVMHRFYAKVTGGQYASNEQALVRRFWKVNNYVVYNHQEAGKYENHTENALLLY
FT MACTHASNPVYATLKIRSYFYDSISN"
FT CDS complement(884..1282)
FT /codon_start=1
FT /product="AC3"
FT /protein_id="ABG90895.1"
FT /translation="MDSRTGEPITAHQAMNGVFIWEVPNPLYFKTIQVEEPIYTTTRIY
FT TIQIRFNYNLRKALSLHKAYLNFQIWTTSVQASGTTYLNRFKDLVLMYLDQLGVVSLNN
FT VIRAVRFATDKPYVNCVLERHSIKFNLY"
FT CDS complement(1029..1460)
FT /codon_start=1
FT /product="AC2"
FT /protein_id="ABG90894.1"
FT /translation="MKTHLSERGRSTMQNSSSSTPPSIKAQHRRAKRSKSVIRRRRLDL
FT DCGCSIYVHINCRNHGFTHRGTHHCSSSNEWRFYLGGSKSPLFQDNPSGGANIHHNQNI
FT HHPNTVQLQPEEGVESTQSIPELPNLDDISSSFWDDIFK"
FT CDS complement(1345..2421)
FT /codon_start=1
FT /product="AC1"
FT /note="replication-associated protein"
FT /protein_id="ABG90893.1"
FT /translation="MPRKGSFSIKAKNYFLTYPICSLAKEEALSQIKALHTPVNKKFIK
FT ICRELHDNGEPHLHVLIQFEGKYNCTNNRFFDLVSPTRSAHFHPNIQGAKSSSDVKSYI
FT DKDGDTIEWGQFQIDGRSARGGQQSSNDTYAKALNAASAEEALQIIKEEQPQHFFLQHH
FT NLVANATRIFQKSPEPWVPPFQLSSFTNVPDEMQEWADNYFGRGAAARADRPISIIIEG
FT DSRTGKTMWARSLGKHNYLSGHLDFNGRVYSNDVEYNVIDDISPNYLKLKHWKELIGAQ
FT KDWQSNCKYGKPVQIKGGIPSIVLCNPGEGASYKDFLDKDENASLRAWTIHNAKFIFLN
FT SPLYQSTAQESEEIQICH"
XX
SQ Sequence 2556 BP; 646 A; 527 C; 588 G; 795 T; 0 other;
dq644557 Length: 2556 03-JUN-2009 Type: N Check: 6858 ..
1 accggatggc cgcccgccgc gcccccctgg gcccacatat taaagccgtc
51 caatcacaaa gcgtcctgga agtctaattg tttaaaataa gcctataaat
101 acattggagt ccgtctatac cccaccaact ttaatttaaa atggttaaga
151 gggacgcccc atggcgttta atggcgggga ccactaaagt tagtcgcaac
201 gccaatttct cgccacgtgg aggtatgggc cctaaggccg ctgcttgggt
251 taacaggccc atgtacagga agcccagaat ttatcgcact ttgagagggc
301 ctgatgttcc taaaggttgt gaaggcccat gtaaggtaca gtctttcgag
351 cagcgtcatg atatttctca tgttggtaag gtaatctgta tatccgatgt
401 aactcgtggt aacggtatta cccaccgtgt tggcaagcgt ttttgtgtga
451 agtctgtgta tattctaggt aaaatatgga tggatgagaa cataaagctg
501 aagaaccaca cgaacagcgt catgttttgg ttgattcgtg acaggagacc
551 ctatggtacc cctatggatt ttggtcaggt gtttaacatg tatgacaatg
601 agccgagtac tgctaccgtc aagaacgatc ttcgcgatcg atttcaagtc
651 atgcataggt tctatgccaa agtaactggt ggtcagtatg ccagtaacga
701 gcaggcattg gttcggcgat tttggaaggt taacaactac gtcgtgtata
751 accatcagga agcaggaaaa tacgagaatc acacggagaa tgctctgtta
801 ttgtatatgg catgtactca tgcttctaat cctgtgtatg ctaccttgaa
851 aattcgtagt tatttttatg actccatttc gaattaataa agattaaatt
901 ttattgaatg tctttcgagc acacaattta catatggttt atccgttgcg
951 aaacgaacag ctctaatgac attgttaagc gaaacaacac ctaattgatc
1001 taaatacatt aaaactaaat ctttaaatct atttaaatat gtcgtcccag
1051 aagcttgaac tgatgtcgtc cagatttgga agttcaggta tgctttgtgt
1101 agactcaacg ccttcctcag gttgtagttg aaccgtattt ggatggtgta
1151 tattctggtt gtggtgtata ttggctcctc cacttggatt gtcttgaaat
1201 agaggggatt tggaacctcc cagataaaaa cgccattcat tgcttgatga
1251 gcagtgatgg gttcccctgt gcgtgaatcc atggtttctg cagttgatgt
1301 gtacgtaaat tgaacagcca cagtccaggt ctaaccttct ccgtctaatg
1351 acagatttgg atctcttcgc tctcctgtgc tgtgctttga tagagggggg
1401 agttgaggaa gatgaatttt gcattgtgga tcgtccacgc tctgagagat
1451 gcgttttcat ctttatcgag gaagtcttta tagctagccc cctctcctgg
1501 attgcacagc acgattgagg gtattcctcc tttaatttga actggcttcc
1551 cgtatttaca gttggactgc cagtcctttt gggcccctat caattctttc
1601 caatgcttta atttcaaata attagggctt atgtcatcaa tgacgttata
1651 ttcgacgtca ttcgaataga ctctgccatt aaagtcaaga tgtccactaa
1701 gataattatg tttacctaat gaacgggccc acattgtttt tccagttcga
1751 ctatctcctt cgatgatgat acttatcggt ctatctgccc gcgcagcggc
1801 acccctccca aagtagttgt ctgcccattc ttgcatctca tctggaacgt
1851 tcgtgaagga ggagagttga aacggaggaa cccatggttc tggagacttc
1901 tgaaatattc ttgttgcgtt cgcaacgaga ttgtgatgtt gaagaaagaa
1951 gtgttgtggt tgttcctcct ttattatttg cagtgcttcc tctgcagaag
2001 ctgcgtttaa cgcctttgcg tatgtatcgt tagaagactg ctgacctcct
2051 ctagcagatc ttccgtcgat ctggaattgt ccccattcaa ttgtatctcc
2101 gtccttgtcg atgtaggact tgacatcgga gctggattta gctccctgaa
2151 tgttcggatg gaaatgtgct gacctggttg gggataccaa atcgaagaat
2201 ctgttattcg tgcagttgta ttttccttcg aactggataa gcacatgaag
2251 atgaggttcc ccattatcgt gaagctctct acagatcttg atgaattttt
2301 tgtttacagg ggtgtgcaga gctttgattt gggacagtgc ttcttctttg
2351 gctaatgaac atatagggta tgtgaggaaa tagtttttgg cttttattga
2401 gaatgaaccc ttccgtggca tttttgtaat aagggatgtt cccccaattg
2451 ctccgctctc aaaactctat atgaatcggg ggaactgggg gtacatttat
2501 actagaactc tcattaaagg gatttgcaac acgtggcggc catccgctat
2551 aatatt