Sequence of DPV Sida yellow vein Vietnam virus

Sida yellow vein Vietnam virus segment DNA-A, complete sequence.

ACC No: DQ641696

Dated: 2007-12-21 | Length: 2753 | CRC: 443037450

                
ID   DQ641696; SV 1; circular; genomic DNA; STD; VRL; 2753 BP.
XX
AC   DQ641696;
XX
DT   03-MAY-2007 (Rel. 91, Created)
DT   21-DEC-2007 (Rel. 94, Last updated, Version 2)
XX
DE   Sida yellow vein Vietnam virus segment DNA-A, complete sequence.
XX
KW   .
XX
OS   Sida yellow vein Vietnam virus
OC   Viruses; ssDNA viruses; Geminiviridae; Begomovirus;
OC   unclassified Begomovirus.
XX
RN   [1]
RP   1-2753
RX   PUBMED; 18089756.
RA   Ha C., Coombs S., Revill P., Harding R., Vu M., Dale J.;
RT   "Molecular characterization of begomoviruses and DNA satellites from
RT   Vietnam: additional evidence that the New World geminiviruses were present
RT   in the Old World prior to continental separation";
RL   J. Gen. Virol. 89(PT 1):312-326(2008).
XX
RN   [2]
RP   1-2753
RA   Ha C.V., Coombs S., Revill P.A., Harding R.M., Vu M.T., Dale J.L.;
RT   ;
RL   Submitted (18-MAY-2006) to the EMBL/GenBank/DDBJ databases.
RL   Institute of Health and Biomedical Innovation, Queensland University of
RL   Technology, 2 George Street, Brisbane, Queensland 4001, Australia
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2753
FT                   /organism="Sida yellow vein Vietnam virus"
FT                   /segment="DNA-A"
FT                   /specific_host="Arrow leaf (Sida rhombiforlia, Malvaceae);
FT                   weed"
FT                   /mol_type="genomic DNA"
FT                   /country="Viet Nam:Hanoi"
FT                   /note="acronym: SiYVVNV"
FT                   /db_xref="taxon:390441"
FT   gene            133. .483
FT                   /gene="AV2"
FT   CDS             133. .483
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="AV2 protein"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:A5H176"
FT                   /protein_id="ABG26036.1"
FT                   /translation="MWDPLLNEFPETVHGLRCMLAVKYCHLLEQTYAPDTVGYDLVRDL
FT                   ISVIRARDYVEATRRYSHFNARIQGTSTAELRQPRHEPCCCPHCPRHKSKAIVDLQAHE
FT                   SQTQNVQDVQKP"
FT   gene            293. .1066
FT                   /gene="AV1"
FT   CDS             293. .1066
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="CP protein"
FT                   /db_xref="GOA:A5H177"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:A5H177"
FT                   /protein_id="ABG26035.1"
FT                   /translation="MSKRPADIVISTPASKVRRRLNFDSPGMSRAAAPTVLVTNRKRSW
FT                   TYRPMNRKPRMYRMYRSPDVPRGCEGPCKVQSYEARHDIAHVGKVLCVSDVTRGNGLTH
FT                   RVGKRFCVKSVYVLGKVWMDENIKKKNHTNTVMFFLVRDRRPFGTPQDFGQVFNMYDNE
FT                   PSTATVKNDNRDRFQVLRRFQATVTGGEHACKEQALVRKFMKINNHVTYNHQEAAKYDN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDSVQN"
FT   gene            complement(1063. .1467)
FT                   /gene="AC3"
FT   CDS             complement(1063. .1467)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="REn protein"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:A5H178"
FT                   /protein_id="ABG26039.1"
FT                   /translation="MDFRTGEPITATQAENGAYIWTVNNPLYFKITQHDERPFSTKHDV
FT                   ITVQVQFNYNLRKALGIHQCFLICQIWTRLRPQTWRFLRVFKYQCMKYLNNLGVISINN
FT                   VIRAMNHVLYDKLEGTIEAHFSYIIKFNLY"
FT   gene            complement(1208. .1615)
FT                   /gene="AC2"
FT   CDS             complement(1208. .1615)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="TrAP protein"
FT                   /db_xref="GOA:A5H179"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:A5H179"
FT                   /protein_id="ABG26038.1"
FT                   /translation="MQNSSPSRSHCTQVPIKVQHRTAKKRPIRRRRVDLPCGCSYYFGL
FT                   NCASHGFSHRGTHHCNSGREWRLYLDGQQSPIFQDHPARREAVFDETRRNNSPSPVQLQ
FT                   PAESVGDTPVFSDLPNLDSFTSSDLAFLKSI"
FT   gene            complement(1515. .2603)
FT                   /gene="AC1"
FT   CDS             complement(1515. .2603)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="rep protein"
FT                   /db_xref="GOA:A5H180"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="UniProtKB/TrEMBL:A5H180"
FT                   /protein_id="ABG26037.1"
FT                   /translation="MPPPRRFRLSAKNYFLTYPQCSLTKEEALSQLQNLNTPTNKKYIK
FT                   ICRELHEDGSPHLHVLIQFEGKYVCTNNRFFDLVSPTRSAHFHPNIQGAKSSSDVKSYI
FT                   DKDGDTLEWGEFQIDGRSARGGQQSANDAYAAALNTGSKSEALNIIRELAPKDYVLQFH
FT                   NLNANLDRIFAPPIEVFVCPFLSSSFDQVPEELECWVSENVRDAAARPWRPISIVIEGE
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEYLDEEKNSAIRDWALKNAEFFTLTEPL
FT                   YSSTHQSPTPDSEEEAHSEASR"
FT   gene            complement(2156. .2446)
FT                   /gene="AC4"
FT   CDS             complement(2156. .2446)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:A5H175"
FT                   /protein_id="ABG26040.1"
FT                   /translation="MGALISTCLSSSRENTSARITDSSTWYPQPGQHISIRTYRELNPA
FT                   PTSSPTSIRTETPSSGASFRSTDDLQEGANNQPMTLTPQHLTQAVSQRLLI"
XX
SQ   Sequence 2753 BP; 706 A; 578 C; 673 G; 796 T; 0 other;

dq641696 Length: 2753  21-DEC-2007  Type: N  Check: 9909  ..

       1  accggatggc cgcgcgattt tttagggtgg tccccgcacg cgcttttgtc
      51  ttccaatcaa aacgctccct caaagctagt ttaaaaaaaa cccgctataa
     101  atacttaggg actaagttat gaaggaaata aaatgtggga tccactgttg
     151  aacgagtttc cagaaaccgt tcacggtcta cggtgtatgc ttgcggtgaa
     201  atactgtcat ctattggagc agacgtacgc gccggatacg gtgggttacg
     251  atcttgtcag agacttgata tccgtcatac gagctcgtga ctatgtcgaa
     301  gcgacccgcc gatatagtca tttcaacgcc cgcatccaag gtacgtcgac
     351  ggctgaactt cgacagccca ggcatgagcc gtgctgctgc ccccactgtc
     401  ctcgtcacaa atcgaaagcg atcgtggact tacaggccca tgaatcgcaa
     451  acccagaatg tacaggatgt acagaagccc tgatgttcct cgtggatgtg
     501  aaggcccgtg taaggtccag tcgtatgaag cccgtcatga tatagcccat
     551  gtaggtaagg tattgtgtgt gtctgatgtc acgcgtggta atgggttgac
     601  ccatcgagtt ggtaagaggt tttgtgttaa gtccgtgtac gtgttgggta
     651  aggtctggat ggatgaaaac atcaagaaga aaaatcacac taacactgtt
     701  atgttttttt tagttcgtga tcggagaccg tttgggactc ctcaggattt
     751  cggtcaggtg tttaatatgt atgataatga gcctagtacg gcaactgtga
     801  agaacgataa tagggatcgt ttccaggtat tgcgtcgatt tcaggcaacg
     851  gtcactggtg gagagcacgc gtgtaaggag caagcccttg ttaggaagtt
     901  tatgaaaatt aacaatcatg taacctataa tcatcaggag gcagcgaagt
     951  atgataacca tactgagaat gcgttgttat tgtatatggc atgtactcat
    1001  gctagtaatc cagtgtatgc tacgttgaaa atcaggatct acttctatga
    1051  ttctgttcag aattaataaa gattgaattt tattatatat gaaaaatgag
    1101  cctcaattgt gccctcgagt ttatcgtaca gtacatgatt cattgcccta
    1151  attacattgt taatgctaat tactcctaaa ttgttcaaat acttcatgca
    1201  ttgatactta aatactctta agaaacgcca agtctgagga cgtaaacgag
    1251  tccagatttg gcagatcaga aaacactggt gtatccccaa cgctttccgc
    1301  aggttgtagt tgaactggac ttggactgtt attacgtcgt gtttcgtcga
    1351  aaacggcctc tcgtcgtgct gggtgatctt gaaatatagg ggattgttga
    1401  ccgtccagat ataggcgcca ttctctgcct gagttgcagt gatgggttcc
    1451  cctgtgcgaa aatccatgac ttgcgcaatt gagtccaaaa taataagagc
    1501  aaccgcaggg aagatcaacg cgacgcctcc gaatgggcct cttcttcgct
    1551  gtccggtgtt ggactttgat gggtacttga gtacaatggc tccgtgaggg
    1601  tgaagaattc tgcattcttt aatgcccagt ctctgattgc tgaattcttt
    1651  tcctcgtcca agtactcttt atacgatgaa gttggtcctg gattgcaaag
    1701  gaagattgtc gggatacctc ctttaatttg aattggtttc ccgtacttcg
    1751  tgttgctttg ccagtccctt tgggccccca tgaattcctt gaaatgtttc
    1801  aggtagtggg ggtcgacgtc atcgattacg ttgtaccacg catcattgct
    1851  gtagaccttt gggcttaggt ccagatgtcc acacagatag ttgtgtgggc
    1901  ctagagatcg tgcccacatc gtcttcccgg ttctactctc accctctatg
    1951  actatactga tcggtcgcca aggccgcgca gcggcatccc tcacgttctc
    2001  tgacacccag cattcaagtt cttctggaac ttggtcgaaa gaagaagaaa
    2051  gaaatggaca aacaaaaacc tctataggag gtgcaaaaat cctatctaaa
    2101  ttagcattta aattatgaaa ctgtaaaaca taatctttgg gagctaattc
    2151  cctaattata ttaagagcct ctgacttact gcctgtgtta agtgctgcgg
    2201  cgtaagcgtc attggctgat tgttggcccc ctcttgcaga tcgtccgtcg
    2251  atctgaaact cgccccactc gagggtgtct ccgtccttat cgatgtagga
    2301  cttgacgtcg gagctggatt tagctccctg tatgttcgga tggaaatgtg
    2351  ctgacctggt tggggatacc aggtcgaaga atctgttatt cgtgcagacg
    2401  tattttccct cgaactggat aagcacgtgg agatgagggc tcccatcttc
    2451  gtgaagttct ctgcagatct tgatgtattt tttatttgtg ggggtgttta
    2501  ggttttgtag ttgggaaagt gcctcctctt tggtgagaga gcactgtgga
    2551  taagtgagga aataattttt agcagataac ctaaaacgac gcggaggagg
    2601  catattgcgc gtcgttttgt atcggtgaca atcaactctg tggaatgaat
    2651  tggtgactgg tgtacaatat atagttgtca ccaaatggca ttctcgtaaa
    2701  tcctcataga aattcaaaat tcgaattggg aaagcggcca tccgtataat
    2751  att