Sequence of DPV Influenza A virus
Influenza A/Puerto Rico/8/34(Cambridge) (H1N1), nucleoprotein (seg 5 of complete genome).
ACC No: J02147
Dated: 2000-03-04 | Length: 1565 | CRC: 2084030073
!!NA_SEQUENCE 1.0
ID FLANPM0X standard; RNA; VRL; 1565 BP.
XX
AC J02147;
XX
SV J02147.1
XX
DT 13-JAN-1992 (Rel. 30, Created)
DT 04-MAR-2000 (Rel. 63, Last updated, Version 2)
XX
DE Influenza A/Puerto Rico/8/34(Cambridge) (H1N1), nucleoprotein (seg 5 of
DE complete genome).
XX
KW complete genome; nucleoprotein.
XX
OS Influenzavirus A
OC Viruses; ssRNA negative-strand viruses; Orthomyxoviridae;
OC Influenza A viruses.
XX
RN [1]
RP 1-1517
RX MEDLINE; 81236585.
RA van Rompuy L., Min J.W., Huylebroeck D., Devos R., Fiers W.;
RT "Complete nucleotide sequence of the nucleoprotein gene from the human
RT influenza strain A/PR/8/34 (HON1)";
RL Eur. J. Biochem. 116:347-353(1981).
XX
RN [2]
RP 1-1565
RX MEDLINE; 82041445.
RA Winter G., Fields S.;
RT "the structure of the gene encoding the nucleoprotein of human influenza
RT virus A/PR/8/34";
RL Virology 114:423-428(1981).
XX
RN [3]
RP 1-1565
RA van Rompuy L., Min J.W., Huylebroeck D., Devos R., Fiers W.;
RT "Complete nucleotide sequence of the nucleoprotein gene from the human
RT influenza strain A/PR/8/34 (HON1), Correction";
RL Eur. J. Biochem. 116:645-645(1982).
XX
DR SPTREMBL; Q67228; Q67228.
XX
CC assignment of coding region by consideration of open reading frames
CC and by comparison of predicted mw to nucleoprotein mw. [Eur. J.
CC Biochem. 116, 645-645 (1981)] is a
CC major revision of [1], so the sequence shown below reflects only
CC [2],[Eur. J. Biochem. 116, 645-645 (1981)]. [1] compared with
CC grantham's data.
CC Complete source information:
CC influenza [2]: A/Puerto Rico/8/34 cdna to rna from human; [1],[Eur.
CC J. Biochem. 116, 645-645 (1981)]:
CC x31 (a laboratory recombinant containing the A/Puerto Rico/8/34
CC nucleoprotein segment) cdna to rna, originally from human.
XX
FH Key Location/Qualifiers
FH
FT source 1. .1565
FT /db_xref="taxon:11320"
FT /organism="Influenzavirus A"
FT CDS 46. .1542
FT /codon_start=1
FT /db_xref="SPTREMBL:Q67228"
FT /note="nucleoprotein"
FT /protein_id="AAA43467.1"
FT /translation="MASQGTKRSYEQMETDGERQNATEIRASVGKMIGGIGRFYIQMCT
FT ELKLSDYEGRLIQNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVNGKW
FT MRELILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRTGMDPRM
FT CSLMQGSTLPRRSGAAGAAVKGVGTMVMELVRMIKRGINDRNFWRGENGRKTRIAYERM
FT CNILKGKFQTAAQKAMMDQVRESRDPGNAEFEDLTFLARSALILRGSVAHKSCLPACVY
FT GPAVASGYDFEREGYSLVGIDPFRLLQNSQVYSLIRPNENPAHKSQLVWMACHSAAFED
FT LRVLSFIKGTKVVPRGKLSTRGVQIASNENMETMESSTLELRSRYWAIRTRSGGNTNQQ
FT RASAGQISIQPTFSVQRNLPFDRTTVMAAFTGNTEGRTSDMRTEIIRMMESARPEDVSF
FT QGRGVFELSDEKAASPIVPSFDMSNEGSYFFGDNAEEYDN"
FT unsure 589
FT /note="g in 2 clones, a in 1 clone [2]"
XX
SQ Sequence 1565 BP; 504 A; 314 C; 412 G; 335 T; 0 other;
J02147 Length: 1565 September 18, 2002 10:52 Type: N Check: 1656 ..
1 agcaaaagca gggtagataa tcactcactg agtgacatca aaatcatggc
51 gtcccaaggc accaaacggt cttacgaaca gatggagact gatggagaac
101 gccagaatgc cactgaaatc agagcatccg tcggaaaaat gattggtgga
151 attggacgat tctacatcca aatgtgcaca gaacttaaac tcagtgatta
201 tgagggacgg ttgatccaaa acagcttaac aatagagaga atggtgctct
251 ctgcttttga cgaaaggaga aataaatacc tggaagaaca tcccagtgcg
301 gggaaggatc ctaagaaaac tggaggacct atatacagaa gagtaaacgg
351 aaagtggatg agagaactca tcctttatga caaagaagaa ataaggcgaa
401 tctggcgcca agctaataat ggtgacgatg caacggctgg tctgactcac
451 atgatgatct ggcattccaa tttgaatgat gcaacttatc agaggacaag
501 ggctcttgtt cgcaccggaa tggatcccag gatgtgctct ctgatgcaag
551 gttcaactct ccctaggagg tctggagccg caggtgctgc agtcaaagga
601 gttggaacaa tggtgatgga attggtcagg atgatcaaac gtgggatcaa
651 tgatcggaac ttctggaggg gtgagaatgg acgaaaaaca agaattgctt
701 atgaaagaat gtgcaacatt ctcaaaggga aatttcaaac tgctgcacaa
751 aaagcaatga tggatcaagt gagagagagc cgggacccag ggaatgctga
801 gttcgaagat ctcacttttc tagcacggtc tgcactcata ttgagagggt
851 cggttgctca caagtcctgc ctgcctgcct gtgtgtatgg acctgccgta
901 gccagtgggt acgactttga aagagaggga tactctctag tcggaataga
951 ccctttcaga ctgcttcaaa acagccaagt gtacagccta atcagaccaa
1001 atgagaatcc agcacacaag agtcaactgg tgtggatggc atgccattct
1051 gccgcatttg aagatctaag agtattgagc ttcatcaaag ggacgaaggt
1101 ggtcccaaga gggaagcttt ccactagagg agttcaaatt gcttccaatg
1151 aaaatatgga gactatggaa tcaagtacac ttgaactgag aagcaggtac
1201 tgggccataa ggaccagaag tggaggaaac accaatcaac agagggcatc
1251 tgcgggccaa atcagcatac aacctacgtt ctcagtacag agaaatctcc
1301 cttttgacag aacaaccgtt atggcagcat tcactgggaa tacagagggg
1351 agaacatctg acatgaggac cgaaatcata aggatgatgg aaagtgcaag
1401 accagaagat gtgtctttcc aggggcgggg agtcttcgag ctctcggacg
1451 aaaaggcagc gagcccgatc gtgccttcct ttgacatgag taatgaagga
1501 tcttatttct tcggagacaa tgcagaggag tacgacaatt aaagaaaaat
1551 acccttgttt ctact