Sequence of DPV Influenza C virus

Influenza C/California/78, hemagglutinin (seg 4), cDNA.

ACC No: K01689

Dated: 2000-03-04 | Length: 2071 | CRC: -533378438

                !!NA_SEQUENCE 1.0
ID   ORCC78HA   standard; RNA; VRL; 2071 BP.
XX
AC   K01689;
XX
SV   K01689.1
XX
DT   07-NOV-1985 (Rel. 07, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 6)
XX
DE   Influenza C/California/78, hemagglutinin (seg 4), cDNA.
XX
KW   glycoprotein; haemagglutinin.
XX
OS   Influenzavirus C
OC   Viruses; ssRNA negative-strand viruses; Orthomyxoviridae.
XX
RN   [1]
RP   1-2071
RX   MEDLINE; 84138802.
RA   Nakada S., Creager R.S., Krystal M., Aaronson R.P., Palese P.;
RT   "Influenza C virus hemagglutinin: Comparison with influenza A and B virus
RT   hemagglutinins";
RL   J. Virol. 50:118-124(1984).
XX
DR   SWISS-PROT; P03465; HEMA_INCCA.
XX
CC   The location of the hemagglutinin gene was deduced by computer
CC   analysis of influenza C/Cal/78 and comparison with the
CC   organizations of influenzas B/Lee/40 and A/PR/8/43. The alignment
CC   of the amino acid sequence of the C virus HA remains tentative,
CC   since it is based solely on structural homologies. Plus strand is
CC   shown.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2071
FT                   /db_xref="taxon:11552"
FT                   /organism="Influenzavirus C"
FT   CDS             22. .1986
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P03465"
FT                   /note="hemagglutinin precursor (putative); putative"
FT                   /protein_id="AAA43791.1"
FT                   /translation="MFFSLLLMLGLTEAEKIKICLQKQVNSSFSLHNGFGGNLYATEEK
FT                   RMFELVKPKAGASVLNQSTWIGFGDSRTDQSNSAFPRSLMSAKTADKFRSLSGGSLMLS
FT                   MFGPPGKVDYLYQGCGKHKVFYEGVNWSPHAAIDCYRKNWTDIKLNFQKSIYELASQSH
FT                   CMSLVNALDKTIPLQVTKGVAKNCNNSFLKNPALYTQEVKPLEQICGEENLAFFTLPTQ
FT                   FGTYECKLHLVASCYFIYDSKEVYNKRGCGNYFQVIYDSSGKVVGGLDNRVSPYTGNSG
FT                   DTPTMQCDMLQLKPGRYSVRSSPRFLLMPERSYCFDMKEKGPVTAVQSIWGKGRKSDYA
FT                   VDQACLSTPGCMLIQKQKPYIGEADDHHGDQEMRELLSGLDYEARCISQSGWVNETSPF
FT                   TEEYLLPPKFGRCPLAAKEESIPKIPDGLLIPTSGTDTTVTKPKSRIFGIDDLIIGLLF
FT                   VAIVEAGIGGYLLGSRKESGGGVTKESAEKGFEKIGNDIQILRSSTNIAIEKLNDRISH
FT                   DEQAIRDLTLEIENARSEALLGELGIIRALLVGNISIGLQESLWELASEITNRAGDLAV
FT                   EVSPGCWIIDNNICDQSCQNFIFKFNETAPVPTIPPLDTKIDLQSDPFYWGSSLGLAIT
FT                   AANLMAALVISGIAICRTK"
FT   sig_peptide     22. .63
FT                   /note="hemagglutinin signal peptide (putative); putative"
FT   mat_peptide     64. .1356
FT                   /note="hemagglutinin HA1 chain (putative); putative"
FT   mat_peptide     1357. .1983
FT                   /note="hemagglutinin HA2 chain (putative); putative"
XX
SQ   Sequence 2071 BP; 696 A; 381 C; 440 G; 554 T; 0 other;

K01689  Length: 2071  September 18, 2002 10:52  Type: N  Check: 4604  ..

       1  agcaaaagca ggggtttaat aatgtttttc tcattactct tgatgttggg
      51  cctcacagag gctgaaaaaa taaagatatg ccttcaaaag caagtgaaca
     101  gtagcttcag cctacacaat ggcttcggag gaaatttgta tgccacagaa
     151  gaaaaaagaa tgtttgagct tgttaagccc aaagctggag cctctgtctt
     201  gaatcaaagc acatggattg gctttggaga ttcaagaact gaccaaagca
     251  attcagcttt tcctaggtcg ctgatgtcag caaaaactgc tgataaattt
     301  cgttctttgt ctggtggatc cttgatgttg agtatgtttg gcccacctgg
     351  gaaggtagat tacctttacc aaggatgtgg aaagcataaa gttttttatg
     401  aaggagtcaa ctggagtcca catgctgcta tagattgtta cagaaaaaat
     451  tggactgaca tcaaactgaa tttccagaaa agcatttatg aattggcttc
     501  acaatcacat tgcatgagct tggtgaatgc cttggacaaa actattcctt
     551  tacaagtgac taaaggagtt gcaaaaaatt gcaacaacag cttcttaaaa
     601  aatccagcat tgtacacaca agaagtcaaa cctttagagc aaatatgtgg
     651  ggaagaaaat cttgcttttt tcacacttcc aacccaattt ggaacctatg
     701  agtgcaaact gcatcttgtg gcttcttgct atttcatcta tgatagcaaa
     751  gaagtgtaca ataaaagagg atgtggcaac tactttcaag tgatctatga
     801  ttcatctgga aaagttgttg gagggctaga taacagggta tcaccttaca
     851  cagggaattc tggagacact ccaacaatgc aatgtgacat gctccagctg
     901  aaacctggaa gatattcagt aagaagctct ccaagattcc ttttaatgcc
     951  tgaaaggagt tattgctttg acatgaaaga aaaaggacca gtcactgctg
    1001  tccaatccat ctggggaaaa ggcagaaaat ctgactatgc agtagatcag
    1051  gcttgcttga gcactccagg gtgcatgttg atccaaaagc aaaagccata
    1101  cattggagag gctgatgatc accatggaga tcaagaaatg agggagttgc
    1151  tgtcaggact ggactatgaa gctagatgca tatcacaatc agggtgggtg
    1201  aatgaaacca gtccttttac ggaagaatac ctccttcctc ccaaatttgg
    1251  aagatgtccc ttggccgcaa aggaagaatc cattccaaaa atcccagatg
    1301  gacttctaat tcccaccagt ggaactgata ccactgtaac caaacctaaa
    1351  agcagaattt ttggaatcga tgaccttatt attggtctac tatttgttgc
    1401  aattgttgaa gcaggaattg gaggctatct gcttggaagt agaaaagaat
    1451  caggaggagg tgtgacaaaa gaatcagctg aaaaagggtt tgaaaaaatt
    1501  ggaaatgaca tacaaatctt aagatcttct acaaatattg caatagaaaa
    1551  actgaacgac agaatttctc atgatgagca agccatcaga gatctaactt
    1601  tagaaattga aaatgcaaga tctgaagctc tattaggaga attgggaata
    1651  ataagagcct tgctggtagg aaatataagc ataggattac aagaatcttt
    1701  atgggaacta gcttcagaaa taacaaatag agcaggagac ctggcagtcg
    1751  aagtctctcc aggttgctgg ataatcgaca ataacatttg tgatcaaagt
    1801  tgtcaaaact ttattttcaa gttcaacgaa actgcgcctg ttccaaccat
    1851  tccccctctt gacacaaaaa ttgatctgca atcagatcct ttttactggg
    1901  gaagcagctt gggcttagca ataactgctg ctaatctaat ggcagctttg
    1951  gtgatctctg ggatcgccat ctgcagaact aaatgatcag gacaattttg
    2001  aaaaatggat aatatattag tcaatatttt gtacagcttt ataaaaaaac
    2051  aaaaaacccc ttgctactgc t