Genome mutations in novel human H7N9 influenza virus
We gratefully acknowledge the authors, originating and submitting laboratories of the sequences from GISAID’s EpiFlu™ Database on which this analysis is based. A list of the sequences, their submitting labs and GISAID links is provided on this page.
This page presents the analysis of mutations in the virus genomes occurred before and during the emergence of human H7N9 in China.
Segment 1 - PB2 gene
Maximum likelihood tree of available PB2 outbreak sequences (as of 2013-05-01) omitting the 627 codon to avoid biasing the tree if this has convergent mutations.
Now there are many more sequences from the outbreak, a clearer pattern of the E627K mutation in PB2 appears. The 627K is only found in human cases and even then not all of them. This would appear to mean that these mutations are occuring de novo in each human case. This may be due to the fact that these are severe, hospitalized cases which can be associated with a longer infection period giving the virus time to adapt to the human host. This would also suggest this mutation is not involved in the increased human-susceptability that seems to be associated with this virus. All the human cases with 627K involve a GAG -> AAG codon mutation (i.e., a G to A transition in the first codon position) but as both glutamic acid and lysine are both encoded by only 2 codons, this is the only available path between these amino acids.
Previous notes from Tommy Lam (see revisions for full details).
In summary, there are a number of distinctive mutations. (1) A short deletion (69-73aa) in stalk region of NA protein. (2) 627 E > K mutation in PB2 sequence, indicative of mammalian adaptation. (3) Quite a few amino acid substitutions in NP protein. Interestingly, a recent paper (Bogs et al. 2011) suggests PB2 627 mutation depends on the origin of NP protein.
Points to note - whether these substitutions on the 'pre-emergence' branch did actually occur before the human H7N9 lineage relies on the assumption that the same mutation has not independently occurred during the infections (or tranmissions to) of the H7N9 patients. Even this assumption holds, it is expected that the most recent common ancestor (e.g. red dot) carry these mutations; Addressing whether these mutations occurred in the source animal population or just during the cross-species infection in the human requires the further understanding of the number of such inter-species transmissions that have led to this H7N9 emergence, as well as the genome sequences of the direct virus precursors from the animal population.
It is especially controversial as to whether PB2 627 E > K mutation occurred before or after (or during) jumping to human. Previous experiments have demonstrated this mutation could be volatile during cell passage. It is also noteworthy that most of the 627K mutations in H5N1 have occurred in mammalian hosts (but not solely - some bird H5N1 viruses do carry that mutation).