Deep Mutational Scanning and Structural Modeling of Avian Influenza HA: Predicting Zoonotic Risk from Computational Binding Landscapes
Introduction
Avian influenza viruses, particularly those of the H5N1 and H7N9 subtypes, represent a persistent threat to poultry health and a potential source of zoonotic spillover events [1, 2]. The primary molecular barrier to cross-species transmission is the binding specificity of the viral hemagglutinin (HA) glycoprotein for host cell surface sialic acid receptors [3]. Avian influenza viruses preferentially bind to alpha-2,3-linked sialic acids (SAα2,3Gal), which are abundant in the intestinal and respiratory tracts of birds, whereas human-adapted influenza viruses bind to alpha-2,6-linked sialic acids (SAα2,6Gal) prevalent in the human upper respiratory tract [4, 3]. A switch in HA receptor-binding specificity from SAα2,3Gal to SAα2,6Gal is a critical step for zoonotic transmission and pandemic potential [5, 4].
Deep mutational scanning (DMS) has emerged as a powerful experimental technique to systematically measure the functional impact of every possible single amino acid mutation in a protein [6, 7]. When applied to avian influenza HA, DMS generates comprehensive fitness landscapes that reveal which mutations are tolerated and which alter receptor-binding specificity [6, 5]. These experimental data are increasingly integrated with computational structural models, such as those generated by Rosetta and AlphaFold2, to predict the biophysical consequences of mutations on HA-receptor interactions [8, 9]. This article reviews the methodological framework for combining DMS data with structural modeling to construct computational binding landscapes that forecast zoonotic risk in avian influenza viruses.
Deep Mutational Scanning of Avian Influenza Hemagglutinin
Experimental Principles of DMS
Deep mutational scanning involves the construction of a library of viral HA variants, each containing a single amino acid substitution [6]. This library is expressed on the surface of cells or in viral particles and subjected to a functional selection, such as binding to a specific receptor analog or antibody [10, 7]. The relative enrichment or depletion of each variant after selection is quantified using high-throughput sequencing, yielding a functional score for each mutation [6]. These scores are typically normalized to the wild-type sequence and can be interpreted as a measure of fitness or binding affinity [6, 5].
For avian influenza HA, DMS experiments have been performed to assess the impact of mutations on receptor-binding specificity, thermostability, and antibody escape [11, 10, 7]. For example, DMS of H5N1 HA has identified mutations in the receptor-binding site (RBS) that enhance binding to human-type SAα2,6Gal receptors while maintaining or reducing binding to avian-type SAα2,3Gal receptors [5]. Similarly, DMS of H9N2 HA has revealed mutations that potentiate virus transmission in warming environments, linking thermostability to host range [11].
Key Mutations Identified by DMS
Several canonical mutations in the HA RBS are known to modulate receptor-binding specificity. The Q226L and G228S substitutions (H3 numbering) in the HA1 subunit are among the most well-characterized switches that shift binding preference from SAα2,3Gal to SAα2,6Gal in H2 and H3 subtypes [5, 4]. In H5N1 and H7N9 viruses, analogous mutations at these positions have been shown to enhance human-type receptor binding [5, 12]. DMS data have confirmed that these mutations are highly enriched under selection for human-type receptor binding [5]. Additional mutations, such as N158K, T160A, and S137A, have also been implicated in modulating receptor specificity and antigenicity [13, 14].
The DMS approach has further revealed that the fitness landscape of HA is highly epistatic, meaning that the effect of a mutation depends on the genetic background [6, 5]. For instance, the Q226L mutation may only confer a strong switch to SAα2,6Gal binding in the presence of permissive secondary mutations that stabilize the RBS [5]. This epistasis complicates simple sequence-based predictions and underscores the need for structural modeling to interpret mutational effects in a three-dimensional context [6, 9].
Structural Modeling of Hemagglutinin-Receptor Interactions
Computational Tools for Structure Prediction
High-resolution crystal structures of HA in complex with sialic acid receptor analogs provide the foundation for computational modeling [15, 16]. However, for emerging viral strains or engineered variants, experimental structures may not be available. In such cases, homology modeling and deep learning-based methods such as AlphaFold2 are employed to generate reliable structural models of HA [8, 15, 9]. AlphaFold2 has demonstrated remarkable accuracy in predicting protein structures, including the globular head domain of HA that contains the RBS [8, 9].
Rosetta is another widely used suite for protein modeling and design [7, 8]. Rosetta can be used to predict the structure of HA mutants, calculate binding free energies between HA and receptor analogs, and design stabilizing mutations [7, 8]. The Rosetta energy function evaluates van der Waals interactions, hydrogen bonding, solvation, and electrostatic contributions to estimate the binding affinity of a protein-ligand complex [8, 17].
Free Energy Perturbation and Molecular Dynamics
To quantitatively predict changes in receptor-binding affinity upon mutation, free energy perturbation (FEP) calculations are often employed [18, 17]. FEP is a rigorous thermodynamic method that computes the difference in binding free energy between a wild-type and mutant complex by simulating the alchemical transformation of one residue into another [18]. When applied to HA-receptor complexes, FEP can predict the relative binding affinity for SAα2,3Gal versus SAα2,6Gal receptors [18, 17].
Molecular dynamics (MD) simulations complement FEP by providing insights into the conformational dynamics of the HA-receptor interface [5, 18]. MD simulations can reveal how mutations alter the flexibility of key loops in the RBS, affect hydrogen bond networks, and modulate the overall stability of the HA trimer [5, 4]. For example, MD simulations of H5N1 HA have shown that host-switching mutations suppress site-specific activation dynamics, thereby stabilizing the pre-fusion conformation required for receptor binding [5].
Integrating DMS Data with Structural Models
Computational Binding Landscapes
The integration of DMS data with structural models yields computational binding landscapes that map the functional impact of mutations onto the three-dimensional structure of HA [6, 8]. In this framework, each mutation is assigned a DMS-derived functional score, and its structural context is analyzed using Rosetta or AlphaFold2 models [6, 7]. Mutations that cluster in the RBS and are predicted to alter binding free energy toward human-type receptors are flagged as high-risk candidates for zoonotic spillover [8, 5].
A typical computational pipeline for constructing binding landscapes includes the following steps:
- DMS Data Generation: Perform a deep mutational scan of the HA gene, selecting for binding to SAα2,6Gal receptor analogs [6, 5].
- Structure Prediction: Generate a structural model of the wild-type HA using AlphaFold2 or a crystal structure if available [8, 15].
- Mutation Modeling: Introduce each single amino acid substitution into the structural model using Rosetta's fixed-backbone design or flexible backbone protocols [7, 8].
- Binding Free Energy Calculation: Compute the binding free energy of each HA variant with SAα2,3Gal and SAα2,6Gal receptor analogs using Rosetta or FEP [8, 17].
- Correlation Analysis: Correlate DMS functional scores with computed binding free energy changes to validate the computational model [6, 8].
- Risk Scoring: Assign a zoonotic risk score to each mutation based on its DMS score, predicted binding affinity switch, and structural context [8, 9].
Workflow Diagram
The following Mermaid diagram illustrates the integrated workflow for predicting zoonotic risk from DMS and structural modeling.
graph TD
A[HA Gene Library], > B[Deep Mutational Scanning]
B, > C[Functional Scores for Each Mutation]
D[Wild-Type HA Sequence], > E[AlphaFold2 / Rosetta Structure Prediction]
E, > F[Structural Model of HA]
C, > G[Correlation Analysis]
F, > G
G, > H[Validated Computational Model]
H, > I[Free Energy Perturbation / MD Simulations]
I, > J[Predicted Binding Affinity for SAα2,3Gal vs SAα2,6Gal]
J, > K[Zoonotic Risk Score Assignment]
K, > L[High-Risk Mutation Identification]
L, > M[Surveillance and Experimental Validation]
Predicting Zoonotic Spillover
Key Structural Determinants of Host Range
The receptor-binding site of HA is a shallow pocket formed by the 130-loop, 190-helix, and 220-loop [5, 4]. The identity of residues at positions 226 and 228 is critical for determining receptor specificity [5, 4]. In avian-adapted HAs, residue 226 is typically glutamine (Q), which forms a hydrogen bond with the SAα2,3Gal linkage [4]. The substitution to leucine (L) at position 226 creates a hydrophobic environment that favors the SAα2,6Gal conformation [5, 4]. Similarly, glycine at position 228 (G228) allows for a more open pocket that accommodates the SAα2,6Gal linkage, whereas serine (S228) can form additional hydrogen bonds with the receptor [4].
DMS studies have shown that these canonical mutations are not sufficient on their own to confer full human-type receptor binding in all HA subtypes [6, 5]. For example, in H5N1 clade 2.3.4.4b viruses, the Q226L and G228S mutations must be accompanied by additional changes in the 130-loop and 190-helix to achieve high-affinity binding to SAα2,6Gal [5, 1]. Computational modeling has identified permissive mutations such as S137A, T160A, and N158K that stabilize the RBS and enhance the effect of the primary specificity-switching mutations [5, 13].
Machine Learning and AI Integration
Recent advances in machine learning have enabled the prediction of HA receptor-binding specificity directly from sequence and structure [8, 9]. AI-powered methods can identify human cell surface protein interactors of HA, providing a broader view of potential host range determinants beyond sialic acid binding [8]. For instance, machine learning models trained on DMS data and structural features can predict the pathogenicity of avian influenza viruses with high accuracy [9]. These models incorporate features such as electrostatic potential, hydrogen bond donor/acceptor density, and pocket volume at the RBS [8, 9].
Template-based structure prediction combined with machine learning has been used to classify avian influenza viruses as high or low pathogenicity based on HA structural features [9]. This approach leverages the fact that pathogenic mutations often cluster in specific structural regions, such as the RBS and the fusion peptide [9, 4]. By integrating DMS data with these predictive models, researchers can prioritize viral strains for enhanced surveillance and experimental characterization [8, 9].
Cross-Linking to Related Articles
For a broader discussion of how DMS and computational modeling are applied to other viral glycoproteins, readers are directed to the article on Deep Mutational Scanning and Computational Modeling of Avian Influenza Hemagglutinin for Zoonotic Risk Prediction. The principles of receptor-binding dynamics are further explored in Structural Dynamics of Avian Influenza Hemagglutinin: Molecular Modeling and Receptor Binding Predictions for Pandemic Risk Assessment. For a comparative perspective on coronavirus spike protein modeling, see Deep Mutational Scanning and Computational Modeling of SARS-CoV-2 Spike-ACE2 Binding Dynamics for Predicting Zoonotic Risk.
Limitations and Future Directions
Challenges in Computational Prediction
Despite significant progress, several limitations remain in the computational prediction of zoonotic risk from HA binding landscapes. First, DMS data are typically generated in vitro using simplified receptor analogs, which may not fully recapitulate the complex glycan environment of the human respiratory tract [6, 19]. Second, the computational prediction of binding free energies using Rosetta or FEP is computationally expensive and can be sensitive to the force field parameters used [18, 17]. Third, epistatic interactions between mutations are difficult to model accurately, and current methods may underestimate the impact of non-additive effects [6, 5].
Need for Experimental Validation
Computational predictions must be validated experimentally using techniques such as glycan microarray binding assays, surface plasmon resonance, and viral pseudotype entry assays [5, 3]. The integration of computational and experimental approaches is essential for building robust risk assessment frameworks [8, 5]. Furthermore, surveillance of circulating avian influenza viruses through platforms such as GISAID provides the sequence data necessary to apply these predictive models in real time [1, 20].
Emerging Technologies
Advances in cryo-electron microscopy are providing high-resolution structures of HA in complex with native receptors, which can improve the accuracy of computational models [21]. Additionally, the development of protein language models trained on large sequence databases is enabling the prediction of mutational effects without the need for explicit structural modeling [6, 8]. These tools are likely to become increasingly important for rapid zoonotic risk assessment during outbreaks.
Conclusion
Deep mutational scanning and structural modeling of avian influenza HA provide a powerful framework for predicting zoonotic risk from computational binding landscapes. By integrating experimental functional scores with biophysical simulations, researchers can identify mutations that alter receptor-binding specificity and assess the pandemic potential of emerging viral strains. The continued refinement of these computational methods, combined with robust experimental validation and genomic surveillance, will enhance our ability to anticipate and mitigate the threat of avian influenza spillover into mammalian hosts.
References
[1] Singh K, Malik YS, Hemida MG. Comprehensive Evolutionary and Structural Analysis of the H5N1 Clade 2.4.3.4b Influenza a Virus Based on the Sequences and Data Mining of the Hemagglutinin, Nucleoprotein and Neuraminidase Genes Across Multiple Hosts. Pathogens. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41011764/
[2] Trinh TT, Tiwari I, Durairaj K et al. Genetic Characterization and Pathogenesis of Avian Influenza Virus H7N3 Isolated from Spot-Billed Ducks in South Korea, Early 2019. Viruses. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34067187/
[3] Verhagen JH, Eriksson P, Leijten L et al. Host Range of Influenza A Virus H1 to H16 in Eurasian Ducks Based on Tissue and Receptor Binding Studies. J Virol. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33361418/
[4] Russell CJ. Hemagglutinin Stability and Its Impact on Influenza A Virus Infectivity, Pathogenicity, and Transmissibility in Avians, Mice, Swine, Seals, Ferrets, and Humans. Viruses. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33923198/
[5] Kephart SM, Awatramani KF, Saunders MI et al. Host switching mutations in H5N1 influenza hemagglutinin suppress site-specific activation dynamics. Nat Commun. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41350454/
[6] Ahn JJ, Yu TC, Dadonaite B et al. Influenza hemagglutinin subtypes have different sequence constraints despite sharing extremely similar structures. Virus Evol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41971714/
[7] Dosey A, Dadonaite B, Gillespie RA et al. Stabilization of the H5 clade 2.3.4.4b hemagglutinin improves vaccine-elicited neutralizing antibody responses in mice. Sci Transl Med. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41779867/
[8] Poitras C, Coulombe B. AI-Powered Identification of Human Cell Surface Protein Interactors of the Hemagglutinin Glycoprotein of High-Pandemic-Risk H5N1 Influenza Virus. Viruses. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41472307/
[9] Shin JH, Kim SJ, Kim G et al. Machine Learning Using Template-Based-Predicted Structure of Haemagglutinin Predicts Pathogenicity of Avian Influenza. J Microbiol Biotechnol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39252651/
[10] Song W, Wang C, Xie W et al. Identification of a Key Hemagglutinin Mutation Mediating Antibody Escape in Influenza A(H1N1)pdm09 Viruses. Viruses. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41902257/
[11] Pei X, Chen W, Xiao H et al. HA thermostability mutations S84F, G167N, and D168N potentiate H9N2 virus transmission in a warming environment. Emerg Microbes Infect. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42343620/
[12] Zhao J, Zhu Y, Hu J et al. [Characterization of a monoclonal antibody against the hemagglutinin stem of H7N9 subtype avian influenza virus]. Sheng Wu Gong Cheng Xue Bao. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/35142127/
[13] Wang Z, Chen Y, Chen H et al. A single amino acid at position 158 in haemagglutinin affects the antigenic property of Eurasian avian-like H1N1 swine influenza viruses. Transbound Emerg Dis. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/34396699/
[14] Dong J, Gilchuk I, Li S et al. Anti-influenza H7 human antibody targets antigenic site in hemagglutinin head domain interface. J Clin Invest. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32749241/
[15] Cueno ME, Kamio N, Imai K. Avian influenza A H5N1 hemagglutinin protein models have distinct structural patterns re-occurring across the 1959-2023 strains. Biosystems. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39349133/
[16] Ohno A, Maita N, Tabata T et al. Crystal structure of inhibitor-bound human MSPL that can activate high pathogenic avian influenza. Life Sci Alliance. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33820827/
[17] Khan MY, Shah AU, Duraisamy N et al. Repurposing of Some Nucleoside Analogs Targeting Some Key Proteins of the Avian H5N1 Clade 2.3.4.4b to Combat the Circulating HPAI in Birds: An In Silico Approach. Viruses. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40733589/
[18] Alharby TN, Alanazi M, Khan KU et al. Identifying antivirals against influenza PA endonuclease with machine learning-based activity prediction, DFT optimization, and molecular dynamics simulation. Mol Divers. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41264202/
[19] Page CK, Mubassir MHM, Chopra P et al. N-glycosylation at the receptor binding site drives differences in receptor binding specificity between influenza B virus lineages. J Virol. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41190812/
[20] Wille M, Tolf C, Latorre-Margalef N et al. Evolutionary features of a prolific subtype of avian influenza A virus in European waterfowl. Virus Evol. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/36128050/
[21] Karakus U, Mena I, Kottur J et al. H19 influenza A virus exhibits species-specific MHC class II receptor usage. Cell Host Microbe. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38889725/
[22] Jones JC, Andreev K, Sheth V et al. Baloxavir alleviates severe disease and viremia in ferrets infected with avian- or bovine-origin influenza A(H5N1) virus. Sci Transl Med. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41706870/
[23] Caputo V, Libera M, Campos Mota Y et al. Extended Heterosubtypic Neutralization and Preclinical Model In Vivo Protection from Clade 2.3.4.4b H5 Influenza Virus Infection by Broadly Neutralizing Antibodies. Vaccines (Basel). 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41600987/
[24] Puente-Massaguer E, Galdino Andrade T, Scherm MJ et al. An H5N1 clade 2.3.4.4b virus vaccine that elicits cross-protective antibodies against conserved domains of H5 and N1 glycoproteins. Nat Commun. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41577702/
[25] Alzua GP, León AN, Yellin T et al. Human monoclonal antibodies that target clade 2.3.4.4b H5N1 hemagglutinin. Nat Commun. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41390501/
[26] Zhang J, Wu Y, Wang W et al. Genomic and structural characterization of a reassortant H9N2 avian influenza virus from a human case. Int J Infect Dis. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41360219/
[27] Meng F, Cheng Z, Feng Z et al. H128N Substitution in the Sa Antigenic Site of HA1 Causes Antigenic Drift Between Eurasian Avian-like H1N1 and 2009 Pandemic H1N1 Influenza Viruses. Viruses. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41157633/
[28] Puente-Massaguer E, Andrade TG, Scherm MJ et al. A clade 2.3.4.4b H5N1 virus vaccine that elicits cross-protective antibodies against conserved domains of H5 and N1 glycoproteins. bioRxiv. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40832286/
[29] Wang Z, Tian C, Zhu J et al. Avian influenza mRNA vaccine encoding hemagglutinin provides complete protection against divergent H5N1 viruses in specific-pathogen-free chickens. J Nanobiotechnology. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/39881325/
[30] Rathnasinghe R, Chang LA, Pearl R et al. Sequential immunization with chimeric hemagglutinin ΔNS1 attenuated influenza vaccines induces broad humoral and cellular immunity. NPJ Vaccines. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39300090/
[31] Tarrahimofrad H, Rahimnahal S, Zamani J et al. Designing a multi-epitope vaccine to provoke the robust immune response against influenza A H7N9. Sci Rep. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34966175/
[32] AboElkhair MA, Hasan ME, Mousa A et al. In-silico evidence for enhancement of avian influenza virus H9N2 virulence by modulation of its hemagglutinin (HA) antigen function and stability during co-infection with infectious bronchitis virus in chickens. Virusdisease. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34631979/
[33] Zhu WZ, Wen YC, Lin SY et al. Anti-Influenza Protective Efficacy of a H6 Virus-Like Particle in Chickens. Vaccines (Basel). 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32825685/
[34] Barberis A, Boudaoud A, Gorrill A et al. Full-length genome sequences of the first H9N2 avian influenza viruses isolated in the Northeast of Algeria. Virol J. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32680533/
[35] Liao HY, Wang SC, Ko YA et al. Chimeric hemagglutinin vaccine elicits broadly protective CD4 and CD8 T cell responses against multiple influenza strains and subtypes. Proc Natl Acad Sci U S A. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32669430/ *** Disclaimer: This article is for educational and informational purposes only. It is not intended to substitute for professional veterinary advice, diagnosis, treatment, or regulatory guidance. Always consult a licensed veterinarian or qualified specialist regarding animal health, disease diagnosis, and therapeutic decisions.