Predicting Zoonotic Spillover: Computational Modeling of Receptor-Binding Dynamics in Emerging Bat Coronaviruses
Abstract
Zoonotic spillover of bat coronaviruses into peridomestic and domestic animal species represents a critical pathway for the emergence of novel viral pathogens with pandemic potential. The receptor-binding domain (RBD) of the coronavirus spike protein and its interaction with host angiotensin-converting enzyme 2 (ACE2) is a primary molecular determinant of host range and cross-species transmission. This article reviews the computational methods used to predict RBD-ACE2 binding dynamics, including homology modeling, molecular docking, free energy calculations, and sequence-based machine learning. We discuss the integration of these approaches into surveillance frameworks such as the RAISE (Risk Assessment of Interspecies Spillover Events) tool and examine how structural predictions inform veterinary vaccine design and targeted wildlife monitoring. Emphasis is placed on bat-origin alphacoronaviruses and sarbecoviruses, with comparative analysis of ACE2 receptor variability across mammalian species. The article concludes with an assessment of current limitations and future directions for computational spillover risk modeling.
Introduction
Bat species of the order Chiroptera serve as reservoir hosts for a diverse array of coronaviruses, many of which have demonstrated the capacity for cross-species transmission to humans and domestic animals [1, 2, 3]. Ecological and anthropogenic factors, including habitat encroachment and roosting in human structures, increase the frequency of bat-human and bat-livestock interfaces, thereby elevating spillover risk [4, 5, 6]. The molecular gateway for coronavirus entry is the interaction between the viral spike protein RBD and host cell surface receptors, most commonly ACE2 for sarbecoviruses and related alphacoronaviruses [7, 8]. Computational modeling of this interaction has become an indispensable tool for predicting which bat coronaviruses pose the greatest zoonotic threat, as experimental characterization of every novel virus is impractical [9, 3].
The RBD-ACE2 binding interface is characterized by a complex network of hydrogen bonds, van der Waals contacts, and electrostatic interactions that determine binding affinity and specificity [8]. Bat coronaviruses, particularly those from the subgenera Sarbecovirus and Merbecovirus, exhibit RBD sequences that can be phylogenetically grouped and structurally modeled to infer receptor compatibility [10, 11]. Computational structural biology methods, including homology modeling, molecular docking, and molecular dynamics (MD) simulations, allow researchers to construct three-dimensional models of RBD-ACE2 complexes and compute binding free energies [12, 8]. These predictions are then integrated with epidemiological data, viral genomics, and machine learning classifiers to produce spillover risk scores [11, 13, 9].
This article focuses exclusively on the computational modeling of receptor-binding dynamics in bat coronaviruses relevant to veterinary medicine and wildlife surveillance. Human clinical outcomes are discussed only where direct comparative host-range parallels are drawn, such as the use of human ACE2 as a reference receptor for evaluating zoonotic potential in animal models [14, 12].
Computational Methods for Predicting RBD-ACE2 Interactions
Homology Modeling of Bat Coronavirus RBDs
The three-dimensional structure of the RBD is not experimentally determined for every bat coronavirus isolate. Homology modeling, also known as comparative modeling, provides a reliable alternative when a template structure with sufficient sequence identity exists [15]. The RBD core structure is conserved among coronaviruses, allowing the use of solved structures such as SARS-CoV-2 RBD (PDB 6M0J) or SARS-CoV RBD (PDB 2AJF) as templates for bat sarbecovirus RBDs [12]. Tools such as MODELLER and SWISS-MODEL are commonly employed to generate models based on sequence alignment [15]. The quality of the model is evaluated using DOPE scores, QMEAN, and Ramachandran plots to ensure stereochemical plausibility [15]. For bat alphacoronaviruses that use receptors other than ACE2, such as CEACAM6, homology modeling is adapted to the specific receptor-binding region [16].
Molecular Docking of RBD-ACE2 Complexes
Once a structural model of the bat coronavirus RBD is built, molecular docking is used to predict the binding pose and affinity with host ACE2 orthologs [15, 12]. Rigid-body docking algorithms, such as those implemented in ClusPro or ZDOCK, evaluate millions of potential orientations and score them based on shape complementarity and electrostatic compatibility [15]. Flexible docking approaches, which allow side-chain conformational changes, are often necessary to capture induced-fit effects at the interface [12]. The docking score, typically expressed as a binding energy or a probability score, provides a quantitative estimate of the likelihood of receptor recognition [15, 12]. For example, screening candidate intermediate hosts for porcine respiratory coronavirus using molecular docking revealed species-specific binding preferences that correlated with known susceptibility [15].
Free Energy Calculations and Molecular Dynamics Simulations
Molecular dynamics (MD) simulations offer a more detailed and dynamic view of the RBD-ACE2 interaction than static docking. By simulating the solvated complex over nanosecond to microsecond timescales, MD captures conformational fluctuations, water-mediated hydrogen bonds, and the stability of the binding interface [12, 8]. Free energy perturbation (FEP) and molecular mechanics generalized Born surface area (MM/GBSA) methods are then applied to compute the binding free energy (ΔG) of the complex [12]. These calculations can distinguish between high-affinity and low-affinity interactions and identify specific residues that contribute disproportionately to binding [12]. For instance, MM/GBSA analysis of bat sarbecovirus RBDs with human ACE2 has revealed that mutations at residue positions 493, 498, and 501 can significantly alter binding affinity, thereby modulating spillover potential [12, 17].
Sequence-Based Machine Learning for Spillover Risk Prediction
Machine learning models trained on viral genomic and proteomic features have been developed to predict zoonotic potential without requiring a solved structure [18, 13, 9]. Features such as RBD sequence composition, predicted secondary structure, and phylogenetic distance to known zoonotic viruses are used as inputs for classifiers like random forest, support vector machines, and deep neural networks [18, 13, 19]. The RAISE (Risk Assessment of Interspecies Spillover Events) tool exemplifies this approach, integrating RBD sequence features with ACE2 sequence similarity across vertebrate hosts to produce a spillover risk score for sarbecoviruses [11]. Other models incorporate host ecological traits, such as anthropogenic roosting behavior, to predict which bat species are most likely to harbor viruses with high zoonotic potential [4, 5, 20]. A critical challenge in machine learning for spillover prediction is the limited availability of experimentally validated negative examples (i.e., viruses that do not infect humans), which can lead to overestimation of risk [18, 9].
Case Studies: Computational Modeling of Bat Coronavirus RBDs
Sarbecovirus RBD-ACE2 Binding and the RAISE Framework
Sarbecoviruses, including SARS-CoV, SARS-CoV-2, and numerous bat isolates, use ACE2 as the primary entry receptor. The RAISE computational tool was developed to systematically evaluate the spillover potential of bat sarbecoviruses by combining RBD-ACE2 docking scores, sequence conservation of critical binding residues, and phylogenetic relatedness to known human-infecting viruses [11]. In a large-scale screening, RAISE identified several bat sarbecoviruses with predicted high binding affinity to human ACE2, many of which were subsequently confirmed in vitro [11]. The tool also assesses the likelihood of spillover into intermediate hosts, such as pangolins and civets, by docking bat RBDs against ACE2 orthologs from a panel of mammalian species [11, 14]. Comparative analysis of ACE2 sequences from bats, rodents, and carnivores reveals that certain residues at the RBD binding interface (e.g., positions 31, 35, 38, and 353) are highly variable and can determine species susceptibility [14, 12].
Alphacoronavirus RBD Interactions with Alternative Receptors
Not all bat coronaviruses use ACE2. Heart-nosed bat alphacoronaviruses have been shown to use human CEACAM6 as an entry receptor, a finding that emerged from a combination of homology modeling and viral pseudotype entry assays [16]. Computational docking of the alphacoronavirus RBD to CEACAM6 revealed a binding interface distinct from that of sarbecovirus-ACE2 complexes, highlighting the need for receptor-specific modeling approaches [16]. Similarly, porcine respiratory coronavirus uses aminopeptidase N (APN) as a receptor, and molecular docking studies have been used to screen potential intermediate hosts based on APN sequence variation [15]. These examples underscore the importance of expanding computational models to include non-ACE2 receptors, as many bat coronaviruses may use alternative molecules for cell entry.
Implications for Veterinary Surveillance and Vaccine Design
Computational predictions of RBD-ACE2 binding dynamics directly inform surveillance strategies for emerging bat coronaviruses in domestic animal populations. By identifying bat coronaviruses with high predicted affinity for livestock ACE2 receptors, targeted serological and molecular surveillance can be implemented in high-risk regions [13, 21]. For example, models predicting that bat sarbecoviruses can bind to ACE2 of pigs, cattle, and dogs have prompted surveillance in Southeast Asian pig farms and live animal markets [21, 12]. The RAISE tool provides a prioritized list of bat coronaviruses for which intermediate host screening should be conducted [11].
In veterinary vaccine design, knowledge of the RBD-ACE2 interface enables the selection of immunogens that elicit broadly neutralizing antibodies across multiple sarbecovirus lineages. Structural modeling of the RBD bound to ACE2 can identify conserved epitopes that are less prone to escape mutations [22]. A trivalent vaccine strategy based on immunogenic relationship mapping of sarbecovirus RBDs has been proposed to provide broad protection against bat-origin coronaviruses with zoonotic potential [22]. Computational design of RBD-based subunit vaccines relies on accurate modeling of the RBD architecture to ensure proper folding and antigen presentation [22].
The following Mermaid diagram illustrates the typical computational workflow for assessing spillover risk from bat coronavirus RBD sequences.
flowchart TD
A[Bat coronavirus RBD sequence], > B[Homology modeling of RBD 3D structure]
B, > C[Selection of host ACE2 orthologs]
C, > D[Molecular docking of RBD-ACE2 complex]
D, > E[Free energy calculation (MM/GBSA or FEP)]
E, > F{Binding affinity threshold exceeded?}
F, >|Yes| G[High spillover risk candidate]
F, >|No| H[Low risk]
G, > I[In vitro binding validation]
I, > J[Field surveillance in intermediate hosts]
G, > K[Machine learning risk classifier (e.g., RAISE)]
K, > L[Spillover risk score]
L, > M[Surveillance prioritization]
Challenges and Future Directions
Despite significant advances, computational modeling of receptor-binding dynamics faces several limitations. First, the accuracy of homology models depends on template availability; RBDs from highly divergent bat coronaviruses may not be modeled reliably [15]. Second, docking and free energy calculations often ignore the role of glycosylation on both the RBD and the receptor, which can affect binding kinetics [12, 8]. Third, machine learning models are constrained by data imbalance and the difficulty of obtaining true negative labels for non-zoonotic viruses [18, 9]. Fourth, the dynamic nature of the spike protein, including conformational changes between closed and open states, requires enhanced sampling MD simulations that are computationally expensive [12].
Future directions include the integration of deep learning models, such as AlphaFold2, for predicting RBD structures with high accuracy even in the absence of close templates [11]. The application of large language models trained on viral nucleotide sequences has shown promise in improving the generalizability of spillover predictions across viral families [18]. Additionally, combining in silico predictions with in vitro binding assays and ecological data from bat roosting ecology will enhance the resolution of spillover risk maps [4, 5, 20]. The table below summarizes the main computational methods discussed and their applications.
| Method | Input | Output | Application to Bat Coronaviruses |
|---|---|---|---|
| Homology modeling | RBD sequence | 3D structure of RBD | Generate structure for novel bat coronavirus RBDs [15] |
| Molecular docking | RBD and ACE2 structures | Binding pose and score | Predict receptor compatibility with host ACE2 orthologs [15, 12] |
| Free energy calculation (MM/GBSA) | Docked complex | Binding free energy (ΔG) | Quantify affinity differences between RBD variants [12] |
| Machine learning (e.g., random forest) | RBD sequence features, host traits | Spillover risk score | Prioritize bat coronaviruses for surveillance [11, 18, 13] |
| MD simulations | Solvated RBD-ACE2 complex | Conformational dynamics and stability | Identify key residues for binding and escape mutations [12] |
Conclusion
Computational modeling of receptor-binding dynamics provides a powerful framework for predicting zoonotic spillover potential of emerging bat coronaviruses. Homology modeling, molecular docking, free energy calculations, and machine learning each contribute unique insights into the molecular determinants of host range. The RAISE tool and similar platforms now allow systematic triage of bat coronaviruses for surveillance and vaccine design. Continued refinement of these methods, coupled with experimental validation and ecological data, will be essential for preempting the next zoonotic coronavirus spillover event into domestic animal populations.
References
[1] Biswas R, Debnath C, Samanta I et al. Ecology of bats and their role in emerging zoonotic diseases: a review. Revue scientifique et technique. 2020. URL: https://www.semanticscholar.org/paper/d3a6128146d1b974fc60dad2531d5ac87e2f6316
[2] Hayman DTS, Bowen RA, Cryan P et al. Ecology of Zoonotic Infectious Diseases in Bats: Current Knowledge and Future Directions. Zoonoses and Public Health. 2012. URL: https://www.semanticscholar.org/paper/152011abc4bd0a92e3cf011bbfcf1e5fdc001b24
[3] Olival K, Hosseini P, Zambrana-Torrelio C et al. Host and viral traits predict zoonotic spillover from mammals. Nature. 2017. URL: https://www.semanticscholar.org/paper/5833bef1f77157d1e3e6e1553ee9fdd2ff049ee0
[4] Betke BA, Gottdenker N, Meyers LA et al. Bat anthropogenic roosting ecology influences taxonomic and geographic predictions of viral hosts. bioRxiv. 2026. URL: https://www.semanticscholar.org/paper/aaf46a7f9ec4a7d9d8c94181f274aa44da0b028f
[5] Betke BA, Gottdenker N, Meyers LA et al. Bat anthropogenic roosting ecology influences taxonomic and geographic predictions of zoonotic risk. Journal. 2026. URL: https://www.semanticscholar.org/paper/f47c4bcf6b97d903d4aeb0bc758872b45fa58307
[6] Williams EP, Spruill-Harrell B, Taylor MK et al. Common Themes in Zoonotic Spillover and Disease Emergence: Lessons Learned from Bat- and Rodent-Borne RNA Viruses. Viruses. 2021. URL: https://www.semanticscholar.org/paper/b86199675e3dcd1bec6aa1b654fa08d6385b5fbf
[7] Westhoven S, Bertzbach LD, Kloehn M et al. From zoonotic spillover to endemicity: the broad determinants of human coronavirus tropism. mBio. 2025. URL: https://www.semanticscholar.org/paper/38e7aeb6ec6f680883b9da203fb076c2af3aa278
[8] Kuchipudi S, Nelli R, Gontu A et al. Sialic Acid Receptors: The Key to Solving the Enigma of Zoonotic Virus Spillover. Viruses. 2021. URL: https://www.semanticscholar.org/paper/b0c9ef677cd86c91447fb33dba565c9977434ab7
[9] Mollentze N, Streicker D. Predicting zoonotic potential of viruses: where are we? Current Opinion in Virology. 2023. URL: https://www.semanticscholar.org/paper/69691179c585b9566d193f11d44b76b69a3b6c16
[10] Li X, Kang M, Jiao XY et al. Addressing the zoonotic threat of merbecoviruses. Nat Microbiol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42342927/
[11] Huang H, Kong L, Zhu Y et al. RAISE: A computational tool for evaluating sarbecovirus spillover potential. Nat Commun. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42034636/
[12] Tan ZH, Yong K, Shu JJ. Predicting potential SARS-CoV-2 spillover and spillback in animals. Journal of microbiology, immunology, and infection. 2024. URL: https://www.semanticscholar.org/paper/cf6c0ae52b4015cc43892fbb6e9f3275a760b8d7
[13] Pandit P, Anthony S, Goldstein T et al. Predicting the potential for zoonotic transmission and host associations for novel viruses. Communications Biology. 2022. URL: https://www.semanticscholar.org/paper/be77a73fcbbde954b584acc947c4296afeab3dc9
[14] Frank JA, Gan EX, Hooper WB et al. Systematic multi-reference vertebrate ACE2 sequence similarity analysis predicts species susceptibility to SARS-related sarbecoviruses. Sci Rep. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41851226/
[15] Sootichote R, Chamkasem A, Toniti W et al. Screening candidate intermediate hosts for porcine respiratory coronavirus using molecular docking. Comp Immunol Microbiol Infect Dis. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42361779/
[16] Gallo G, Di Nardo A, Lugano D et al. Heart-nosed bat alphacoronaviruses use human CEACAM6 to enter cells. Nature. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42020746/
[17] Wang J, Pan Y, Yang L et al. Individual bat virome analysis reveals co-infection and spillover among bats and virus zoonotic potential. Nature Communications. 2023. URL: https://www.semanticscholar.org/paper/2d9e981d29c7813690965c8dc0820f25ac359619
[18] Kawasaki J, Suzuki T, Hamada M. Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models. bioRxiv. 2024. URL: https://www.semanticscholar.org/paper/2d9c99fb0b1b11f0ccb577589841120099e22572
[19] Eng CLP, Tong J, Tan T. Predicting Zoonotic Risk of Influenza A Viruses from Host Tropism Protein Signature Using Random Forest. International Journal of Molecular Sciences. 2017. URL: https://www.semanticscholar.org/paper/d7c110ea6dfe34e53911553af107a5b1e27e4221 *** Disclaimer: This article is for educational and informational purposes only. It is not intended to substitute for professional veterinary advice, diagnosis, treatment, or regulatory guidance. Always consult a licensed veterinarian or qualified specialist regarding animal health, disease diagnosis, and therapeutic decisions.
[20] Guy C, Ratcliffe J, Mideo N. The influence of bat ecology on viral diversity and reservoir status. Ecology and Evolution. 2020. URL: https://www.semanticscholar.org/paper/7884fab374e664ffbc6387b3a3b3e92c1874ef13
[21] Evans TS, Tan C, Aung O et al. Exposure to diverse sarbecoviruses indicates frequent zoonotic spillover in human communities interacting with wildlife. International Journal of Infectious Diseases. 2023. URL: https://www.semanticscholar.org/paper/0ef1e7a9e0f3e4ce49c00fb53563d44e1184fdbe
[22] Sun Y, Cheng Z, Wu X et al. Immunogenic relationship mapping supports a minimal-set trivalent vaccine strategy for broad sarbecovirus protection. Signal Transduct Target Ther. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41656339/
[23] Guzzi PH, Branda F, Scarpa F et al. Integrated Downstream Analysis and Epidemiological Modelling of Hantavirus Infection: From Host Transcriptomics to Transmission Dynamics. Pathogens. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42347213/
[24] Mazumder S, Kapoor S, Kaur H et al. Bioinformatics-driven genome-wide identification of viral miRNAs in high spillover bat coronaviruses and their target genes in human. Arch Microbiol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42060194/
[25] Imrie RM, Bissett LA, Raveendran S et al. Post-pandemic changes in population immunity have reduced the likelihood of emergence of zoonotic coronaviruses. Nat Commun. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41876522/
[26] Yu C, Hoem T, Ou TP et al. Timing is infectious: eco-epidemiological modelling of Pteropus lylei in Cambodia suggests regular annual coronavirus outbreaks in bats. Proc Biol Sci. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41592774/
[27] Tahir M. Predicting Zoonotic Spillover Events: A Fractional Modeling Framework for Nipah Virus Dynamics. SciNexuses. 2025. URL: https://www.semanticscholar.org/paper/734e89eb9a09929b75b41e84307bfae169708316
[28] Ogunremi J, Igwilo NN, Babalola OO. Predicting Zoonotic Spillover Events through a Real-Time GeoAI Framework for Global Hotspot Detection Integrating Satellite Intelligence, Viral Genomics, and Human Mobility Networks. International Journal of Research Publication and Reviews. 2026. URL: https://www.semanticscholar.org/paper/024c95d52d1cba90bdc10cf08916d9a8edfffda9
[29] Longest A, Grace TM, Tran M et al. Predicting Influenza Virus Host Tropism and Zoonotic Spillover Risk from Protein Sequences. bioRxiv. 2026. URL: https://www.semanticscholar.org/paper/c90a38774e21959faffc6956f72cedfa645892e8
[30] Aziz SAA, Shad KF. Analysis of SARS-CoV-2 and factors predicting next spillover of its more contagious variant. INTERNATIONAL JOURNAL OF ENDORSING HEALTH SCIENCE RESEARCH (IJEHSR). 2021. URL: https://www.semanticscholar.org/paper/4fb5343dbb851153fba9dcc39d1709b3928b970e
[31] Johnson CK, Hitchens P, Evans TS et al. Spillover and pandemic properties of zoonotic viruses with high host plasticity. Scientific Reports. 2015. URL: https://www.semanticscholar.org/paper/334b202da3abae2695868424f4e86f7a9ad6b75b
[32] Washburne A, Crowley D, Becker DJ et al. Taxonomic patterns in the zoonotic potential of mammalian viruses. PeerJ. 2018. URL: https://www.semanticscholar.org/paper/e398e5c7fc3583c5354b8745b86509038b94d1cc
[33] Zarzees M, Shah U, Murali T et al. A Cophylogenetic Approach for Virus-Host Interaction Prediction. bioRxiv. 2026. URL: https://www.semanticscholar.org/paper/371721600e98970ab6be933cd68e171df8afcec6
[34] Dhama K, Patel S, Sharun K et al. SARS-CoV-2: Jumping the Species Barrier, Lessons from SARS and MERS, Its Zoonotic Spillover, Transmission to Humans, Preventive and Control Measures and Recent Developments to Counter This Pandemic Virus. 2020. URL: https://www.semanticscholar.org/paper/7dfe9f9f72fe3b195ec3e878d79bb14fb39b50a2