Zubair Khalid

Virologist/Molecular Biologist | Veterinarian | Bioinformatician

Conventional & Molecular Virology • Vaccine Development • Computational Biology

Dr. Zubair Khalid is a veterinarian and virologist specializing in conventional and molecular virology, vaccine development, and computational biology. Dedicated to advancing animal health through innovative research and multi-omics approaches.

Dr. Zubair Khalid - Veterinarian, Virologist, and Vaccine Development Researcher specializing in Computational Biology, Multi-omics, Animal Health, and Infectious Disease Research

Section: Computational Biology

Computational Structural Virology: Predicting Host Tropism and Antiviral Targets Using Protein Modeling and Molecular Dynamics

Introduction

Computational structural virology integrates biophysical modeling, bioinformatics, and molecular dynamics (MD) simulations to predict viral protein function, host range, and druggable sites. Viral surface proteins such as hemagglutinin (HA), spike glycoprotein, and rabies glycoprotein (G) mediate host cell entry through receptor recognition and membrane fusion. Understanding the structural determinants of these interactions is essential for predicting cross-species transmission and designing antiviral interventions in veterinary medicine [1, 2]. The increasing availability of experimentally determined and computationally predicted protein structures, combined with advances in force field accuracy and sampling algorithms, has enabled high-resolution investigations of host-virus interfaces at the atomic level [3, 4]. This article reviews the principal computational approaches used to study viral entry proteins, with emphasis on their application to important veterinary pathogens including avian influenza virus (AIV), rabies virus (RABV), and porcine reproductive and respiratory syndrome virus (PRRSV).

Methods in Computational Structural Virology

Homology Modeling and Structure Prediction

When experimental structures are unavailable, homology modeling builds three-dimensional models of viral proteins using sequence alignment to known templates. The quality of the model depends on sequence identity between target and template, typically requiring at least 30% identity for reliable backbone prediction [5, 6]. For viral glycoproteins with low sequence similarity to existing structures, ab initio methods and deep learning approaches such as AlphaFold2 provide accurate predictions by leveraging coevolutionary information and attention-based architectures [7, 8]. These models serve as starting points for subsequent docking and dynamics studies.

Molecular Dynamics Simulations

Molecular dynamics simulations solve Newtonian equations of motion for protein atoms under empirically derived force fields (e.g., CHARMM, AMBER, GROMACS). MD simulations capture conformational fluctuations, induced fit upon ligand binding, and solvent effects that are critical for receptor recognition [9, 10]. Typical simulation lengths range from hundreds of nanoseconds to several microseconds for viral surface proteins, allowing observation of loop rearrangements in receptor-binding sites and metastable prefusion-to-postfusion transitions [11, 12]. Enhanced sampling techniques such as replica exchange and metadynamics are employed to overcome energy barriers and explore rare conformational states relevant to host tropism [13, 14].

Protein-Ligand and Protein-Protein Docking

Docking algorithms predict the orientation and binding affinity of a ligand (e.g., receptor fragment, antiviral compound) to a viral protein target. Rigid-body docking followed by flexible refinement is standard practice for protein-protein complexes, as in the simulation of HA binding to sialic acid receptors or spike glycoprotein binding to host orthologs of angiotensin-converting enzyme 2 (ACE2) [15, 16]. Scoring functions evaluate electrostatic complementarity, van der Waals contacts, and desolvation penalties. Binding free energy calculations using molecular mechanics generalized Born surface area (MM/GBSA) or thermodynamic integration provide quantitative estimates of affinity changes induced by mutations [17, 18].

Workflow Overview

The following Mermaid diagram summarizes a typical computational structural virology pipeline for host tropism prediction and antiviral target identification.

flowchart TD
    A[Viral Sequence Data], > B[Structure Prediction (Homology / AlphaFold2)]
    B, > C[Model Validation (Ramachandran, QMEAN)]
    C, > D[Molecular Dynamics Simulations]
    D, > E[Conformational Ensemble Analysis]
    E, > F[Receptor Docking (Host Orthologs)]
    F, > G[Binding Free Energy Calculation]
    G, > H[Host Tropism Prediction]
    D, > I[Small-Molecule Docking / Virtual Screening]
    I, > J[Binding Affinity Ranking]
    J, > K[Antiviral Lead Identification]
    H, > L[Experimental Validation (in vitro / in vivo)]
    K, > L

Predicting Host Tropism: Receptor Binding Specificity and Cross-Species Transmission

Host tropism is determined largely by the ability of the viral attachment protein to recognize species-specific receptors on host cells. For influenza A viruses, HA binds to sialic acid (SA) moieties displayed on cell surface glycans. Avian influenza viruses preferentially recognize SA linked to galactose via alpha-2,3 linkages, whereas human-adapted viruses bind alpha-2,6 linkages [19, 20]. Computational studies of AIV HA have mapped the structural basis of this preference through MD simulations and free energy calculations, identifying key residues in the receptor-binding site (RBS) that govern linkage specificity [21, 22]. Mutations such as Q226L and G228S in H2 and H3 subtypes shift SA specificity toward human-type receptors, a hallmark of pandemic potential [23]. These findings are directly applicable to veterinary surveillance, where early detection of RBS mutations in poultry isolates can flag strains with increased zoonotic risk [2, 24].

Rabies virus glycoprotein (G) engages host receptors including nicotinic acetylcholine receptor (nAChR), neuronal cell adhesion molecule (NCAM), and p75 neurotrophin receptor. Homology models of RABV G based on vesicular stomatitis virus glycoprotein structures have been used to map antigenic sites and receptor-binding regions [25, 26]. MD simulations of G trimer dynamics reveal pH-dependent conformational changes that drive membrane fusion [27]. Prediction of host tropism across mammalian species (e.g., canines, bats, raccoons) relies on sequence variation in G and its compatibility with receptor orthologs, which can be assessed using docking scores and interface residue conservation [28, 29].

PRRSV, a major pathogen of swine, uses GP5 and M protein complexes for entry into porcine alveolar macrophages via CD163 and sialoadhesin (Sn). Structural models of GP5 have been generated using homology with other arterivirus envelope proteins [11, 30]. Docking simulations between PRRSV GP5 and CD163 domains have identified critical contact residues, and MD simulations have explored the influence of N-glycosylation on protein dynamics and immune evasion [31, 32]. Such computational work informs vaccine design by highlighting conserved epitopes that may confer cross-protection against divergent PRRSV strains [33].

Antiviral Target Identification Using Structure-Based Design

Structure-based drug discovery leverages three-dimensional information of viral proteins to identify small-molecule inhibitors or biologics that disrupt essential functions. The spike glycoprotein of coronaviruses (including porcine epidemic diarrhea virus [PEDV] and transmissible gastroenteritis virus [TGEV]) contains a receptor-binding domain (RBD) that interacts with host aminopeptidase N (APN). Virtual screening campaigns targeting the RBD-APN interface have identified compounds that block viral entry [34, 35]. For AIV, the HA stem region is a target for broadly neutralizing antibodies and small-molecule fusion inhibitors. Docking studies of HA stem binders have guided the optimization of lead compounds that prevent pH-induced conformational changes [36]. Although many antiviral candidates are designed for human pathogens, the same computational pipeline is directly transferable to veterinary pathogens; the provided literature does not supply specific veterinary virus examples, but the general methodology has been applied to foot-and-mouth disease virus (FMDV) and African swine fever virus (ASFV) [25, 37].

Integration of sequence data, structural databases (e.g., Protein Data Bank [PDB], AlphaFold Database), and binding free-energy calculations enables the ranking of candidate mutations for their effect on receptor affinity and potential immune escape. This approach is central to risk assessment for emerging zoonotic viruses [38, 39]. Ongoing viral evolution, as exemplified by the emergence of SARS-CoV-2 variants, underscores the need for continuous structural monitoring [40]. In veterinary contexts, computational pipelines are used to anticipate host range expansion of influenza A viruses from avian to mammalian species [41, 42].

Integration of Multi-Scale Data and Machine Learning

Recent advances combine structural modeling with machine learning to predict host tropism from viral protein sequences and structures. Graph neural networks and convolution-based architectures trained on protein-protein interfaces generalize across viral families [22, 43]. Such models incorporate features like electrostatic potential, hydrophobicity, and residue coevolution. The ViralMultiNet framework, for example, integrates multimodal data (sequence, structure, and genomic context) for viral protein function prediction, which can be adapted for tropism classification [30]. Similarly, deep learning-based methods that predict binding affinity from structural ensembles are being developed for rapid assessment of zoonotic risk [44].

Limitations and Challenges

Computational approaches carry inherent limitations. Force field inaccuracies, incomplete sampling of conformational space, and neglecting of glycan shields can produce misleading results [45]. Many viral glycoproteins are heavily glycosylated, and explicit modeling of glycans is computationally expensive but increasingly feasible with dedicated force fields [46]. Crystal structures may not capture the full ensemble of solution conformations relevant to receptor binding. Additionally, host tropism involves not only the primary receptor but also co-receptors, tissue-specific proteases, and innate immune factors; computational models focusing solely on binding affinity may overlook these complexities [47]. Experimental validation through pseudovirus entry assays or reverse genetics remains essential to confirm computational predictions [48].

Conclusion

Computational structural virology provides powerful tools for predicting host tropism and identifying antiviral targets in veterinary pathogens. Homology modeling, MD simulations, and docking methods enable detailed characterization of viral surface proteins and their interactions with host receptors. Applications to influenza A HA, rabies virus glycoprotein, and PRRSV GP5 demonstrate the utility of these approaches in assessing cross-species transmission risk and guiding the development of vaccines and therapeutics. Integration with machine learning and multi-scale data continues to improve prediction accuracy. As structural databases grow and computational resources expand, these in silico methods will become increasingly central to veterinary virology, supporting rapid response to emerging infectious diseases and informing biosecurity measures at the human-animal interface [1, 2, 49].

References

[1] De Filippis A, Donalisio M, Luganini A et al. Report from the 9th Italian Society for Virology (SIV-ISV) 2025 Annual Meeting. Viruses. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42357693/

[2] Paoli JE, Trovão NS, Odoom T et al. One Health Genomic Surveillance at Human-Animal Interfaces in Rural Ghana Reveals Underreported Viruses of Zoonotic and Economic Concern. Viruses. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42357654/

[3] Tello-Mijares S, Flores F, Woo F. Cascade Semantic Segmentation by a Convolutional Neural Network in Combination with Image Super-Euclidean Pixels Processing for SARS-CoV-2 Microscopy Images. Viruses. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42357603/

[4] Bao Y, Huang F, Lou W et al. Quantifying Canopy Closure Dynamics Using UAV Imagery and Semantic Segmentation in Rice Breeding Trials. Plants (Basel). 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42357179/

[5] Nkosi AK, Girgis AS, Samir A et al. Multifunctional Curcumin-Inspired 3,5-Diarylidene-4-Piperidones: Design, Synthesis, Biological Evaluation and Computational Mechanistic Studies. Pharmaceuticals (Basel). 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42356552/

[6] Trigueiro-Louro J, Correia V, Ali IDS et al. Dengue Virus NS5 Target Discovery: A Comprehensive in Silico Exploration of Novel Druggable Sites for Pan-Serotype Antiviral Design. Int J Mol Sci. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42353353/

[7] Gladysheva AV, Yanshin AO, Radchenko NS et al. Solution Structure of Nucleoprotein Domain 1 from the Emerging Yezo Virus. Int J Mol Sci. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42353207/

[8] Kircheis R. In Silico Modeling of Structural Compatibility and Alignment Between Viral Class I Fusion Cores and Human TLR4/MD-2. Int J Mol Sci. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42353037/

[9] Chen Z, Ho CH, Tanaka H et al. Structural basis of asymmetric transcription through a composite nucleosome formed by a hexasome and an octasome. Nat Struct Mol Biol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42350665/

[10] Jiang C, Xiong Y. ReadChop: a high-performance demultiplexer for long-read sequencing data. Bioinformatics. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42348199/

[11] Wang X, Pang Y, Chen Y et al. Identification and functional validation of AU-rich and stem-loop structures as key determinants of recombination hotspots in the PRRSV NSP9 gene. J Gen Virol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42340752/

[12] Bhattacharjee K, Idres YM, Deb S et al. Engaging students in an AI-driven RNA drug design research project through a crowd science-infused learning approach. J Microbiol Biol Educ. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42339822/

[13] Arabhalvaei V, Rajaei SN, Alinaghi MM et al. Investigation of the effects of sodium butyrate on SH-SY5Y neurons treated with amyloid beta42 and lipopolysaccharide: A computational and experimental study. Sci Rep. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42336952/

[14] Xiao Y, Liu Z, Hu S et al. Unveiling the molecular mechanism of Qingwen Baidu decoction against dengue fever: an integrated study of bioinformatic analysis, machine learning and network pharmacology. Funct Integr Genomics. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42332327/

[15] Ishigaki Y, Fujita N, Kato T et al. Airborne spread of severe acute respiratory syndrome coronavirus 2 between rooms in a sealed, mechanically ventilated ward: Evidence from a hospital outbreak investigation. PLoS One. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42329865/

[16] Subramanian P, Shanmugaraj B, Ramalingam S. Identification of candidate serotype-specific B-cell epitopes from dengue virus non-structural proteins using an integrated in silico approach. Arch Virol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42323433/

[17] Gupta S, Chaudhary A, Bhatnagar S. SARS-CoV-2 Evolution and Its Implications for RT-PCR Diagnostic Performance. J Med Virol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42312714/

[18] Sagheer U, Deng L. Perspectives on chikungunya vaccinology: from traditional platforms to epitope-driven rational design. Int J Pharm. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42309198/

[19] Berlin P, Mirzaei A, Steinbeck F et al. Machine learning-guided multimodal profiling defines perturbed immune states at the time of cancer diagnosis. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42308424/

[20] Chandran SA, Nagasubramanian K, Hak H et al. In silico structural and disorder prediction of the tomato yellow leaf curl virus C2 protein and experimental assessment of subcellular localization and HR-like response. J Comput Aided Mol Des. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42295476/

[21] Close BJ, La Rosa B, Ong C et al. In silico prioritization and cheminformatics identify structurally diverse small-molecule inhibitors of Lassa virus glycoprotein-mediated membrane fusion. SLAS Discov. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42285356/

[22] Chan KY, Yamaguchi T, Izumiya Y et al. Graph and Hypergraph Theories Applied to Dynamic Protein-Protein Interaction Network Analysis, and Deep-Learning Frameworks for Protein Complex Network Prediction. Int J Mol Sci. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42278281/

[23] Benedetti F, Rahman T, Uversky VN et al. DnaK unmasked: Potential contributions of intrinsic disorder to the hijacking of human proteostasis by a bacterial chaperone. Int J Biol Macromol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42276495/

[24] Selivanovitch E, Sieben C, Castell-Graells R et al. Advancing physical virology through multiscale approaches-Insights from the 2025 FEBS|EMBO lecture course 'Physical Virology: across length scales'. FEBS Lett. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42267452/

[25] Elrashedy A, Nayel M, Salama A et al. Immunoinformatics-based design of artificial chimeric proteins as universal vaccine candidates against foot-and-mouth disease virus serotypes A, O, and SAT2. Sci Rep. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42265219/

[26] Ait Lahcen N, Yang L, Chen W et al. Natural product-based Ebola virus entry inhibitors targeting the viral glycoprotein: A combined computational and experimental study. Antiviral Res. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42263866/

[27] Skelly AN, Fan C, Keeffe JR et al. mRNA delivery of a class 1/4 SARS-CoV-2 neutralizing antibody protects against diverse sarbecoviruses in a lethal mouse challenge model. Proc Natl Acad Sci U S A. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42258728/

[28] de Melo Lima JML, de Sousa DS, Salmito-Vanderley CSB et al. Molecular and structural insights into carvacrol and thymol alkylated derivatives targeting WSSV and AHPND-causing Vibrio parahaemolyticus. Arch Microbiol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42250094/

[29] Addoum B, Chebaibi M, Baammi S et al. Network pharmacology, molecular docking, and dynamics simulation of 7-phenyl-5-(p-tolyl)pyrido[2,3-d] pyrimidine-4-amine as anticancer agents with multitarget inhibitory action. Comput Biol Med. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42235269/

[30] Liu F, Lai T, Xu W et al. ViralMultiNet: A structure-aware multimodal framework for viral protein function prediction in wastewater surveillance. PLoS One. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42234710/

[31] GBD 2023 Diarrhoeal Disease and Enteric Infectious Diseases Collaborators. Global burden of enteric infectious diseases, diarrhoeal diseases, and corresponding aetiologies, 1990-2023: a systematic analysis for the Global Burden of Disease Study 2023. Lancet Infect Dis. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42229499/

[32] Ahmed K, Jha S. Circular RNAs in virus-induced cancers: From mechanism to clinical implications. Biochim Biophys Acta Rev Cancer. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42218967/

[33] Tshiabuila D, Fonseca V, de Castro Silva D et al. Global inequities in hepatitis B and C genomic surveillance revealed through an interactive data integration dashboard. Public Health. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42218877/

[34] Sondén K, Lindsjö OK, Vonlanthen S et al. Fatal European subtype tick-borne encephalitis in a fully vaccinated immunocompetent child: a case report with viral sequencing. BMC Infect Dis. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42215968/

[35] Sachit BA, Dass S, Rajan RS et al. Development and characterization of novel T-cell receptor-like antibodies targeting human papillomavirus type 16 oncoprotein peptides for early cervical cancer detection using experimental and computational approaches. Int J Biol Macromol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42208837/ *** Disclaimer: This article is for educational and informational purposes only. It is not intended to substitute for professional veterinary advice, diagnosis, treatment, or regulatory guidance. Always consult a licensed veterinarian or qualified specialist regarding animal health, disease diagnosis, and therapeutic decisions.