Section: Computational Biology

Computational Screening of Fragment Libraries in Active Sites

Introduction to Fragment-Based Lead Discovery

Fragment-based lead discovery (FBLD) represents a paradigm shift in early-stage drug design, emphasizing the screening of low-molecular-weight compounds (typically 150-300 Da) against macromolecular targets [1, 2]. Unlike high-throughput screening of large, drug-like molecules, fragment screening interrogates a sparse chemical space with high efficiency, identifying small, weakly binding moieties that can be elaborated into potent leads [3, 4]. The biophysical basis for this approach lies in the reduced complexity of fragment-target interactions; fragments possess fewer degrees of freedom and lower enthalpic penalties upon binding, allowing for higher hit rates per atom screened [5, 6]. In veterinary medicine, FBLD has been applied to targets such as Mycobacterium tuberculosis dihydrofolate reductase (Mtb-DHFR) and Mycobacterium protein kinase B, demonstrating its utility in addressing pathogens relevant to livestock and companion animals [1, 7].

The computational screening of fragment libraries within active sites integrates molecular docking, scoring functions, and structural analysis to predict binding modes and affinities [8, 9]. This process is distinct from traditional virtual screening because fragments exhibit promiscuous binding and require specialized scoring functions that account for solvation, entropy, and induced fit [10, 6]. The active site, defined as the catalytic or functional pocket of a protein, serves as the spatial constraint for docking simulations [11, 12]. Accurate representation of this pocket through grid mapping is essential for capturing electrostatic, van der Waals, and hydrogen bonding interactions [13, 14].

Active Site Grid Mapping and Preparation

The computational representation of an active site begins with the generation of a three-dimensional grid that encapsulates the binding pocket [8, 9]. This grid partitions the space into discrete points, each assigned precomputed interaction potentials for probe atoms (e.g., carbon, nitrogen, oxygen, hydrogen) [6]. The grid dimensions are typically set to encompass the entire pocket plus a margin of 5-10 angstroms to accommodate ligand flexibility and induced fit [13]. For veterinary targets such as the dengue viral RNA-dependent RNA polymerase or Helicobacter pylori DapE, grid spacing of 0.375 angstroms is standard to balance resolution and computational cost [11, 12].

Grid maps are generated using programs such as AutoGrid, which calculates electrostatic potentials via Poisson-Boltzmann or Coulombic formalisms and van der Waals potentials using Lennard-Jones 12-6 functions [8]. Desolvation terms are often included to penalize the displacement of ordered water molecules [10]. In fragment screening, the grid must be calibrated to detect weak interactions; a fragment may form only one or two hydrogen bonds, so the grid's hydrogen bonding potential must be finely tuned [6, 14]. For allosteric sites, such as those identified in HIV integrase or PDK1, the grid must extend beyond the orthosteric pocket to capture cryptic binding regions [15, 16].

Fragment Library Design and Filtering

Fragment libraries for computational screening are curated to maximize chemical diversity while adhering to the "rule of three": molecular weight less than 300 Da, hydrogen bond donors less than or equal to 3, hydrogen bond acceptors less than or equal to 3, and cLogP less than or equal to 3 [2, 3]. Libraries may contain hundreds to thousands of fragments, often derived from commercial sources or designed de novo using pharmacophore hypotheses [14]. For example, a fragment library targeting heparanase was generated by enumerating minimal pharmacophoric features from known inhibitors [14]. Similarly, fragment merging approaches have been used to design selective inhibitors of Mtb-DHFR by combining two weakly binding fragments into a single high-affinity molecule [1].

Computational filtering removes fragments with reactive functional groups, poor solubility, or high conformational flexibility [4, 17]. The ACFIS web server provides an automated pipeline for fragment-based drug discovery, integrating library preparation, docking, and hit analysis [4]. In the context of veterinary virology, fragment libraries have been screened against the SARS-CoV-2 helicase Nsp13 and the Nsp3 macrodomain, demonstrating cross-species applicability [18, 19]. The use of lead-like libraries, as opposed to fragment libraries, has also been explored for targets such as glutaminyl cyclase, where initial hits were validated by in vitro assays [20].

Molecular Docking of Fragments

Molecular docking algorithms predict the preferred orientation of a fragment within the active site by sampling translational, rotational, and conformational degrees of freedom [8, 9]. For fragments, exhaustive sampling is computationally feasible due to the limited number of rotatable bonds (typically fewer than five) [5]. Docking programs such as AutoDock Vina, RosettaLigand, and Glide employ stochastic or systematic search algorithms, including genetic algorithms, Monte Carlo methods, and incremental construction [8, 9, 6].

Scoring functions evaluate the complementarity between the fragment and the active site. Common scoring functions include force-field-based (e.g., MM-PBSA), empirical, and knowledge-based potentials [13, 10]. For fragments, the binding energy is often dominated by van der Waals contacts and hydrogen bonding, with electrostatic contributions playing a secondary role [6]. Re-scoring with MM-PBSA (Molecular Mechanics Poisson-Boltzmann Surface Area) has been shown to improve the correlation between predicted and experimental binding affinities for fragment hits [13, 10]. In a study targeting Leishmania major protein disulfide isomerase, computational fragment screening combined with MM-PBSA re-scoring identified novel inhibitors with micromolar activity [21].

A critical challenge in fragment docking is the accurate prediction of water-mediated interactions. Fragments often displace or coordinate water molecules in the active site, and explicit water models or hydration site analysis can improve docking accuracy [5, 6]. For the disulfide-dithiol oxidoreductase DsbA from Burkholderia pseudomallei, NMR fragment screening revealed a novel binding site near the catalytic surface, highlighting the importance of solvent effects in fragment recognition [22].

Fragment Linking and Elaboration

Once fragment hits are identified, they must be linked or elaborated to produce high-affinity leads [1, 3]. Fragment linking involves connecting two or more fragments that bind to adjacent subpockets within the active site, ideally with minimal linker strain [3, 23]. The resulting molecule often exhibits superadditive binding affinity, as the entropic penalty of binding multiple fragments is paid once [2]. For renin inhibitors, fragment linking yielded compounds with nanomolar potency by bridging the S1 and S3 pockets [3].

Fragment elaboration, or growing, involves extending a single fragment into adjacent pockets by adding functional groups [23]. This approach was used to develop inhibitors of Mycobacterium tuberculosis type II dehydroquinase, where a fragment hit was elaborated through iterative synthesis and docking [23]. Computational tools such as ACFIS facilitate the design of linked fragments by enumerating possible linkers and evaluating their docking scores [4]. The fragment merging approach, distinct from linking, combines overlapping features of two fragments into a single scaffold, as demonstrated in the design of Mtb-DHFR inhibitors [1].

Visualization of Fragment Overlays and Binding Geometries

Visual inspection of fragment binding modes is essential for understanding structure-activity relationships and guiding lead optimization [15, 17]. Three-dimensional active site viewers, such as PyMOL, Chimera, and VMD, allow researchers to overlay multiple fragment poses and compare their interactions with key residues [18, 19]. Fragment overlays reveal conserved hydrogen bonds, hydrophobic contacts, and pi-stacking interactions that define the binding pharmacophore [6, 14].

In a typical workflow, the protein structure is rendered as a molecular surface or ribbon diagram, with the active site highlighted by electrostatic potential maps [11, 12]. Fragment poses are displayed as stick models, color-coded by atom type or by fragment identity. The viewer can rotate, translate, and zoom to inspect specific interactions, such as the distance between a fragment's hydroxyl group and a catalytic serine residue [22, 16]. For allosteric sites, such as the PIF-pocket of PDK1, fragment overlays can identify conformational changes induced by binding [16].

The following Mermaid diagram illustrates the computational fragment screening workflow from library preparation to lead optimization.

flowchart TD
    A[Protein Target Selection], > B[Active Site Identification]
    B, > C[Grid Map Generation]
    C, > D[Fragment Library Curation]
    D, > E[Molecular Docking of Fragments]
    E, > F[Scoring and Re-scoring]
    F, > G[Hit Selection]
    G, > H[Fragment Overlay and Visualization]
    H, > I[Fragment Linking or Elaboration]
    I, > J[Lead Optimization]
    J, > K[In Vitro and In Vivo Validation]

Applications in Veterinary Drug Discovery

Computational fragment screening has been applied to a range of veterinary pathogens, including bacteria, viruses, and parasites [7, 21, 11]. For Mycobacterium bovis, the causative agent of bovine tuberculosis, fragment-based approaches targeting essential enzymes such as dihydrofolate reductase and protein kinase B have yielded selective inhibitors [1, 7]. In aquaculture, fragment screening against Streptococcus agalactiae and Streptococcus iniae targets has been explored, though published studies remain limited [17]. The identification of dengue viral RNA-dependent RNA polymerase inhibitors via computational fragment-based methods demonstrates the potential for antiviral drug design in veterinary species [11].

Parasitic targets, such as Leishmania major protein disulfide isomerase and Helicobacter pylori DapE, have also been addressed using fragment screening [21, 12]. For Eimeria species causing coccidiosis in poultry, fragment-based approaches could target enzymes involved in folate metabolism or energy production, though specific studies are not yet published. The integration of fragment screening with molecular dynamics simulations and quantum mechanics calculations enhances the reliability of predicted binding modes [7, 13].

Limitations and Future Directions

Despite its advantages, computational fragment screening faces several limitations. Scoring functions often fail to accurately rank fragment hits due to the shallow energy landscape and the dominance of entropic effects [5, 10]. False positives arising from non-specific binding or aggregation must be filtered through orthogonal assays such as NMR or surface plasmon resonance [22, 16]. The use of ensemble docking, where multiple protein conformations are considered, can improve hit rates but increases computational cost [13].

Future directions include the integration of machine learning models to predict fragment binding affinities and the development of fragment libraries tailored to specific target classes [18, 19]. The CACHE challenge, which evaluated fragment screening against the SARS-CoV-2 helicase Nsp13, highlights the importance of community-wide benchmarking [18]. In veterinary medicine, the expansion of fragment libraries to include species-specific pharmacophores and the application of cryo-electron microscopy to resolve fragment-bound complexes will accelerate drug discovery [19, 15].

Conclusion

Computational screening of fragment libraries in active sites is a powerful methodology for initiating drug discovery campaigns against veterinary pathogens. By combining active site grid mapping, molecular docking, and fragment linking, researchers can identify high-quality leads with improved efficiency compared to traditional high-throughput screening. The visualization of fragment overlays within three-dimensional active site viewers provides critical insights for medicinal chemistry optimization. As computational resources and algorithms continue to advance, fragment-based approaches will play an increasingly central role in veterinary structural bioinformatics.

References

[1] Gaur YK, Turkar MK, Nandi A et al. Identification of Selective Mtb-DHFR Inhibitors as Antitubercular Agents: A Fragment Merging Approach. Curr Comput Aided Drug Des. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42136210/

[2] Shin WS, Nguyen ME, Bergstrom A et al. Fragment-based screening and hit-based substructure search: Rapid discovery of 8-hydroxyquinoline-7-carboxylic acid as a low-cytotoxic, nanomolar metallo beta-lactamase inhibitor. Chem Biol Drug Des. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34148302/

[3] Tawada M, Suzuki S, Imaeda Y et al. Novel approach of fragment-based lead discovery applied to renin inhibitors. Bioorg Med Chem. 2016. URL: https://pubmed.ncbi.nlm.nih.gov/27720325/

[4] Hao GF, Jiang W, Ye YN et al. ACFIS: a web server for fragment-based drug discovery. Nucleic Acids Res. 2016. URL: https://pubmed.ncbi.nlm.nih.gov/27150808/

[5] Ferruz N, Harvey MJ, Mestres J et al. Insights from Fragment Hit Binding Assays by Molecular Simulations. J Chem Inf Model. 2015. URL: https://pubmed.ncbi.nlm.nih.gov/26376295/

[6] Loving K, Salam NK, Sherman W. Energetic analysis of fragment docking and application to structure-based pharmacophore hypothesis generation. J Comput Aided Mol Des. 2009. URL: https://pubmed.ncbi.nlm.nih.gov/19421721/

[7] Saiki K, Yokota M, Yamamura S et al. In silico identification of quinoline-pyridine hybrids binding to Mycobacterium protein kinase B, assessment by molecular dynamics simulation and quantum mechanics calculation, and in vitro validation of antimicrobial activity. Tuberculosis (Edinb). 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41638016/

[8] Perryman AL, Santiago DN, Forli S et al. Virtual screening with AutoDock Vina and the common pharmacophore engine of a low diversity library of fragments and hits against the three allosteric sites of HIV integrase: participation in the SAMPL4 protein-ligand binding challenge. J Comput Aided Mol Des. 2014. URL: https://pubmed.ncbi.nlm.nih.gov/24493410/

[9] Kumar A, Zhang KY. Computational fragment-based screening using RosettaLigand: the SAMPL3 challenge. J Comput Aided Mol Des. 2012. URL: https://pubmed.ncbi.nlm.nih.gov/22246345/

[10] Kawatkar S, Moustakas D, Miller M et al. Virtual fragment screening: exploration of MM-PBSA re-scoring. J Comput Aided Mol Des. 2012. URL: https://pubmed.ncbi.nlm.nih.gov/22869295/

[11] Anusuya S, Velmurugan D, Gromiha MM. Identification of dengue viral RNA-dependent RNA polymerase inhibitor using computational fragment-based approaches and molecular dynamics study. J Biomol Struct Dyn. 2016. URL: https://pubmed.ncbi.nlm.nih.gov/26262439/

[12] Mandal RS, Das S. In silico approach towards identification of potential inhibitors of Helicobacter pylori DapE. J Biomol Struct Dyn. 2015. URL: https://pubmed.ncbi.nlm.nih.gov/25204745/

[13] Zhu T, Lee H, Lei H et al. Fragment-based drug discovery using a multidomain, parallel MD-MM/PBSA screening protocol. J Chem Inf Model. 2013. URL: https://pubmed.ncbi.nlm.nih.gov/23432621/

[14] Gozalbes R, Mosulén S, Carbajo RJ et al. Development and NMR validation of minimal pharmacophore hypotheses for the generation of fragment libraries enriched in heparanase inhibitors. J Comput Aided Mol Des. 2009. URL: https://pubmed.ncbi.nlm.nih.gov/19421720/

[15] Patel D, Antwi J, Koneru PC et al. A New Class of Allosteric HIV-1 Integrase Inhibitors Identified by Crystallographic Fragment Screening of the Catalytic Core Domain. J Biol Chem. 2016. URL: https://pubmed.ncbi.nlm.nih.gov/27645997/

[16] Stockman BJ, Kothe M, Kohls D et al. Identification of allosteric PIF-pocket ligands for PDK1 using NMR-based fragment screening and 1H-15N TROSY experiments. Chem Biol Drug Des. 2009. URL: https://pubmed.ncbi.nlm.nih.gov/19207420/ *** Disclaimer: This article is for educational and informational purposes only. It is not intended to substitute for professional veterinary advice, diagnosis, treatment, or regulatory guidance. Always consult a licensed veterinarian or qualified specialist regarding animal health, disease diagnosis, and therapeutic decisions.

[17] Cancellieri M, Bassetto M, Widjaja I et al. In silico structure-based design and synthesis of novel anti-RSV compounds. Antiviral Res. 2015. URL: https://pubmed.ncbi.nlm.nih.gov/26259810/

[18] Herasymenko O, Silva M, Abu-Saleh AAA et al. CACHE Challenge #2: Targeting the RNA Site of the SARS-CoV-2 Helicase Nsp13. J Chem Inf Model. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40539604/

[19] Brosey CA, Houl JH, Katsonis P et al. Targeting SARS-CoV-2 Nsp3 macrodomain structure with insights from human poly(ADP-ribose) glycohydrolase (PARG) structures with inhibitors. Prog Biophys Mol Biol. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33636189/

[20] Szaszkó M, Hajdú I, Flachner B et al. Identification of potential glutaminyl cyclase inhibitors from lead-like libraries by in silico and in vitro fragment-based screening. Mol Divers. 2017. URL: https://pubmed.ncbi.nlm.nih.gov/28070724/

[21] Ben Khalaf N, Pham S, Romeo G et al. A computer-aided approach to identify novel Leishmania major protein disulfide isomerase inhibitors for treatment of leishmaniasis. J Comput Aided Mol Des. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33615401/

[22] Nebl S, Alwan WS, Williams ML et al. NMR fragment screening reveals a novel small molecule binding site near the catalytic surface of the disulfide-dithiol oxidoreductase enzyme DsbA from Burkholderia pseudomallei. J Biomol NMR. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32761504/

[23] Tran AT, West NP, Britton WJ et al. Elucidation of Mycobacterium tuberculosis type II dehydroquinase inhibitors using a fragment elaboration strategy. ChemMedChem. 2012. URL: https://pubmed.ncbi.nlm.nih.gov/22461418/