Spatial Transcriptomics Alignment and Cellular Neighborhood Analysis
Introduction
Spatial transcriptomics (ST) has emerged as a transformative technology that preserves the spatial context of gene expression within tissue sections, enabling researchers to map transcriptomic profiles onto their native histological coordinates [1]. Unlike single-cell RNA sequencing (scRNA-seq), which dissociates cells and loses positional information, ST retains the physical architecture of tissues, allowing the study of cellular microenvironments, intercellular signaling, and tissue organization [2, 3]. In veterinary medicine, ST has been applied to investigate host-pathogen interactions in tissues such as avian respiratory epithelium, bovine mammary gland, and porcine intestinal mucosa, providing insights into disease pathogenesis at an unprecedented resolution [4, 5].
The core computational challenges in ST include aligning multiple tissue slices, deconvolving cell types from mixed spots, defining cellular neighborhoods, and inferring cell-cell communication networks [6, 7]. This article provides an exhaustive technical review of these topics, focusing on alignment algorithms, deconvolution strategies, neighborhood analysis, and integration with three-dimensional organoid models. All methodological descriptions are grounded in the peer-reviewed literature listed in the References section.
Spatial Coordinate Mapping and Alignment
Spatial coordinate mapping is the process of assigning transcriptomic measurements to physical positions within a tissue section. Most ST platforms, such as those based on barcoded bead arrays or in situ sequencing, generate a grid of capture spots, each with a unique spatial barcode [6]. The raw output consists of a gene expression matrix indexed by spot coordinates, which must be aligned to a reference coordinate system, often derived from a hematoxylin and eosin (H&E) stained histological image [8, 9].
Alignment of multiple ST slices from the same tissue or from different biological replicates is essential for constructing three-dimensional (3D) tissue atlases and for comparing conditions across disease states [10, 11]. Several computational frameworks have been developed for this purpose. GALA (a unified landmark-free framework) performs coarse-to-fine spatial alignment across resolutions and modalities without requiring manual landmark selection [12]. It uses iterative closest point optimization combined with feature matching from histological images to register slices in a common coordinate space. Similarly, topography-aware optimal transport (TA-OT) aligns spatial omics data by preserving the local topographical structure of tissue sections, minimizing distortion due to tissue tearing or folding [13].
Benchmarking studies have systematically evaluated the performance of alignment methods across different platforms, including array-based and imaging-based ST technologies [14]. Metrics such as spatial overlap, gene expression correlation, and preservation of tissue morphology are used to assess accuracy. Yan et al. [14] demonstrated that methods incorporating both gene expression and image features outperform those relying solely on transcriptional data. Wang et al. [15] extended alignment to cross-disease and cross-platform scenarios, showing that slice integration can reveal conserved spatial domains even when tissues originate from different species or pathological conditions.
Deep learning approaches have further advanced alignment robustness. SPADE uses a deep learning framework for spatial mapping and quantitative cell-cell interaction inference, integrating spatial coordinates with expression profiles to correct for batch effects and tissue distortions [2]. Holistic Invariant Retracing (HIR) addresses distortion-resilient multi-modal learning by learning invariant representations across ST and histology modalities [3]. DGAE (Dynamic Graph Convolutional Network) aligns multi-slice ST data by constructing dynamic graphs that capture spatial relationships between spots across slices, enabling joint enhancement of expression signals [16].
Cell-Type Deconvolution
ST spots typically contain multiple cells, making direct cell-type assignment ambiguous. Cell-type deconvolution aims to estimate the proportion of each cell type within a spot, often by integrating scRNA-seq reference data [4, 17]. Graph contrastive learning methods, such as those proposed by Dong et al. [4], leverage single-cell reference data to infer spatial cell composition by learning a shared embedding space between scRNA-seq and ST data. The contrastive objective encourages spots with similar cell-type mixtures to cluster together while separating distinct compositions.
Reference-free deconvolution methods eliminate the need for a matched scRNA-seq dataset. Attention-guided enhanced deconvolution (AGED) uses a self-attention mechanism to identify cell-type-specific gene expression patterns directly from ST data, without requiring external references [18]. This approach is particularly valuable for veterinary species where comprehensive single-cell atlases may be unavailable. SpaVGMC performs unified representation learning via structural and semantic alignment, combining spatial graph information with gene expression to infer cell-type proportions [5].
IntegrateRigor is an annotation-free integration optimization method that recovers cell identity by aligning ST and scRNA-seq data without requiring prior cell-type labels [17]. It uses iterative clustering and correlation-based matching to assign cell identities to spots, revealing cancer-immune interface niches in a veterinary oncology context. PRIME (Atlas-Level Single-Cell and Spatial Transcriptomics Data Integration) extends this concept to atlas-level integration, enabling the mapping of multiple ST datasets onto a common single-cell reference [8].
Cellular Neighborhood Analysis
Cellular neighborhoods are spatially localized groups of cells that interact functionally, often defined by proximity in physical space and coordinated gene expression [19, 20]. Neighborhood analysis identifies these regions by clustering spots based on both spatial adjacency and transcriptional similarity. Spatial domain detection methods, such as SpatialDG (dual-graph neural network), use graph neural networks to model spot-spot relationships and assign domain labels [21]. The dual-graph architecture captures both local spatial dependencies and global expression patterns, improving domain boundary detection.
Graph autoencoders with contrastive learning have been applied to identify batch-integrated domains from ST data [10]. Mao et al. [10] augmented the spatial graph with expression-derived edges and used contrastive loss to enforce consistency across batches, enabling robust domain detection in multi-slice experiments. DuaST integrates cross-branch interaction between a spatial branch and an expression branch, fusing information to delineate fine-grained neighborhoods [22].
Multi-view clustering approaches, such as STCF, combine multiple views of the data (e.g., spatial coordinates, gene expression, histological features) to identify consensus neighborhoods [23]. This method is particularly effective when tissue architecture is complex, such as in lymphoid organs or tumor microenvironments. The identification of spatially variable genes (SVGs) is a complementary task; MLN2SVG uses a contrastive variational autoencoder with multi-level neighbor search to detect genes whose expression varies significantly across spatial domains [24].
Cell-Cell Communication Networks
Inferring cell-cell communication from ST data involves predicting ligand-receptor interactions between neighboring cells or neighborhoods [2, 25]. SPADE quantitatively infers cell-cell interaction strengths by modeling the spatial distribution of ligand and receptor expression, accounting for diffusion gradients [2]. Directional diffusion models, as described by Wang et al. [25], incorporate spatial anisotropy to predict signaling directionality, which is critical for understanding polarized tissues such as intestinal epithelium or mammary ducts.
GAMMI (graph-guided contrastive and adversarial integration) integrates single-cell and spatial multi-omics data to infer intercellular communication networks [20]. By learning a joint embedding of gene expression and protein abundance, GAMMI can identify signaling pathways that are spatially restricted. GR2ST predicts spatial transcriptomics from histology images using graph-enhanced multimodal contrastive learning, enabling the inference of cell-cell interactions even when ST data are not directly available [26].
Logistic regression models have been adapted to estimate functional effects of spatial interactions [19]. Barkasi et al. [19] used logistic regression to identify gene pairs whose co-expression in neighboring spots is associated with functional outcomes, such as cell proliferation or immune activation. This approach provides a statistical framework for testing hypotheses about neighborhood-specific signaling.
Integration with Organoid Structural Models
Organoids are three-dimensional, self-organizing structures that recapitulate aspects of native tissue architecture. Mapping ST data onto organoid models requires alignment of 2D slices to a 3D reference, followed by projection of transcriptomic profiles onto the organoid surface or interior [27, 28]. stVCR (spatiotemporal dynamics of single cells) reconstructs the spatiotemporal trajectory of cells within developing organoids by combining ST with time-lapse imaging [27]. This method uses a variational autoencoder to align ST slices from different time points, generating a 4D atlas of gene expression dynamics.
Whole-embryo ST at subcellular resolution has been achieved in model organisms, providing a template for organoid alignment [28]. Wan et al. [28] demonstrated that subcellular-resolution ST can resolve nuclear and cytoplasmic transcripts, enabling precise mapping of cell states within developing tissues. sc3D is a comprehensive tool for 3D ST analysis that reconstructs volumetric gene expression from serial sections [29]. It uses iterative registration and interpolation to fill gaps between slices, producing a continuous 3D expression map that can be overlaid onto organoid structural models.
Viewer-based visualization platforms allow researchers to interactively explore aligned ST data within organoid geometries. These platforms typically support multi-resolution rendering, allowing users to zoom from whole-organoid views to individual cell neighborhoods [30]. STHELAR provides a multi-tissue dataset linking ST and histology for cell type annotation, which can serve as a reference for organoid alignment [30].
Computational Workflow
The following Mermaid diagram summarizes a typical computational workflow for spatial transcriptomics alignment and cellular neighborhood analysis.
flowchart TD
A[Raw ST Data: Spot x Gene Matrix + Coordinates], > B[Preprocessing: Normalization, QC, Batch Correction]
B, > C[Histology Image Registration]
C, > D[Multi-Slice Alignment: GALA, TA-OT, DGAE]
D, > E[Cell-Type Deconvolution: Graph Contrastive, AGED, PRIME]
E, > F[Spatial Domain Detection: SpatialDG, DuaST, STCF]
F, > G[Cellular Neighborhood Identification]
G, > H[Cell-Cell Communication Inference: SPADE, GAMMI, Directional Diffusion]
H, > I[3D Organoid Integration: stVCR, sc3D]
I, > J[Visualization & Interpretation: STHELAR, Viewer Platforms]
Challenges and Future Directions
Despite rapid progress, several challenges remain. First, the resolution gap between ST platforms (typically 10-100 μm spot diameter) and single-cell dimensions (10-20 μm) limits the accuracy of deconvolution and neighborhood definition [6, 18]. Emerging near-cellular ST methods, such as Well-ST-seq, use deterministic barcoded bead arrays to achieve higher resolution at reduced cost [6]. Second, batch effects across different ST experiments complicate integration, although methods like CIPHER (an end-to-end framework for designing optimized aggregated ST experiments) aim to minimize these effects through experimental design [7].
Third, the computational cost of alignment and neighborhood analysis scales poorly with the number of spots and slices. Efficient graph-based methods, such as those implemented in HisCMCL (cross-modal contrastive learning with hierarchical multi-scale fusion), reduce memory requirements by processing data in hierarchical patches [9]. Fourth, the lack of ground truth for cell-cell interactions in complex tissues makes validation difficult. Benchmarking datasets with known spatial organization, such as those generated by Yan et al. [14], are essential for method evaluation.
In veterinary applications, the availability of species-specific reference atlases remains limited. Cross-species alignment methods, such as those benchmarked by Wang et al. [15], may enable the transfer of knowledge from well-characterized model organisms to livestock and companion animals. Additionally, the integration of ST with other omics layers (e.g., proteomics, metabolomics) will provide a more complete picture of tissue function [20, 25].
Conclusion
Spatial transcriptomics alignment and cellular neighborhood analysis are foundational computational tasks for understanding tissue organization in health and disease. Advances in deep learning, graph neural networks, and optimal transport have enabled robust alignment across slices and platforms, accurate cell-type deconvolution, and the identification of functionally relevant cellular neighborhoods. The integration of ST data with 3D organoid models further extends the utility of these methods for studying development, host-pathogen interactions, and tissue regeneration in veterinary species. Continued method development, coupled with the generation of high-quality reference datasets, will drive the adoption of ST in veterinary diagnostics and research.
References
[1] Zhu J, Deng R, Guo J, et al. A comprehensive survey of computer vision methods for spatial transcriptomics. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42184118/
[2] Li X, Zhang N, Jin Z. SPADE: A Deep Learning Framework for Spatial Mapping and Quantitative Cell-Cell Interaction Inference. Adv Sci (Weinh). 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42314058/
[3] He X, Zhao H, Wang D, et al. Holistic Invariant Retracing for Distortion-Resilient Multi-modal Learning in Spatial Transcriptomics. IEEE Trans Image Process. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42275334/
[4] Dong Y, Ai H, Yao S, et al. Graph Contrastive Learning for Inferring Spatial Cell Composition from Integrated Single-cell RNA Sequencing and Spatial Transcriptomics. IEEE J Biomed Health Inform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42262950/
[5] Fan A, Shang J, Zhang X, et al. SpaVGMC: A Unified Representation Learning Framework via Structural and Semantic Alignment in Spatial Transcriptomics. J Chem Inf Model. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42253102/
[6] Yu N, Jin Z, Zhu S, et al. Well-ST-seq: Cost-Effective and Near-Cellular Spatial Transcriptomics Using Deterministic Barcoded Bead Arrays. Anal Chem. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42246548/
[7] Hemminger Z, De Ocampo H, Xie F, et al. CIPHER: An end-to-end framework for designing optimized aggregated spatial transcriptomics experiments. PLoS Comput Biol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42241464/
[8] Wu X, Wang X, Wang J, et al. Atlas-Level Single-Cell and Spatial Transcriptomics Data Integration via PRIME. bioRxiv. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42239079/
[9] Liu C, Zhu F, Min W. HisCMCL: Cross-Modal Contrastive Learning with Hierarchical Multi-Scale Fusion for Spatial Expression Prediction. Bioinformatics. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42213081/
[10] Mao Y, Quan L, Chen X, et al. Identifying batch-integrated domains from spatial transcriptomics via graph autoencoder with contrastive learning based on cross-modality and data augmentation. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42184117/
[11] Zhang Z, Yepes AJ, Bian J, et al. GatorDuo: Global-Consistency Dual-Graph Refinement With Pseudo-Label Agreement for Spatial Transcriptomics. bioRxiv. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42182473/
[12] Ding T, Zeng P. GALA: a unified landmark-free framework for coarse-to-fine spatial alignment across resolutions and modalities in spatial transcriptomics. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42007518/
[13] Ceccarelli F, Liò P, Saez-Rodriguez J, et al. Topography-aware optimal transport for alignment of spatial omics data. Cell Rep Methods. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41916307/
[14] Yan Y, Gu T, Sun C, et al. Benchmarking alignment methods for spatial transcriptomics data. Nat Comput Sci. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41933187/
[15] Wang Y, Liu Z, Zang Q, et al. Alignment of spatial transcriptomics slices across diseases, platforms and conditions. Genome Med. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41928322/
[16] Li A, Wang R, Duan X, et al. DGAE: Dynamic Graph Convolutional Network for Multi-Slice Spatial Transcriptomics Alignment and Enhancement. IEEE Trans Comput Biol Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41950130/
[17] Zhai Z, Wang C, Jiang C, et al. IntegrateRigor: annotation-free integration optimization for cell identity recovery reveals cancer-immune interface niches. bioRxiv. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42182279/
[18] Yang X, Wang Y, Chen X. Attention-guided enhanced deconvolution enables reference-free cell type estimation in spatial transcriptomics. Sci Rep. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41667720/ *** Disclaimer: This article is for educational and informational purposes only. It is not intended to substitute for professional veterinary advice, diagnosis, treatment, or regulatory guidance. Always consult a licensed veterinarian or qualified specialist regarding animal health, disease diagnosis, and therapeutic decisions.
[19] Barkasi M, Pham CN, Neophytou D, et al. Logistic regression for estimating functional effects with spatial transcriptomics. Nucleic Acids Res. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42137981/
[20] Yu Y, Long M, Song J, et al. GAMMI: graph-guided contrastive and adversarial integration of single-cell and spatial multi-omics data. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42108634/
[21] Wu J, Oshinjo A, Izzi V. SpatialDG: a novel spatial domain identification method for spatially resolved transcriptomics data based on dual-graph neural network. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41947419/
[22] Liang X, Liu P, Li J, et al. DuaST: an integrated deep learning framework for spatial transcriptomics with cross-branch interaction. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41985058/
[23] Zhu Z, Liang K, Meng L, et al. STCF: Multi-View Clustering for Spatial Transcriptomics Based on Cross-View Fusion. IEEE Trans Pattern Anal Mach Intell. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41701605/
[24] Hussain S, Ayoub M, Ye F, et al. MLN2SVG: domain-aware spatially variable gene detection using contrastive variational autoencoder and multi-level neighbor search. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42080589/
[25] Wang H, Yuan Z, Su Y, et al. Dissecting spatial patterning and signaling with directional diffusion in spatial multi-omics. Proc Natl Acad Sci U S A. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41785324/
[26] Zhou J, Li S, Han R, et al. GR2ST: spatial transcriptomics prediction based on graph-enhanced multimodal contrastive learning. Bioinformatics. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42036805/
[27] Peng Q, Zhou P, Li T. stVCR: spatiotemporal dynamics of single cells. Nat Methods. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41820580/
[28] Wan Y, El Kholtei J, Jenie I, et al. Whole-embryo spatial transcriptomics at subcellular resolution from gastrulation to organogenesis. Science. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41818362/
[29] Sendra M, Bolondi A, Guignard L. sc3D: A Comprehensive Tool for 3D Spatial Transcriptomic Analysis. Bio Protoc. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41769261/
[30] Giraud-Sauveur F, Blampey Q, Benkirane H, et al. STHELAR, a multi-tissue dataset linking spatial transcriptomics and histology for cell type annotation. Sci Data. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41820393/
[31] Egbon OA, Atitey K, Li J, et al. Spatial-ZEDNet : a unified spatial transcriptomics framework for detecting differential gene activation and expression. Brief Bioinform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42001470/
[32] Yang Y, Unjitwattana T, Zhou S, et al. STDrug enables spatially informed personalized drug repurposing from spatial transcriptomics. bioRxiv. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41993522/
[33] Zhang W, Chen T, Xu W, et al. Img2Gene: Debiased Spatially Resolved Transcriptomics with Biological Context from Pathology Images. IEEE J Biomed Health Inform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41945845/
[34] Xia L, Ding Z, Zhang X, et al. Leveraging Spot-Gene Heterogeneous Graphs for Unified Spatially Resolved Transcriptomics Domain Detection on Single-Slice and Multi-Slice Data. Genes (Basel). 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41898844/
[35] Wang K, Shi L, Li X, et al. Integrating histology and spatial transcriptomics via multimodal transformers and contrastive representation learning for accurate gene expression prediction. J Biomed Inform. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41763376/