Emerging AI Approaches for Cancer Spatial Omics
1.This paper introduces a comprehensive review of AI paradigms in cancer spatial omics, highlighting three key conceptual frameworks: data-driven spatial AI, constraint-based spatial AI, and mechanistic spatial modeling. The authors argue that interpretability and mechanistic grounding are crucial for clinical and biological utility.
2.Data-driven spatial AI is exemplified by foundation models trained on large histopathology or omics datasets. These models can perform tasks such as classification and annotation but often operate as black boxes, with limited biological interpretability.
3.Histopathology foundation models like DINOv2, Virchow, UNI, and GPFM are transformer-based and trained on millions of H&E slides. They excel in image tasks such as segmentation and diagnosis and are increasingly integrated with language models (e.g., CONCH, PathChat) for richer, multimodal interpretations.
4.Spatial omics foundation models are at an earlier stage, but tools like CellPLM and Nicheformer extend transformer methods to spatial transcriptomics, combining spatial and dissociated RNA data for tasks like label transfer and cell annotation.
5.Spatial proteomic models are emerging but face challenges in marker standardization. Recent efforts such as KRONOS demonstrate how tokenization strategies can improve model performance across heterogeneous datasets.
6.To improve interpretability, post-hoc techniques like SHAP, LIME, and attention mechanisms are employed. There is also growing interest in explainable structures like Functional Tissue Units (FTUs) and graph-based representations (e.g., SpaGFT) to ground features in biological contexts.
7.Constraint-based spatial AI introduces assumptions from biophysics and information theory. For example, models inspired by information bottleneck theory aim to compress spatial omics data while preserving biologically predictive signals.
8.Image diffusion models, adapted from generative AI (e.g., DALL-E, Stable Diffusion), are now applied to spatial omics. Methods like stDiff, DiffuST, and SpatialDiffusion use diffusion transformers and graph neural networks to impute missing spatial transcriptomics data and interpolate 3D structures.
9.Mechanistic spatial modeling offers the highest interpretability, aiming to infer biophysical processes directly. Physics-Informed Neural Networks (PINNs) and Kolmogorov–Arnold Networks (KANs) are proposed to learn equations from spatial data, capturing diffusion, tissue stiffness, and mechanical forces.
10.Cancer evolution can be tracked using spatial transcriptomics. Models like InferCNV, CalicoST, and transformer-based CNV callers are being adapted to spatial data to infer subclonal architecture and evolutionary pressures. There is potential in integrating expression, genotype, and morphology into unified AI models.
11.Data integration remains a challenge. Combining H&E, spatial transcriptomics, and proteomics requires careful standardization and alignment. Methods like SpatialGlue, COVET, and OmiCLIP aim to merge these modalities for domain identification and gene prediction.
12.The paper advocates for mechanism-driven data generation, particularly using mouse models. Mice allow dynamic and perturbative spatial omics data collection, crucial for training models like PINNs. Cross-species alignment tools (e.g., BrainAlign, Nicheformer) are in development to bridge human and mouse data.
13.Validation of spatial foundation models is still lacking. The authors argue that benchmarking should occur at multiple spatial scales and reflect the ability to distinguish fine-grained microenvironments. Mechanistic and constraint-based approaches may offer more interpretable and data-efficient alternatives.
14.The authors conclude that future progress hinges on curated cross-species datasets, consensus benchmarking tasks, and interpretable AI frameworks. These will enable better identification of functional tissue units and enhance clinical translation in cancer spatial omics.
📜Paper:
arxiv.org/abs/2506.23857
#SpatialOmics #CancerAI #DeepLearning #ComputationalPathology #SpatialTranscriptomics #FoundationModels #ExplainableAI #TissueBiophysics #PINNs