Literature Review: Spatial Niche Analysis Workflows in Cancer¶
This page reviews 8 published studies that perform end-to-end spatial niche analysis in cancer tissues. Each paper chains multiple analysis steps — niche identification, cell-cell communication, gene programs, clinical validation — providing methodological templates for our hackathon pipeline.
Selection criteria
Papers were selected for: (1) end-to-end niche workflows (not single-tool papers), (2) cancer tissues, (3) high-impact venues, (4) relevance to our pipeline design (composition-based niches, communication, cross-platform validation).
Summary Table¶
| Paper | Cancer | Platform(s) | Niche Method | CCC | Gene Programs | Validation | Cells |
|---|---|---|---|---|---|---|---|
| Wang/Liu 2025 | Pan-cancer (10 types) | MERSCOPE, CosMx, Visium, COMET, Xenium, CODEX, IMC | Neighborhood composition clustering | CellChat L-R | NMF programs | 7 platforms, 10 cancers | 14M+ |
| Peng/Kadara 2026 | Lung (AAH→LUAD) | Visium CytAssist, Xenium 5K, COMET | cell2location deconvolution + co-localization | IL1B-IL1R1 axis | Stage-specific programs | Independent cohort + mouse model | 5.4M |
| Meyer/Bodenmiller 2025 | TNBC | IMC (39-plex) | imcRtools + spicyR spatial interaction | CD8-tumor interaction phenotyping | RNA-seq + scRNA-seq validation | 8 cohorts (~4,000 patients) | ~1M |
| Wang/Ali 2023 | TNBC | IMC (43-plex) | Cell mask adjacency + permutation test | Direct contact scoring | Activation state profiling | 3 timepoints, 660 images | >1M |
| Sorin 2023 | Lung adenocarcinoma | IMC (35-plex) | Schurch-style cellular neighborhoods | Co-localization analysis | Protein activation states | Independent cohort (n=60) | 1.6M |
| Arora 2023 | OSCC + pan-cancer | Visium + scRNA-seq | Pathologist-annotated core/edge regions | L-R interaction analysis | DE programs per region | TCGA pan-cancer | Spot-level |
| Klughammer 2024 | Metastatic breast | Slide-seq, MERFISH, ExSeq, CODEX | Neighborhood composition clustering | Co-localization across modalities | Spatial DE programs | 4 platforms on serial sections | 67 biopsies |
| Prakrithi 2025 | Skin (cSCC, BCC, melanoma) | 12 technologies (Xenium, CosMx, CODEX, Visium, ...) | Meta-communities across platforms | CellChat + stLearn SCTP | Joint pathway analysis | 12 platforms on serial sections | 131K+ |
Per-Paper Analysis¶
1. Wang/Liu 2025 — CAF Spatial Subtypes¶
Conserved spatial subtypes and cellular neighborhoods of cancer-associated fibroblasts revealed by single-cell spatial multi-omics. Cancer Cell 43(5): 905-924 (2025). DOI
Hackathon organizer's paper
Senior author: Linghua Wang (MD Anderson). This paper directly demonstrates the composition-based niche approach we are using with CellCharter.
Workflow:
- Cell segmentation and annotation across MERSCOPE (500-plex) and CosMx (1,000-plex) on 8 cancer types, 24 tissue sections, >5.7M cells
- Spatial graph construction → neighborhood composition profiling for each CAF
- Clustering CAFs by neighborhood composition → 4 conserved spatial CAF subtypes
- NMF-based gene program identification per subtype
- CellChat ligand-receptor communication analysis per subtype
- Cross-platform validation on Visium, COMET, Xenium, CODEX, IMC (5 additional platforms)
- Cross-cancer validation across 10 cancer types (14M+ total cells)
- Clinical association: CAF subtype composition → immune infiltration, TLS, TAM distribution, survival
What we can learn:
- Composition-based niche definition works pan-cancer. CAF subtypes defined by neighboring cell composition are conserved across 10 cancers and 7 platforms — strong evidence for our CellCharter approach.
- Cross-platform validation is achievable even when key markers are missing from some panels. They used neighborhood-based classification (not marker-based) to transfer subtypes across platforms.
- Communication follows niche identity. Each spatial CAF subtype has distinct L-R interaction networks, supporting our pipeline ordering (niche ID first, then communication).
2. Peng/Kadara 2026 — Lung Precancer Niche Co-evolution¶
Multimodal spatial-omics reveal co-evolution of alveolar progenitors and proinflammatory niches in progression of lung precursor lesions. Cancer Cell 44(2): 321-339 (2026). DOI
Hackathon organizer's paper
Corresponding authors: Humam Kadara and Linghua Wang (MD Anderson). Uses Visium + Xenium + COMET — the same platforms as our D1/D4 datasets.
Workflow:
- Visium CytAssist on 56 tissues from 25 patients spanning AAH → AIS → MIA → LUAD
- scFFPE-seq (Chromium Flex) on 75 matched tissues as single-cell reference
- cell2location deconvolution of Visium spots using scFFPE-seq reference
- Co-localization analysis → identification of epithelial-proinflammatory niches (alveolar progenitors + IL1B-high macrophages)
- Stage-specific transcriptional program identification
- L-R communication analysis (IL1B-IL1R1 axis)
- WES on 79 samples for clonal architecture
- Xenium 5K validation at single-cell resolution (6 pairs of lesions)
- COMET spatial proteomics for protein-level validation
- Independent validation cohort (36 lesions, 19 patients)
- Mouse model validation + IL-1B neutralization therapeutic experiment
What we can learn:
- Multi-resolution strategy: Discovery at spot-level (Visium, cheaper/broader), validation at single-cell level (Xenium). We could adopt this for D4 (Visium v2 + Xenium).
- Deconvolution as niche proxy: cell2location on Visium spots effectively identifies niches without explicit niche-calling tools — an alternative to CellCharter for spot-level data.
- Niche → causal biology path: They went from spatial niche discovery → mouse model → therapeutic intervention. This is the gold standard for translational niche analysis.
- Same lab, same platforms as our hackathon. Their workflow on Visium + Xenium + COMET is directly transferable.
3. Meyer/Bodenmiller 2025 — TNBC Stratification¶
A stratification system for breast cancer based on basoluminal tumor cells and spatial tumor architecture. Cancer Cell 43(9): 1637-1655 (2025). DOI
Workflow:
- IMC (39-plex) on TMAs from 215 TNBC patients (~495 images, ~1M cells)
- steinbock preprocessing → Mesmer deep learning segmentation
- Two-step epithelial classification: graph clustering + Gaussian Mixture Model on panCK/E-cadherin
- Unsupervised clustering → 110 fine clusters → 9 metaclusters → 11 tumor cell phenotypes
- imcRtools spatial interaction testing between tumor metaclusters and immune cells
- spicyR permutation-based spatial interaction analysis
- CD8 T cell spatial phenotyping: infiltrating / bystanding / excluded
- Five-subtype stratification combining CK expression + spatial immune phenotype
- Survival analysis (Cox PH, Lasso-regularized Cox)
- Validation across 8 independent cohorts (~4,000 patients) including ISPY2 trial
What we can learn:
- Spatial interaction testing as niche characterization. imcRtools + spicyR provide a rigorous statistical framework for testing whether cell-cell co-localization is significant — complementary to composition-based approaches.
- Clinical stratification from spatial niches. Their 5-subtype system (INF > N > B > L > BL) directly predicts survival and immunotherapy response — demonstrating the clinical utility of niche analysis.
- Massive validation depth. 8 cohorts, ~4,000 patients, multiple modalities (IMC, mIF, RNA-seq, scRNA-seq, organoids). Sets the standard for how thoroughly niche findings should be validated.
4. Wang/Ali 2023 — Immunotherapy Spatial Predictors¶
Spatial predictors of immunotherapy response in triple-negative breast cancer. Nature 621: 868-876 (2023). DOI
Workflow:
- IMC (43-plex) on FFPE biopsies from the NeoTRIPaPDL1 trial at 3 timepoints (baseline n=243, on-treatment n=207, post-treatment n=210)
- steinbock preprocessing → Mesmer/DeepCell segmentation
- Semi-supervised phenotyping → 37 cell phenotypes (17 epithelial + 20 TME)
- Spatial interaction: direct cell mask adjacency + permutation tests (not KNN neighborhoods)
- Feature extraction: 148 features per image (densities, interaction scores, proliferative fractions)
- Univariate logistic regression → odds ratios for pCR association
- Regularized logistic regression → multivariate prediction (AUC = 0.82)
- 100 repeated random train/test splits for internal validation
What we can learn:
- Adjacency-based spatial analysis (direct cell contact) is a simpler alternative to neighborhood composition that still yields strong clinical predictions.
- Longitudinal spatial profiling (3 timepoints) reveals how niches remodel during therapy — an analysis dimension our hackathon could consider if temporal data is available.
- Feature engineering matters. Their 148-feature framework (density + interaction + proliferation) is a practical template for extracting clinically relevant features from niche analysis results.
5. Sorin 2023 — Lung Immune Landscapes¶
Single-cell spatial landscapes of the lung tumour immune microenvironment. Nature 614: 548-554 (2023). DOI
Workflow:
- IMC (35-plex, Hyperion) on TMAs from 416 LUAD patients (1 mm^2 cores)
- Hybrid segmentation: Mask R-CNN + MATLAB post-processing + EM-GMM refinement
- Rule-based cell phenotyping → 17 cell types
- Cellular neighborhood analysis following Schurch et al. (Cell, 2020) — KNN neighborhood composition vectors → clustering into recurrent neighborhoods
- Survival analysis: neighborhoods → Kaplan-Meier + Cox PH
- ResNet-50 deep learning on raw IMC images (37 channels) → binary progression prediction
- Validation cohort: 60 patients, 2 cores each
What we can learn:
- Schurch-style cellular neighborhoods remain the most widely used niche definition in IMC studies. This is essentially what CellCharter automates with a learned latent space.
- Deep learning on spatial images can predict clinical outcomes directly from raw pixel data — bypassing cell segmentation and phenotyping entirely.
- Scale matters. 416 patients is large for spatial studies. Most niche findings are underpowered; survival associations require hundreds of patients.
6. Arora 2023 — Tumor Core vs Edge¶
Spatial transcriptomics reveals distinct and conserved tumor core and edge architectures that predict survival and targeted therapy response. Nature Communications 14: 5029 (2023). DOI
Workflow:
- Visium spatial transcriptomics on HPV-negative oral squamous cell carcinoma (OSCC)
- scRNA-seq as reference for deconvolution
- Pathologist annotation of tumor core (TC) vs. leading edge (LE) regions
- Differential gene expression between TC and LE
- Cell-type composition analysis per region (deconvolution)
- Cell-cell communication inference (L-R analysis)
- Pan-cancer signature validation using TCGA cohorts
- Drug response prediction from spatial architecture signatures
What we can learn:
- Anatomical niche definition (core vs. edge) is simpler than algorithmic approaches but biologically meaningful — edge signatures are conserved across cancers.
- Pan-cancer transferability. Spatial signatures derived from one cancer type can predict outcomes in others via TCGA validation — useful if our hackathon findings need broader contextualization.
- Spot-level limitation. Visium resolution limits niche granularity compared to single-cell platforms (Xenium, CosMx). Our pipeline uses single-cell platforms for this reason.
7. Klughammer 2024 — Metastatic Breast Multi-Modal¶
A multi-modal single-cell and spatial expression map of metastatic breast cancer. Nature Medicine 30: 3236-3249 (2024). DOI
Workflow:
- Serial sections from 67 biopsies (60 patients, 9 metastatic sites) profiled on 4 spatial platforms: Slide-seq, MERFISH, ExSeq, CODEX
- scRNA-seq/snRNA-seq as dissociated reference
- Cell2location deconvolution for Slide-seq; direct classification for MERFISH/CODEX
- Neighborhood composition analysis → spatial niche clustering
- Macrophage spatial phenotyping (3 patterns: short-range accumulation, long-range accumulation, intermixing)
- Spatial DE: genes differentially expressed near vs. far from T/NK cells
- Cross-platform concordance: pseudobulk correlation + cell-type composition comparison across all 4 platforms on same biopsies
What we can learn:
- Cross-platform concordance is quantifiable. Pseudobulk expression correlation and cell-type composition comparison across serial sections provide concrete metrics for our RNA vs. protein concordance analysis (Step 4).
- Macrophage spatial phenotypes reveal that the same cell type can have fundamentally different spatial organizations — niche context matters beyond cell-type identity.
- Multi-modal on same tissue is the gold standard. Our D1 (Xenium + Protein on same tissue) mirrors this design.
8. Prakrithi 2025 — 12-Technology Skin Cancer¶
Integrating 12 spatial and single cell technologies to characterise tumour neighbourhoods and cellular interactions in three skin cancer types. bioRxiv (2025). PMC
Workflow:
- 24 skin donors (cSCC, BCC, melanoma) profiled across 12 technologies: scRNAseq, FLEX snRNAseq, Visium, Xenium, CosMx, GeoMx CTA, GeoMx Protein, Opal, RNAScope, PLA, MALDI glycomics, CODEX
- Cancer cell identification: consensus CNV (InferCNV + CopyKat must agree) + cancer module score
- Per-platform community detection → 10 communities per technology
- Cross-platform community correlation → 4 meta-communities (immune, KC, stromal, tumour)
- CellChat on scRNAseq → 312 L-R pairs; stLearn SCTP for spatially-constrained interaction testing on Visium/CosMx
- MMCCI integration: combines interaction scores across platforms and replicates
- Joint pathway analysis via MetaboAnalyst (genes + proteins + metabolites → KEGG)
- Experimental validation: RNAScope, Opal Polaris, PLA (proximity ligation assay)
- GWAS integration via gsMAP
What we can learn:
- No single technology captures all niche features. Communities from Xenium (RNA), CODEX (protein), and MALDI (metabolites) reveal complementary aspects of the same spatial neighborhoods — directly supports our RNA vs. protein concordance analysis.
- Meta-communities across platforms is a practical strategy for cross-platform niche comparison. Pearson correlation of cell-type composition vectors is a simple, effective metric.
- Spatially-constrained CCC catches what non-spatial misses. stLearn SCTP found interactions (WNT5A-ROR1) that all three spatial platforms detected but scRNAseq missed — validating our choice of spatially-aware communication tools (NicheCompass, COMMOT).
Patterns and Gaps¶
What most studies do¶
| Step | Common approach | Used by |
|---|---|---|
| Niche ID | Composition-based: KNN neighborhood → cell-type frequency vector → clustering | 6/8 papers |
| Communication | CellChat (most common), with or without spatial constraints | 4/8 papers |
| Gene programs | Differential expression or NMF between niches | 5/8 papers |
| Clinical outcome | Survival analysis (Cox PH, Kaplan-Meier) | 6/8 papers |
| Segmentation | Mesmer/DeepCell (most common for IMC/protein) | 3/8 papers |
| Deconvolution | cell2location (for spot-level data like Visium) | 2/8 papers |
What few studies do (our opportunity)¶
| Analysis | Papers that do it | Our pipeline |
|---|---|---|
| Cross-modality niche concordance (RNA vs protein) | Klughammer (4 platforms), Prakrithi (12 platforms) | CellCharter on RNA vs protein independently — more systematic than either |
| Spatially-aware communication (not just CellChat) | Prakrithi (stLearn SCTP) | NicheCompass — communication-based niche definition, more principled |
| Context-dependent gene programs (niche-specific DE) | None use Niche-DE specifically | Niche-DE — tests whether expression depends on niche context, not just cell type |
| Systematic cross-platform validation | Wang/Liu (7 platforms), Prakrithi (12 platforms) | Discovery on D1+D4, validation on V1 — different platforms, different tissue |
Key insight for our hackathon¶
Most studies follow the same workflow: IMC/CODEX → Schurch-style neighborhoods → CellChat → survival. This is well-established but has known limitations:
- Composition-only niches miss gene expression context (COVET/BANKSY address this)
- Non-spatial CCC (standard CellChat) detects implausible long-range signals (NicheCompass/COMMOT address this)
- Standard DE has 86-95% false positive rates on spatial data (Niche-DE addresses this)
- Cross-modality concordance is rarely tested systematically
Our pipeline addresses all four gaps. The literature review confirms that our tool choices (CellCharter, NicheCompass, Niche-DE) and our novel Step 4 (RNA vs protein concordance) fill genuine methodological holes in the field.
Tool Usage Across Papers¶
| Tool | Papers Using It | Our Pipeline |
|---|---|---|
| cell2location | Peng/Kadara, Klughammer | Not needed (single-cell platforms) |
| CellChat | Wang/Liu, Prakrithi, (Arora likely) | Considered; chose NicheCompass instead |
| steinbock + Mesmer | Meyer/Bodenmiller, Wang/Ali | Not needed (10x pipelines handle segmentation) |
| imcRtools + spicyR | Meyer/Bodenmiller | Not needed (IMC-specific) |
| Scanpy/AnnData | Wang/Liu, Peng/Kadara, Prakrithi | Yes — data infrastructure |
| Squidpy | Wang/Liu (likely), Klughammer (likely) | Yes — spatial graph construction |
| CellCharter | Referenced in Wang/Liu context | Our primary niche tool |
| NicheCompass | Own methods paper (Birk 2025) | Our communication tool |
| Niche-DE | Own methods paper (Bai 2024) | Our gene program tool |
Gap confirmed
No published cancer study has used CellCharter + NicheCompass + Niche-DE together in a single pipeline. Our combination is novel.
References¶
- Liu Y, Sinjab A, et al. Conserved spatial subtypes and cellular neighborhoods of cancer-associated fibroblasts. Cancer Cell 43(5): 905-924 (2025).
- Peng F, Sinjab A, et al. Multimodal spatial-omics reveal co-evolution of alveolar progenitors and proinflammatory niches. Cancer Cell 44(2): 321-339 (2026).
- Meyer D, et al. A stratification system for breast cancer based on basoluminal tumor cells and spatial tumor architecture. Cancer Cell 43(9): 1637-1655 (2025).
- Wang XQ, Danenberg E, et al. Spatial predictors of immunotherapy response in triple-negative breast cancer. Nature 621: 868-876 (2023).
- Sorin M, et al. Single-cell spatial landscapes of the lung tumour immune microenvironment. Nature 614: 548-554 (2023).
- Arora R, Cao C, et al. Spatial transcriptomics reveals distinct and conserved tumor core and edge architectures. Nature Communications 14: 5029 (2023).
- Klughammer J, et al. A multi-modal single-cell and spatial expression map of metastatic breast cancer. Nature Medicine 30: 3236-3249 (2024).
- Prakrithi P, Grice LF, et al. Integrating 12 spatial and single cell technologies to characterise tumour neighbourhoods. bioRxiv (2025).
See also: Tool Selection for our recommended pipeline based on these insights.