Literature Review: Spatial Niche Analysis Workflows in Cancer¶

This page reviews 8 published studies that perform end-to-end spatial niche analysis in cancer tissues. Each paper chains multiple analysis steps — niche identification, cell-cell communication, gene programs, clinical validation — providing methodological templates for our hackathon pipeline.

Selection criteria

Papers were selected for: (1) end-to-end niche workflows (not single-tool papers), (2) cancer tissues, (3) high-impact venues, (4) relevance to our pipeline design (composition-based niches, communication, cross-platform validation).

Summary Table¶

Paper	Cancer	Platform(s)	Niche Method	CCC	Gene Programs	Validation	Cells
Wang/Liu 2025	Pan-cancer (10 types)	MERSCOPE, CosMx, Visium, COMET, Xenium, CODEX, IMC	Neighborhood composition clustering	CellChat L-R	NMF programs	7 platforms, 10 cancers	14M+
Peng/Kadara 2026	Lung (AAH→LUAD)	Visium CytAssist, Xenium 5K, COMET	cell2location deconvolution + co-localization	IL1B-IL1R1 axis	Stage-specific programs	Independent cohort + mouse model	5.4M
Meyer/Bodenmiller 2025	TNBC	IMC (39-plex)	imcRtools + spicyR spatial interaction	CD8-tumor interaction phenotyping	RNA-seq + scRNA-seq validation	8 cohorts (~4,000 patients)	~1M
Wang/Ali 2023	TNBC	IMC (43-plex)	Cell mask adjacency + permutation test	Direct contact scoring	Activation state profiling	3 timepoints, 660 images	>1M
Sorin 2023	Lung adenocarcinoma	IMC (35-plex)	Schurch-style cellular neighborhoods	Co-localization analysis	Protein activation states	Independent cohort (n=60)	1.6M
Arora 2023	OSCC + pan-cancer	Visium + scRNA-seq	Pathologist-annotated core/edge regions	L-R interaction analysis	DE programs per region	TCGA pan-cancer	Spot-level
Klughammer 2024	Metastatic breast	Slide-seq, MERFISH, ExSeq, CODEX	Neighborhood composition clustering	Co-localization across modalities	Spatial DE programs	4 platforms on serial sections	67 biopsies
Prakrithi 2025	Skin (cSCC, BCC, melanoma)	12 technologies (Xenium, CosMx, CODEX, Visium, ...)	Meta-communities across platforms	CellChat + stLearn SCTP	Joint pathway analysis	12 platforms on serial sections	131K+

Per-Paper Analysis¶

1. Wang/Liu 2025 — CAF Spatial Subtypes¶

Conserved spatial subtypes and cellular neighborhoods of cancer-associated fibroblasts revealed by single-cell spatial multi-omics. Cancer Cell 43(5): 905-924 (2025). DOI

Hackathon organizer's paper

Senior author: Linghua Wang (MD Anderson). This paper directly demonstrates the composition-based niche approach we are using with CellCharter.

Workflow:

Cell segmentation and annotation across MERSCOPE (500-plex) and CosMx (1,000-plex) on 8 cancer types, 24 tissue sections, >5.7M cells
Spatial graph construction → neighborhood composition profiling for each CAF
Clustering CAFs by neighborhood composition → 4 conserved spatial CAF subtypes
NMF-based gene program identification per subtype
CellChat ligand-receptor communication analysis per subtype
Cross-platform validation on Visium, COMET, Xenium, CODEX, IMC (5 additional platforms)
Cross-cancer validation across 10 cancer types (14M+ total cells)
Clinical association: CAF subtype composition → immune infiltration, TLS, TAM distribution, survival

What we can learn:

Composition-based niche definition works pan-cancer. CAF subtypes defined by neighboring cell composition are conserved across 10 cancers and 7 platforms — strong evidence for our CellCharter approach.
Cross-platform validation is achievable even when key markers are missing from some panels. They used neighborhood-based classification (not marker-based) to transfer subtypes across platforms.
Communication follows niche identity. Each spatial CAF subtype has distinct L-R interaction networks, supporting our pipeline ordering (niche ID first, then communication).

2. Peng/Kadara 2026 — Lung Precancer Niche Co-evolution¶

Multimodal spatial-omics reveal co-evolution of alveolar progenitors and proinflammatory niches in progression of lung precursor lesions. Cancer Cell 44(2): 321-339 (2026). DOI

Hackathon organizer's paper

Corresponding authors: Humam Kadara and Linghua Wang (MD Anderson). Uses Visium + Xenium + COMET — the same platforms as our D1/D4 datasets.

Workflow:

Visium CytAssist on 56 tissues from 25 patients spanning AAH → AIS → MIA → LUAD
scFFPE-seq (Chromium Flex) on 75 matched tissues as single-cell reference
cell2location deconvolution of Visium spots using scFFPE-seq reference
Co-localization analysis → identification of epithelial-proinflammatory niches (alveolar progenitors + IL1B-high macrophages)
Stage-specific transcriptional program identification
L-R communication analysis (IL1B-IL1R1 axis)
WES on 79 samples for clonal architecture
Xenium 5K validation at single-cell resolution (6 pairs of lesions)
COMET spatial proteomics for protein-level validation
Independent validation cohort (36 lesions, 19 patients)
Mouse model validation + IL-1B neutralization therapeutic experiment

What we can learn:

Multi-resolution strategy: Discovery at spot-level (Visium, cheaper/broader), validation at single-cell level (Xenium). We could adopt this for D4 (Visium v2 + Xenium).
Deconvolution as niche proxy: cell2location on Visium spots effectively identifies niches without explicit niche-calling tools — an alternative to CellCharter for spot-level data.
Niche → causal biology path: They went from spatial niche discovery → mouse model → therapeutic intervention. This is the gold standard for translational niche analysis.
Same lab, same platforms as our hackathon. Their workflow on Visium + Xenium + COMET is directly transferable.

3. Meyer/Bodenmiller 2025 — TNBC Stratification¶

A stratification system for breast cancer based on basoluminal tumor cells and spatial tumor architecture. Cancer Cell 43(9): 1637-1655 (2025). DOI

Workflow:

IMC (39-plex) on TMAs from 215 TNBC patients (~495 images, ~1M cells)
steinbock preprocessing → Mesmer deep learning segmentation
Two-step epithelial classification: graph clustering + Gaussian Mixture Model on panCK/E-cadherin
Unsupervised clustering → 110 fine clusters → 9 metaclusters → 11 tumor cell phenotypes
imcRtools spatial interaction testing between tumor metaclusters and immune cells
spicyR permutation-based spatial interaction analysis
CD8 T cell spatial phenotyping: infiltrating / bystanding / excluded
Five-subtype stratification combining CK expression + spatial immune phenotype
Survival analysis (Cox PH, Lasso-regularized Cox)
Validation across 8 independent cohorts (~4,000 patients) including ISPY2 trial

What we can learn:

Spatial interaction testing as niche characterization. imcRtools + spicyR provide a rigorous statistical framework for testing whether cell-cell co-localization is significant — complementary to composition-based approaches.
Clinical stratification from spatial niches. Their 5-subtype system (INF > N > B > L > BL) directly predicts survival and immunotherapy response — demonstrating the clinical utility of niche analysis.
Massive validation depth. 8 cohorts, ~4,000 patients, multiple modalities (IMC, mIF, RNA-seq, scRNA-seq, organoids). Sets the standard for how thoroughly niche findings should be validated.

4. Wang/Ali 2023 — Immunotherapy Spatial Predictors¶

Spatial predictors of immunotherapy response in triple-negative breast cancer. Nature 621: 868-876 (2023). DOI

Workflow:

IMC (43-plex) on FFPE biopsies from the NeoTRIPaPDL1 trial at 3 timepoints (baseline n=243, on-treatment n=207, post-treatment n=210)
steinbock preprocessing → Mesmer/DeepCell segmentation
Semi-supervised phenotyping → 37 cell phenotypes (17 epithelial + 20 TME)
Spatial interaction: direct cell mask adjacency + permutation tests (not KNN neighborhoods)
Feature extraction: 148 features per image (densities, interaction scores, proliferative fractions)
Univariate logistic regression → odds ratios for pCR association
Regularized logistic regression → multivariate prediction (AUC = 0.82)
100 repeated random train/test splits for internal validation

What we can learn:

Adjacency-based spatial analysis (direct cell contact) is a simpler alternative to neighborhood composition that still yields strong clinical predictions.
Longitudinal spatial profiling (3 timepoints) reveals how niches remodel during therapy — an analysis dimension our hackathon could consider if temporal data is available.
Feature engineering matters. Their 148-feature framework (density + interaction + proliferation) is a practical template for extracting clinically relevant features from niche analysis results.

5. Sorin 2023 — Lung Immune Landscapes¶

Single-cell spatial landscapes of the lung tumour immune microenvironment. Nature 614: 548-554 (2023). DOI

Workflow:

IMC (35-plex, Hyperion) on TMAs from 416 LUAD patients (1 mm^2 cores)
Hybrid segmentation: Mask R-CNN + MATLAB post-processing + EM-GMM refinement
Rule-based cell phenotyping → 17 cell types
Cellular neighborhood analysis following Schurch et al. (Cell, 2020) — KNN neighborhood composition vectors → clustering into recurrent neighborhoods
Survival analysis: neighborhoods → Kaplan-Meier + Cox PH
ResNet-50 deep learning on raw IMC images (37 channels) → binary progression prediction
Validation cohort: 60 patients, 2 cores each

What we can learn:

Schurch-style cellular neighborhoods remain the most widely used niche definition in IMC studies. This is essentially what CellCharter automates with a learned latent space.
Deep learning on spatial images can predict clinical outcomes directly from raw pixel data — bypassing cell segmentation and phenotyping entirely.
Scale matters. 416 patients is large for spatial studies. Most niche findings are underpowered; survival associations require hundreds of patients.

6. Arora 2023 — Tumor Core vs Edge¶

Spatial transcriptomics reveals distinct and conserved tumor core and edge architectures that predict survival and targeted therapy response. Nature Communications 14: 5029 (2023). DOI

Workflow:

Visium spatial transcriptomics on HPV-negative oral squamous cell carcinoma (OSCC)
scRNA-seq as reference for deconvolution
Pathologist annotation of tumor core (TC) vs. leading edge (LE) regions
Differential gene expression between TC and LE
Cell-type composition analysis per region (deconvolution)
Cell-cell communication inference (L-R analysis)
Pan-cancer signature validation using TCGA cohorts
Drug response prediction from spatial architecture signatures

What we can learn:

Anatomical niche definition (core vs. edge) is simpler than algorithmic approaches but biologically meaningful — edge signatures are conserved across cancers.
Pan-cancer transferability. Spatial signatures derived from one cancer type can predict outcomes in others via TCGA validation — useful if our hackathon findings need broader contextualization.
Spot-level limitation. Visium resolution limits niche granularity compared to single-cell platforms (Xenium, CosMx). Our pipeline uses single-cell platforms for this reason.

A multi-modal single-cell and spatial expression map of metastatic breast cancer. Nature Medicine 30: 3236-3249 (2024). DOI

Workflow:

Serial sections from 67 biopsies (60 patients, 9 metastatic sites) profiled on 4 spatial platforms: Slide-seq, MERFISH, ExSeq, CODEX
scRNA-seq/snRNA-seq as dissociated reference
Cell2location deconvolution for Slide-seq; direct classification for MERFISH/CODEX
Neighborhood composition analysis → spatial niche clustering
Macrophage spatial phenotyping (3 patterns: short-range accumulation, long-range accumulation, intermixing)
Spatial DE: genes differentially expressed near vs. far from T/NK cells
Cross-platform concordance: pseudobulk correlation + cell-type composition comparison across all 4 platforms on same biopsies

What we can learn:

Cross-platform concordance is quantifiable. Pseudobulk expression correlation and cell-type composition comparison across serial sections provide concrete metrics for our RNA vs. protein concordance analysis (Step 4).
Macrophage spatial phenotypes reveal that the same cell type can have fundamentally different spatial organizations — niche context matters beyond cell-type identity.
Multi-modal on same tissue is the gold standard. Our D1 (Xenium + Protein on same tissue) mirrors this design.

8. Prakrithi 2025 — 12-Technology Skin Cancer¶

Integrating 12 spatial and single cell technologies to characterise tumour neighbourhoods and cellular interactions in three skin cancer types. bioRxiv (2025). PMC

Workflow:

24 skin donors (cSCC, BCC, melanoma) profiled across 12 technologies: scRNAseq, FLEX snRNAseq, Visium, Xenium, CosMx, GeoMx CTA, GeoMx Protein, Opal, RNAScope, PLA, MALDI glycomics, CODEX
Cancer cell identification: consensus CNV (InferCNV + CopyKat must agree) + cancer module score
Per-platform community detection → 10 communities per technology
Cross-platform community correlation → 4 meta-communities (immune, KC, stromal, tumour)
CellChat on scRNAseq → 312 L-R pairs; stLearn SCTP for spatially-constrained interaction testing on Visium/CosMx
MMCCI integration: combines interaction scores across platforms and replicates
Joint pathway analysis via MetaboAnalyst (genes + proteins + metabolites → KEGG)
Experimental validation: RNAScope, Opal Polaris, PLA (proximity ligation assay)
GWAS integration via gsMAP

What we can learn:

No single technology captures all niche features. Communities from Xenium (RNA), CODEX (protein), and MALDI (metabolites) reveal complementary aspects of the same spatial neighborhoods — directly supports our RNA vs. protein concordance analysis.
Meta-communities across platforms is a practical strategy for cross-platform niche comparison. Pearson correlation of cell-type composition vectors is a simple, effective metric.
Spatially-constrained CCC catches what non-spatial misses. stLearn SCTP found interactions (WNT5A-ROR1) that all three spatial platforms detected but scRNAseq missed — validating our choice of spatially-aware communication tools (NicheCompass, COMMOT).

Patterns and Gaps¶

What most studies do¶

Step	Common approach	Used by
Niche ID	Composition-based: KNN neighborhood → cell-type frequency vector → clustering	6/8 papers
Communication	CellChat (most common), with or without spatial constraints	4/8 papers
Gene programs	Differential expression or NMF between niches	5/8 papers
Clinical outcome	Survival analysis (Cox PH, Kaplan-Meier)	6/8 papers
Segmentation	Mesmer/DeepCell (most common for IMC/protein)	3/8 papers
Deconvolution	cell2location (for spot-level data like Visium)	2/8 papers

What few studies do (our opportunity)¶

Analysis	Papers that do it	Our pipeline
Cross-modality niche concordance (RNA vs protein)	Klughammer (4 platforms), Prakrithi (12 platforms)	CellCharter on RNA vs protein independently — more systematic than either
Spatially-aware communication (not just CellChat)	Prakrithi (stLearn SCTP)	NicheCompass — communication-based niche definition, more principled
Context-dependent gene programs (niche-specific DE)	None use Niche-DE specifically	Niche-DE — tests whether expression depends on niche context, not just cell type
Systematic cross-platform validation	Wang/Liu (7 platforms), Prakrithi (12 platforms)	Discovery on D1+D4, validation on V1 — different platforms, different tissue

Key insight for our hackathon¶

Most studies follow the same workflow: IMC/CODEX → Schurch-style neighborhoods → CellChat → survival. This is well-established but has known limitations:

Composition-only niches miss gene expression context (COVET/BANKSY address this)
Non-spatial CCC (standard CellChat) detects implausible long-range signals (NicheCompass/COMMOT address this)
Standard DE has 86-95% false positive rates on spatial data (Niche-DE addresses this)
Cross-modality concordance is rarely tested systematically

Our pipeline addresses all four gaps. The literature review confirms that our tool choices (CellCharter, NicheCompass, Niche-DE) and our novel Step 4 (RNA vs protein concordance) fill genuine methodological holes in the field.

Tool Usage Across Papers¶

Tool	Papers Using It	Our Pipeline
cell2location	Peng/Kadara, Klughammer	Not needed (single-cell platforms)
CellChat	Wang/Liu, Prakrithi, (Arora likely)	Considered; chose NicheCompass instead
steinbock + Mesmer	Meyer/Bodenmiller, Wang/Ali	Not needed (10x pipelines handle segmentation)
imcRtools + spicyR	Meyer/Bodenmiller	Not needed (IMC-specific)
Scanpy/AnnData	Wang/Liu, Peng/Kadara, Prakrithi	Yes — data infrastructure
Squidpy	Wang/Liu (likely), Klughammer (likely)	Yes — spatial graph construction
CellCharter	Referenced in Wang/Liu context	Our primary niche tool
NicheCompass	Own methods paper (Birk 2025)	Our communication tool
Niche-DE	Own methods paper (Bai 2024)	Our gene program tool

Gap confirmed

No published cancer study has used CellCharter + NicheCompass + Niche-DE together in a single pipeline. Our combination is novel.

References¶

Liu Y, Sinjab A, et al. Conserved spatial subtypes and cellular neighborhoods of cancer-associated fibroblasts. Cancer Cell 43(5): 905-924 (2025).
Peng F, Sinjab A, et al. Multimodal spatial-omics reveal co-evolution of alveolar progenitors and proinflammatory niches. Cancer Cell 44(2): 321-339 (2026).
Meyer D, et al. A stratification system for breast cancer based on basoluminal tumor cells and spatial tumor architecture. Cancer Cell 43(9): 1637-1655 (2025).
Wang XQ, Danenberg E, et al. Spatial predictors of immunotherapy response in triple-negative breast cancer. Nature 621: 868-876 (2023).
Sorin M, et al. Single-cell spatial landscapes of the lung tumour immune microenvironment. Nature 614: 548-554 (2023).
Arora R, Cao C, et al. Spatial transcriptomics reveals distinct and conserved tumor core and edge architectures. Nature Communications 14: 5029 (2023).
Klughammer J, et al. A multi-modal single-cell and spatial expression map of metastatic breast cancer. Nature Medicine 30: 3236-3249 (2024).
Prakrithi P, Grice LF, et al. Integrating 12 spatial and single cell technologies to characterise tumour neighbourhoods. bioRxiv (2025).

See also: Tool Selection for our recommended pipeline based on these insights.

Literature Review: Spatial Niche Analysis Workflows in Cancer¶

Summary Table¶

Per-Paper Analysis¶

1. Wang/Liu 2025 — CAF Spatial Subtypes¶

2. Peng/Kadara 2026 — Lung Precancer Niche Co-evolution¶

3. Meyer/Bodenmiller 2025 — TNBC Stratification¶

4. Wang/Ali 2023 — Immunotherapy Spatial Predictors¶

5. Sorin 2023 — Lung Immune Landscapes¶

6. Arora 2023 — Tumor Core vs Edge¶

7. Klughammer 2024 — Metastatic Breast Multi-Modal¶

8. Prakrithi 2025 — 12-Technology Skin Cancer¶

Patterns and Gaps¶

What most studies do¶

What few studies do (our opportunity)¶

Key insight for our hackathon¶

Tool Usage Across Papers¶

References¶