Spatial Transcriptomics: Key Points

Spatially Resolved Transcriptomics in Life Sciences Research

Spatial transcriptomics provides the location of cells relative to neighboring cells and cell structures.
A cell’s location is useful data for describing its phenotype, state, and cell and tissue function.
Spatial transcriptomics addresses a key obstacle in bulk and single-cell RNA sequencing studies: their loss of spatial information.
The main goal of spatial transcriptomics studies is to integrate expression with spatial information.

The 10x Space Ranger pipeline provides you with an unfiltered and a filtered data file.
The HDF5 file ends with an h5 extension and contains the barcodes, features (genes), and counts matrix.
Seurat is one of several popular environments for analyzing spatial transcriptomics data.
It is important to know something about the structure of the tissue which you are analyzing.
Plotting total counts and genes in each spot may help to identify quality control issues.

Spot filtering should be light.
Inspect the spots that you are filtering to confirm that you are not discarding important tissue structures.

Normalization is necessary to deal with technical artifacts, in particular, varying total counts (library size) across spots and the dependence of a gene’s variance on its mean expression.
Nevertheless, raw (unnormalized) data can provide biologically meaningful insights such as region-specific differences in cell type or density that impact total reads.
Popular methods aim to address these artifacts and include, but are not limited to, CPM normalization, log normalization, and SCTransform.
The ability of a normalization method to stabilize expression variance across its mean (i.e., to remove the dependence of the former on the latter) can be assessed visually with a mean-variance plot.

Feature selection is a crucial step in spatial transcriptomics analysis, particularly for non-variance-stabilizing normalization methods like NormalizeData.
Techniques such as VST and mean-variance plotting enable researchers to focus on genes that provide the most biological insight.
Different proportions of highly variable genes and feature selection methods can significantly influence the analytical outcomes, emphasizing the need for tailored approaches based on the specific characteristics of each dataset.
Linear dimensionality reduction methods like PCA are crucial for initial data simplification and noise reduction.
Nonlinear methods like UMAP are valuable for detailed exploration of data structures post-linear preprocessing.
The sequential application of PCA and UMAP can provide a comprehensive view of the spatial transcriptomics data, leveraging the strengths of both linear and nonlinear approaches.

Deconvolution enhances spatial transcriptomics by quantifying the different cell types within spatial spots.
Integrating scRNA-seq data with spatial transcriptomics data facilitates accurate deconvolution.
RCTD is a supervised deconvolution method that quantifies the proportion of different cell types in spatial transcriptomics data.

Differential expression testing pinpoints genes with significant expression variations across regions or clusters.
Moran’s I statistic reveals spatial autocorrelation in gene expression, critical for examining spatially dependent biological activities.
Moran’s I algorithm effectively identifies genes expressed in anatomically distinct regions, as validated from the correlation analysis with the DE genes from the annotated regions.

There are many decisions which need to be made in spatial transcriptomics.
It is essential to have an understanding of the tissue morphology before proceeding with a spatial transcriptomics analysis.