Deep interpretable perturbation modeling in single cell genomics¹; Learning cell communication from spatial graphs of cells²

Technical University of Munich

Institute of Computational Biology, Helmholtz Zentrum München

¹Recent advances in multiplexing single-cell transcriptomics across experiments are enabling the high-throughput study of drug and genetic perturbations. However, an exhaustive exploration of the combinatorial perturbation space is experimentally unfeasible, so computational methods are needed to predict, interpret and prioritize perturbations. Here, we present the Compositional Perturbation Autoencoder (CPA), which combines the interpretability of linear models with the flexibility of deep-learning approaches for single-cell response modeling. CPA encodes and learns transcriptional drug response across different cell types, doses, and drug combinations. The model produces easy-to-interpret embeddings for drugs and cell types, allowing drug similarity analysis and predictions for unseen dosages and drug combinations. We show CPA accurately models single-cell perturbations across compounds, dosages, species, and time. We further demonstrate that CPA predicts combinatorial genetic interactions of several types, implying it captures features that distinguish different interaction programs. Finally, we demonstrate CPA allows in-silico generation of 5,329 missing combinations (97.6% of all possibilities) with diverse genetic interactions. We envision our model will facilitate efficient experimental design by enabling in-silico response prediction at the single-cell level.

²I will discuss statistical dependencies between molecular cells states in space based. In particular, I will discuss node-centric expression modeling (NCEM), a computational method based on graph neural networks as a means of reconciling variance attribution and cell-cell communication modeling. We use these models in varying complexity across spatial assays, such as immunohistochemistry and MERFISH, and biological systems, to demonstrate that the statistical cell–cell dependencies discovered by NCEMs are plausible signatures of known molecular processes underlying cell communication. Altogether, this graphical model of cellular niches is a step towards understanding emergent tissue phenotypes. Statistically, this can be interpreted as NCEMs providing a means for replacing the i.i.d. assumption on cellular vectors, that is commonly used in models of scRNA-seq data, with structured dependencies between cells.

Related Resources

¹GitHub facebookresearch/CPA

¹Preprint: "Learning interpretable cellular responses to complex perturbations in high-throughput screens"

¹GitHub: theislab/scarches

¹Paper: "Mapping single-cell data to reference atlases by transfer learning"

¹GitHub: theislab/scgen

¹Paper: "scGen predicts single-cell perturbation responses"

²GitHub: NCEM

²Preprint NCEM: "Learning cell communication from spatial graphs of cells"

Related Resources

MIA Talks Search