Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq.

Nat Methods
Authors
Keywords
Abstract

Massively parallel single-cell and single-nucleus RNA sequencing has opened the way to systematic tissue atlases in health and disease, but as the scale of data generation is growing, so is the need for computational pipelines for scaled analysis. Here we developed Cumulus-a cloud-based framework for analyzing large-scale single-cell and single-nucleus RNA sequencing datasets. Cumulus combines the power of cloud computing with improvements in algorithm and implementation to achieve high scalability, low cost, user-friendliness and integrated support for a comprehensive set of features. We benchmark Cumulus on the Human Cell Atlas Census of Immune Cells dataset of bone marrow cells and show that it substantially improves efficiency over conventional frameworks, while maintaining or improving the quality of results, enabling large-scale studies.

Year of Publication
2020
Journal
Nat Methods
Volume
17
Issue
8
Pages
793-798
Date Published
2020 08
ISSN
1548-7105
DOI
10.1038/s41592-020-0905-x
PubMed ID
32719530
PubMed Central ID
PMC7437817
Links
Grant list
HHMI / Howard Hughes Medical Institute / United States
T32 HG002295 / HG / NHGRI NIH HHS / United States
RC2 DK116691 / DK / NIDDK NIH HHS / United States
RM1 HG006193 / HG / NHGRI NIH HHS / United States
T32 CA207021 / CA / NCI NIH HHS / United States