Production Data Processing

The amount of genomic data produced worldwide is growing at a pace that far exceeds Moore’s Law, doubling every eight months on average. The Broad alone produces nearly 20 terabytes of genomic data every day.

Processing that volume of data is labor- and resource-intensive. For that reason, DSP leverages the scalability and flexibility of cloud computing to carry out initial genomic data production processing: the translation of raw sequencing machine data into sequence reads, which are further examined using the platform’s analytical tools. In doing so, DSP has created and is maintaining and evolving a robust, secure ecosystem for data processing that feeds scientists at the Broad and elsewhere high-quality alignments and quality metrics from which to glean new insights.