Various data for use in assessing call sets

From GSA


We have collected various files that can be used to evaluate a set of calls. All of the data is located under the following directory (accessible only from within the Broad):

/humgen/gsa-hpprojects/GATK/data/Comparisons/

The Validated/ directory contains the dbSNP rods, the HapMap genotypes, and various chip and Sequenom validation results for SNPs and short indels.

The Unvalidated/ directory contains various high-quality call sets, including those from the pilot phase of the 1000 Genomes Project. It also contains several call sets made specifically on NA12878 (for example, from Complete Genomics data).
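
To see which comparison files are actually available, the two subdirectories can be walked with a short script. The following is only a minimal sketch: the root path and the Validated/ and Unvalidated/ names come from this page, while the function name list_comparison_files is hypothetical, and nothing is assumed about the file names or formats inside each directory.

```python
from pathlib import Path

# Root of the comparison data as given on this page (Broad-internal filesystem).
COMPARISONS_ROOT = Path("/humgen/gsa-hpprojects/GATK/data/Comparisons")


def list_comparison_files(root=COMPARISONS_ROOT):
    """Map each subdirectory name to a sorted list of its files (relative paths).

    Hypothetical helper for illustration; a missing subdirectory yields an
    empty list rather than an error.
    """
    listing = {}
    for name in ("Validated", "Unvalidated"):
        subdir = Path(root) / name
        if not subdir.is_dir():
            listing[name] = []
            continue
        # Walk the subdirectory recursively and keep regular files only.
        listing[name] = sorted(
            str(p.relative_to(subdir)) for p in subdir.rglob("*") if p.is_file()
        )
    return listing


if __name__ == "__main__":
    for name, files in list_comparison_files().items():
        print(f"{name}/: {len(files)} file(s)")
        for f in files:
            print("  " + f)
```

Run from a machine that mounts the directory above, this prints one block per subdirectory. On any other machine both lists are simply empty.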
