Benchmarker: An Unbiased, Association-Data-Driven Strategy to Evaluate Gene Prioritization Algorithms.

Am J Hum Genet

Authors	Rebecca Fine Tune Pers Tiffany Amariuta Soumya Raychaudhuri Joel Hirschhorn
Abstract	Genome-wide association studies (GWASs) are valuable for understanding human biology, but associated loci typically contain multiple associated variants and genes. Thus, algorithms that prioritize likely causal genes and variants for a given phenotype can provide biological interpretations of association data. However, a critical, currently missing capability is to objectively compare performance of such algorithms. Typical comparisons rely on "gold standard" genes harboring causal coding variants, but such gold standards may be biased and incomplete. To address this issue, we developed Benchmarker, an unbiased, data-driven benchmarking method that compares performance of similarity-based prioritization strategies to each other (and to random chance) by leave-one-chromosome-out cross-validation with stratified linkage disequilibrium (LD) score regression. We first applied Benchmarker to 20 well-powered GWASs and compared gene prioritization based on strategies employing three different data sources, including annotated gene sets and gene expression; genes prioritized based on gene sets had higher per-SNP heritability than those prioritized based on gene expression. Additionally, in a direct comparison of three methods, DEPICT and MAGMA outperformed NetWAS. We also evaluated combinations of methods; our results indicated that combining data sources and algorithms can help prioritize higher-quality genes for follow-up. Benchmarker provides an unbiased approach to evaluate any similarity-based method that provides genome-wide prioritization of genes, variants, or gene sets and can determine the best such method for any particular GWAS. Our method addresses an important unmet need for rigorous tool assessment and can assist in mapping genetic associations to causal function.
Year of Publication	2019
Journal	Am J Hum Genet
Date Published	2019 Apr 24
ISSN	1537-6605
DOI	10.1016/j.ajhg.2019.03.027
PubMed ID	31056107
Links	PubMed Google Scholar DOI

Recent Broad Publications

An esophagus cell atlas reveals dynamic rewiring during active eosinophilic esophagitis and remission.

Genome-wide association analyses identify 95 risk loci and provide insights into the neurobiology of post-traumatic stress disorder.

ACE inhibitors and angiotensin receptor blockers differentially alter the response to angiotensin II treatment in vasodilatory shock.

Bioinformatics pipeline for the systematic mining genomic and proteomic variation linked to rare diseases: The example of monogenic diabetes.

Multi-ancestry meta-analysis of tobacco use disorder identifies 461 potential risk genes and reveals associations with multiple health outcomes.