Interpreting short tandem repeat variations in humans using mutational constraint.

Nat Genet
Authors
Keywords
Abstract

Identifying regions of the genome that are depleted of mutations can distinguish potentially deleterious variants. Short tandem repeats (STRs), also known as microsatellites, are among the largest contributors of de novo mutations in humans. However, per-locus studies of STR mutations have been limited to highly ascertained panels of several dozen loci. Here we harnessed bioinformatics tools and a novel analytical framework to estimate mutation parameters for each STR in the human genome by correlating STR genotypes with local sequence heterozygosity. We applied our method to obtain robust estimates of the impact of local sequence features on mutation parameters and used these estimates to create a framework for measuring constraint at STRs by comparing observed versus expected mutation rates. Constraint scores identified known pathogenic variants with early-onset effects. Our metric will provide a valuable tool for prioritizing pathogenic STRs in medical genetics studies.

Year of Publication
2017
Journal
Nat Genet
Volume
49
Issue
10
Pages
1495-1501
Date Published
2017 Oct
ISSN
1546-1718
DOI
10.1038/ng.3952
PubMed ID
28892063
PubMed Central ID
PMC5679271
Links
Grant list
R01 GM100233 / GM / NIGMS NIH HHS / United States
R01 HG006399 / HG / NHGRI NIH HHS / United States
U01 MH105669 / MH / NIMH NIH HHS / United States