This website allows you to browse known and discovered motifs for the all the experimental datasets.

Send any questions/comments to Pouya Kheradpour.

Last modified on 2011-11-23. Previous versions: 201006freeze 201101freeze-run1

  1. Each experiment is put into a "factor group" on the basis of ChIP TF and its known motifs with the intention to group factors with very similar motifs. Known motifs are assigned to the factor group using the same critera.
  2. The navigation is done in the left frame. For each factor group it indicates the the number of known motifs, discovered motifs, and experimental datasets. Clicking on the headers will resort the table.
  3. This is the README page and the performance page indicates how often the top motif found for a factor group matches (correlatoin at least 0.75) any known motif assigned to that factor group.
  4. The "discovered motifs" for each factor group are the top 10 (in terms of enrichment in their discovery dataset using the Intergenic background) where no two are more than 0.75 similar to each other (this prevents very similar variants of the same motif from being taken).
  5. Enrichments are computed by taking the fraction of motif instances that are inside the bound regions and dividing that by the fraction of shuffle motif instances inside (where the bound regions are filtered against the background regions, defined below). They are also corrected for small counts by using a confidence interval (with Z=1.5) around each fraction and taking the extreme which leads to the enrichment closest to 1.
  6. Clicking on a factor group will change the middle and right frames.
  7. Motifs are matched to the genome using a p-value of 4^-8 using TFM-PVALUE. A custom program is used to do the actual matching.
  8. The motif matches file can be used for carrying out custom analyses (all coordinates are in hg19).