Gene Content of the Human Genome
This site (still in development) contains the results of a detailed analysis of the gene content of the human genome. A full description of this is contained in the paper 'Gene Content of the Human Genome' by Clamp et al (submitted).
This site enables people to access a range of properties for each of the 22,218 protein coding genes in Ensembl v35.
You can search for a single gene, sets of genes and genomic regions. More specifically you can search by :
- Ensembl id e.g. ENSG00000126550
- Refseq identitfier e.g. NM_001498
- Pfam domain e.g. GCS, KRAB, kinase
- Genomic region in the format chrN:start-end e.g. chr2:10004000-11200000
| Valid genes | Gene count | Invalid genes | Gene count |
|---|---|---|---|
| ortholog | 18868 | transposon | 499 |
| cross species paralog | 147 | pseudogene | 1069 |
| human specific paralog | 51 | orphan | 1169 |
| pfam | 36 | redundant | 197 |
| functional transposon | 2 | artifact | 77 |
| functional pseudogene | 4 | ||
| orphans with protein evidence | 8 | ||
| special regions | 91 | ||
| Total | 19207 | Total | 3011 |