You are here

PLoS Comput Biol DOI:10.1371/journal.pcbi.1000067

Performance and scalability of discriminative metrics for comparative gene identification in 12 Drosophila genomes.

Publication TypeJournal Article
Year of Publication2008
AuthorsLin, MF, Deoras, AN, Rasmussen, MD, Kellis, M
JournalPLoS Comput Biol
Date Published2008 Apr 18
KeywordsAnimals, Base Sequence, Chromosome Mapping, Discriminant Analysis, Drosophila, Drosophila Proteins, Genetic Variation, Molecular Sequence Data, Open Reading Frames, Reproducibility of Results, Sensitivity and Specificity, Sequence Analysis, DNA, Species Specificity

Comparative genomics of multiple related species is a powerful methodology for the discovery of functional genomic elements, and its power should increase with the number of species compared. Here, we use 12 Drosophila genomes to study the power of comparative genomics metrics to distinguish between protein-coding and non-coding regions. First, we study the relative power of different comparative metrics and their relationship to single-species metrics. We find that even relatively simple multi-species metrics robustly outperform advanced single-species metrics, especially for shorter exons (


Alternate JournalPLoS Comput. Biol.
PubMed ID18421375
PubMed Central IDPMC2291194