Project Information

The Saccharomyces cerevisiae sequencing project is part of the Broad Institute Fungal Genome Initiative. Its goal is to release an annotated assembly with 8X genome sequence coverage for Saccharomyces cerevisiae isolate RM11-1a. Jackie Whittle, in the laboratory of Dr. Leonid Kruglyak at The Lewis-Sigler Institute for Integrative Genomics and the Department of Ecology and Evolutionary Biology at Princeton University, provided the genomic DNA for the sequencing project.

Our specific aims are as follows:

  1. Generate and assemble sequence reads yielding 8X coverage of the S. cerevisiae genome through whole genome shotgun sequencing.
  2. Perform automated annotation of the sequence assembly.
  3. Distribute the sequence assembly and results of our annotation and analysis through a freely accessible, public web server at the Broad and by deposition of the sequence assembly in GenBank.

We produced whole genome shotgun sequence from two plasmid libraries (4kb and 10kb inserts) and a Fosmid library. The resulting 8X assembly was made public in January of 2005, and the results of automated genome annotation are now available in the new September 2006 release.

Data Releases

September 2006 - Release 2 provides the results of automated genome annotation and analysis.

January 2005 - Release 1 consists of a 10X whole-genome shotgun assembly generated at the Broad.

Questions about the project should be directed to annotation-webmaster(at)broad.mit.edu.

Saccharomyces cerevisiae RM11-1a -- a natural isolate

micrographs

The top image is a confocal micrograph of Elo3-GFP, effectively labeling the endoplasmic reticulum.
The bottom image shows Nile red staining of lipid droplets in 'obese yeast,' which are unable to degrade triglycerides stored in lipid droplets.
(References: Kals et al., YEAST, 2005, Natter et al., Mol Cell Prot., 2005)
These images were kindly provided by Sepp Kohlwein of the Yeast Genetics and Molecular Biology Group at the University of Graz, Austria.

Saccharomyces cerevisiae is arguably the most important model organism for studies of genetics and eukaryotic biology. As the first eukaryote to have its genome sequenced, it has also become the model of choice for functional and comparative genomics. To date, the sequence of only a single laboratory strain of S. cerevisiae, S288C, is available. The sequence of an independent natural isolate will greatly enhance biological, genomic, and evolutionary studies. High-quality draft sequence of three related Saccharomyces species was recently generated (Kellis et al. 2003). The sequence divergence between S. paradoxus (the closest of these species) and S. cerevisiae is 20%, considerably greater than that between human and rhesus macaque. In contrast, the sequence divergence between RM11 and S288C is estimated to be 0.5-1%, approaching that between human and chimp. This sequence variation is distributed throughout the genome, confirming that RM11 shares no recent history with S288C.

The sequence of a natural isolate will allow identification of mutations specific to S288C on a genome-wide basis and enable studies of the true wild-type alleles, as well as shedding light on natural genetic variation within S. cerevisiae. The sequence of RM11 will also identify S. cerevisiae genes deleted in S288C and quantify the rate of intraspecific telomeric "genome churning." RM11 has been used as a model for mapping loci that affect gene expression and other complex phenotypes, and the sequence will greatly facilitate positional cloning of the genes involved.

RM11-1a is a haploid derivative of Bb32(3), a natural isolate collected by Robert Mortimer from a California vineyard. It has high spore viability (80-90%) when crossed with different lab strains. Strains of both mating types and with a number of auxotrophic markers are available. RM11 has been subject of extensive phenotypic characterization, including growth under a wide range of conditions and gene expression profiling. Measurements of gene expression in RM11, S288C, and segregants from a cross between them show that 1/3-1/2 of the genome is differentially expressed, and that these differences are due to at least 500-1000 separate loci, some of which affect expression in cis and others in trans. Sequence of RM11 will greatly facilitate rapid identification of these loci and comprehensive characterization of regulatory variation. RM11 also has significantly longer life span than laboratory yeast strains and accumulates age-associated abnormalities at a lower rate.