What's New

5/9/2007    Protein-coding gene predictions overlapping ribosomal RNA gene regions flagged

The loci for the following protein-coding gene predictions have been flagged due to the fact that they overlap regions of genome predicted to contain ribosomal RNA genes. They have been flagged as "overlaps ribosomal RNA region" on the Gene Details reports and have been segregated into separate files in the Downloads page and on the BLAST page. 59 loci were affected; see the Downloads page for a complete list in FASTA format.

10/04/05    Release of the results of automated annotation

A full summary can be found here. This annotated release presents:

  • A predicted protein gene set (16,448 genes)
  • Protein BLAST databases
  • Precomputed results of BlastX and HMMer Analyses
  • Alternative proteins predicted by various gene callers (Genewise, FgeneSH,FgeneSH+, GeneID)
  • tRNA predictions
  • Search and visualizations for all the features

8/10/2005    Initial release of 5.4X sequence assembly

The sequence traces from the Broad Institute sequencing can be downloaded from the NCBI trace repository.

Important information about this release can be found here.

This initial release allows:

  • BLAST searches against our 5.4X genome assembly
  • Downloads of the consensus sequence and additional files for the genome assembly
  • Search and download a particular region of the genome assembly

There will be a second upcoming release containing the annotation of predicted genes and other genomic features.

Assembly Data

6/15/2005    Assembly 1

Sequencing Facts

  • 5.4X sequencing coverage of the genome
  • 4534 contigs in 588 supercontigs (scaffolds)
  • 8.6 Kb average contig length (range 389 bp - 104.3 Kb)
  • 72.6 Kb average supercontig length (range 498 bp - 1.3 Mb)
  • 38.8 Mb total length of combined contigs (38,786,820 bp)
  • Average base lies in a contig of length 16.4 Kb
  • Average base lies within a supercontig of length 256.7 Kb