Frequently Asked Questions
Questions
Sequencing
- How big is the Chaetomium globosum genome?
- Where can I obtain the strain?
- What is the current state of the assembly?
- How has the sequence been generated for the Chaetomium globosum project?
- Will the genome be finished?
- How will we know the assembly is correct?
- What data are available?
- This sequence release looks different from previous releases, like Neurospora crassa. What's different?
Misc
Answers
Sequencing
- How big is the Chaetomium globosum genome?
Our current total unique contig length of 34,336,862 bp (base pairs) suggests the genome is approximately 36 Mb.
- Where can I obtain the strain?
The strain is available from The Centraalbureau voor Schimmelcultures (CBS). This strain is listed in the catalogues of a number of other culture collections under different strain numbers (= ATCC 6205; CBS 161.52; CEB 1218.1; CEB 1218.2; CECT 2701; DSM 1962; IFO 6347; IHEM 3826; IMI 045550; IMI 045550ii; IMI 045550iv; IMI 045550iii; MUCL 1984; NRRL 1870; QM 459; UPSC 3159; USDA 1042.4; VTT D-81079).
- What is the current state of the assembly?
The current assembly contains 1245 sequence contigs in 37 supercontigs (scaffolds). There are no current plans for additional sequencing or finishing. See Assembly for detail.
- How has the sequence been generated for the Chaetomium globosum project?
Our data consist of over 455,000 individual sequencing reads obtained by sequencing each end of plasmids and Fosmids from libraries containing randomly sheared fragments of 4 kb, 10 kb, and 40 kb average size respectively. See Assembly for details.
- Will the genome be finished?
Unfortunately there are no plans to finish this genome at this time.
- How will we know the assembly is correct?
The quality of the assembly will be assessed in several ways. In addition to requiring that the paired plasmid and Fosmid ends occur in a logical manner, our assembly of the Chaetomium globosum genome will be verified through comparison with available genomic sequences.
- What data are available?
In this version of our data release, all sequence contigs and supercontigs are available. Sequence data can be accessed in several ways: either through a searching with BLASTN or TBLASTN, retrieving of a specific region of the assembly, or by downloading the entire genome. Supercontig and contig sequences are subject to change throughout this project, so each data release version number will be appended to the contig number as a prefix (e.g. 1.1 denotes assembly version 1, supercontig #1).
Fosmid clones have been integrated into the current assembly, and you can search and view the locations of these clones within the sequence supercontigs.
A fasta file of raw reads excluded from the assembly is also available for BLAST and download. Also available for download are an AGP file describing the supercontigs and contigs in this assembly and a file listing coordinates of paired endreads.
- This sequence release looks different from previous releases, like Neurospora crassa. What's different?
Important information about this release can be found here.
Misc
- How do I cite the sequence for publication?
Publications should include the following citation:
Botrytis cinerea Sequencing Project. Broad Institute of Harvard and MIT (http://www.broad.mit.edu) - Who do I contact with questions about the sequencing?
Our email address is annotation-webmaster(at)broad.mit.edu.
