Sequencing & Assembly

Methodology Overview

The Mesoplasma florum genome was sequenced using the Whole Genome Shotgun methodology, whereby:

  1. Mesoplasma florum  DNA is shattered into small fragments (~4 kb or ~40 kb)
  2. Each fragment is inserted into a vector and cloned
  3. The two ends of the fragment are sequenced, creating paired reads
  4. The assembly process uses the paired reads to identify contiguous stretches of sequence (contigs)
  5. Contigs are ordered and linked together into larger supercontigs by using paired reads lying in different contigs

Assembly Data

Assembly 1, 10/04/2004

Sequencing Facts

  • Total length of assembly: 793.224 Kb
  • >10X sequencing coverage of the genome
  • 1 contigs in 1 scaffolds (supercontigs)
  • 793.224 Kb total length of combined contigs (793,224 bp)

Supercontig/Contig Numbering

  • Supercontig and contig numbers are preceded by the version of the assembly. For example:
    • Contig 1.25 - refers to contig number 25 within assembly 1.
    • Supercontig 1.2 - refers to supercontig number 2 within assembly 1.

  • Supercontigs are numbered in order of decreasing length. For example, supercontig 1.1 is the largest with 793.224 Kb, and supercontig 1.1 is the smallest with 793.224 Kb.

  • Contigs within supercontigs are ordered positionally. For example, supercontig 1.1 contains contigs 1,2,3...1 (in that order).

    There is no correspondence between contig or supercontigs numbers in different assemblies.

Library Clones

We end-sequenced the following types of libraries: Plasmid

 

Library Name

# Clone ends mapped to Assembly 1

Plasmid

22,631

total

22,631