Sequencing & Assembly

Whole Genome Shotgun Overview

This genome was sequenced using 454 Whole Genome Shotgun methodology, whereby:

  1. DNA is shattered into small fragments (~0.6kb or~3kb)
  2. 0.6kb fragments are tailed with 454 sequencing adapters
  3. 3kb fragments are circularized on a biotinylated linker, circles are sheared, fragments containing biotinylated linker are retrieved and tailed with 454 sequencing adapters.
  4. Adapterized fragments are sequenced from one end, creating fragment or paired reads.

Newbler (454 Life Sciences) Assembly

  1. The assembly process uses fragment and paired reads to identify contiguous stretches of sequence (contigs)
  2. Contigs are ordered and linked together into larger supercontigs by using paired reads lying in different contigs

assembly graphic

For more info on 454 and Newbler assembler see: http://www.454.com

Supercontig/Contig Numbering

  • Supercontig and contig numbers are preceded by the version of the assembly. For example:
    • Contig 1.25 - refers to contig number 25 within assembly 1.
    • Supercontig 1.2 - refers to supercontig number 2 within assembly 1.
  • Supercontigs are numbered in order of decreasing length. For example, supercontig 1.1 is the largest, and supercontig 1.2 is the next largest.
  • Contigs within supercontigs are ordered positionally. For example, supercontig 1.1 contains contigs 1,2,3... (in that order).

There is no correspondence between contig or supercontigs numbers in different assemblies.