Download Sequence


Choose compression type: .zip .gz

Sequence Downloads

supercontigs.fasta
contigs.fasta
contigs.agp
All File Types
M. tuberculosis F11
M. tuberculosis Haarlem
M. tuberculosis KZN 4207 (DS)
M. tuberculosis KZN 1435 (MDR)
M. tuberculosis KZN 605 (XDR)
M. tuberculosis C
M. tub. 98-R604 INH-RIF-EM
M. tuberculosis W-148
All Assemblies->

Gene Downloads

genes.fasta
transcripts.fasta
transcripts.gtf
proteins.fasta
proteins_stops.fasta
pfam_to_genes.txt
genes_upstream_1000.fasta
genes_upstream_utr_1000.fasta
genes_downstream_1000.fasta
genes_downstream_utr_1000.fasta
genome_summary.txt
genome_summary_per_gene.txt
All File Types
M. tuberculosis F11
M. tuberculosis Haarlem
M. tuberculosis KZN 4207 (DS)
M. tuberculosis KZN 1435 (MDR)
M. tuberculosis KZN 605 (XDR)
M. tuberculosis C
M. tub. 98-R604 INH-RIF-EM
M. tuberculosis W-148
All Assemblies->

File Descriptions

File NameFile Description
supercontigs.fastaDownload nucleotide sequence by supercontig.
contigs.fastaDownload nucleotide sequence by contig.
contigs.agpAGP file containing supercontig & contig details, including contig size, contig position, gap size and linkage information.
genes.fastaDownload nucleotide sequence of gene predictions, including untranslated regions. (one entry per gene)
transcripts.fastaDownload nucleotide sequence of transcripts, with all non-coding sequences removed. (may be more than one transcript per gene)
proteins.fastaDownload amino acid sequence of proteins predicted to be encoded by the transcripts. (may be more than one protein per gene)
proteins_stops.fastaThis file contains protein sequences which either contain internal stops, or are derived from coding regions which are not multiples of 3. These translations should be treated as suspect.
genes_upstream_1000.fastaThe nucleotide sequence 1000 nucleotides before the start codon for each gene prediction
genes_downstream_1000.fastaThe nucleotide sequence 1000 nucleotides after the stop codon for each gene prediction