Sample Information File  Print-icon

Note: If you are using Excel to edit GenePattern files, be sure to save the file as a tab-delimited text file and supply the correct file extension. You can specify the file name in quotes to prevent Excel from appending .txt to the file name. Also, note that Excel's auto-formatting can introduce errors in gene names, as described in Zeeberg, et al (2004).

Sample Information File Format (.txt)

The sample information file is a tab-delimited format that describes a set of SNP arrays. The column labels in the first row define the information provided for each array; each subsequent row describes one SNP array. The sample information file is organized as follows:

  1. The first line contains the column labels. A sample information file can contain any number of columns and the column labels are arbitrary. However, SNP modules may require specific labels, as discussed below.
    • Line format:
      Label-1 (tab) Label-2 (tab) ... Label-n
    • For example:
      Array (tab) Sample (tab) Type (tab) Ploidy(numeric) (tab) Gender (tab) Paired (tab) Platform
  2. The remainder of the sample information file contains a line of information for each SNP sample. Where data is unavailable, columns may be empty.
    • Line format:
      Col-1- data (tab) Col-2-data (tab) ... Col-N-data
    • For example:
      S004274N_250S_123005 (tab) S004274N (tab) Normal (tab) 2 (tab) (tab) (tab) 250K_Sty

A sample information file can contain any number of columns and the column labels are arbitrary. A SNP analysis module, however, may require a sample information file to include specific column labels. For example, the SNP module CopyNumberDivideByNormals requires a sample information file that includes two columns, Sample and Ploidy(numeric). Following is a list of commonly used column labels:

Note: When a SNP module requires a sample information file to include specific column labels, the module documentation lists the required column labels. Specify required column labels exactly: they are case-sensitive and space-sensitive.

Sample .txt file: 250K_Sampleinfofile.txt

Creating a Sample Information File

The following steps outline how to copy exactly sample identifiers from Excel data and tranpose them from horizonal to vertical.

  1. In Excel, Select entire row containing sample names and Copy. Open a new workbook, Paste Special>Transpose.
  2. If starting from a RES file, to remove blank rows, Select relevant column(s), then click Edit>Go To>Special button>Blanks option and click OK. Blank rows will be selected. Choose Edit>Delete>Entire row option and click OK.

  1. Label row headings exactly as specified for module, fill in cells, and save as tab delimited text (.txt).  For example, ComBat module labels first three cells of Row 1: “Array”, “Sample”, and “Batch”.

 

<< RES Up TXT >>

Updated on May 29, 2014 11:14