Please ensure that you have followed the installation guide, to ensure the program runs as intended.
Affy 5.0
Affy 6.0
Illumina 610
Illumina 1M
Illumina 650
Birdsuite should be run one chemistry plate at a time (~96 samples.) You'll need to batch up your CEL files to do this. Running birdsuite on multiple chemistry plates at one time will result in poor performace due to batch effects that we can not resolve, due to structure in the data you're introducing. We prefer that you run birdsuite on at least 40 samples, as many of the methods employed utilize clustering algorithms, which function better with a fair number of exemplars.
There are other problems that can result from running too many samples - out of memory errors, corrupted files, etc. So please batch your data appropriately.
We've found 5 Gigs of ram to be optimal for speed. You can get away with 4. You'll need about 2 G of hard drive room for the results of each batch. We have no processor requirements, other than knowing that 64 bit CPUs tend to run much better/faster for this application.
We support the linux operating system, and provide source code. Some users have successfully gotten Birdsuite to run on non-Unix enviornments, but they've had to build from source. We only test our software on linux, so we can't guarantee the results. If you build the code on your system and get the same results on the test data set as we distribute, you're doing it right!
Once you change the code (beyond what's detailed in the install instructions), we'll no longer provide support for you, unless we explicitly ask you to do so to test a change. We take user input and bugs, and add them to our version of the code, so everyone can benefit
from the problems you may find and diagnose. We appreciate your input, but attempts to change the inner workings of this quite complicated software package can be difficult and problematic.
Birdsuite is designed to be aware of copy number variable regions in the genome. If a SNP overlaps one of these regions, it's possible to call this SNP with knowledge of that event. If the SNP overlaps a deletion, you could have calls of -/-, A/-, T/-, A/T, AA, T/T.
Birdseed is not aware of these differences, so classes like A/- are more likely to be either miscalled, or not called at all, since they generally fall outside of the expected cluster locations.
In non copy number variant regions, birdseed and birdsuite produce the same SNP genotypes.
Birdsuite will have both more calls (as it can call SNPs in copy number variant regions) and more accurate calls, as it calls SNPs in copy number variant regions with the correct genotype, instead of a no-call or the wrong cluster assignment.
There is! Please see our analysis section, which contains links to a set of programs that will convert birdsuite output to plink compatible input files.
At the Broad, we put each plate onto a separate node of our 10 slot cluster queue, so we can process somewhere on the order of 40-60 plates a day.
When you send a crash or install question for us, it's impossible for us to help you without receiving some information about what happened at the time of the crash. In order to get support, please provide the following:
Failure to submit this information can greatly delay our response to your issue.
Affy's APT software seems to work slightly differently with each release. We're been using version 1.8.6 internally, which seems to do the right thing. Versions between 1.8.6 and 1.10.2 seem to produce odd results, or don't extract all the information birdsuite needs (for
example, APT 1.10.1 does not extract any of the SNP info.) Use version 1.8.6. Crashes can occur far downstream (divide by 0 error on birdsuite reports, caused by having no data through the entire run.)
Birdsuite changed in a fundamental way in version 1.5.5. Python scripts now need to be installed. Please read the python install documentation here [CLICK THAT LINK!] to install them correctly
Birdsuite expects multiple CEL files, because it's a clustering based system (see FAQ question #1.) Birdsuite will crash. We suggest running at least 20 CEL files at one time.
For example, a user has written in to comment on using sh:
We had to modify the run_birdseye.sh, because the original one didn't recognized the [[ ]] in sh interpreter.
We changed the double brackets for single ones [ ] and it worked.
We may change the bash scripts to python in a future release to avoid these problems.
./bin/birdsuite.sh --basename=test --chipType=GenomeWideSNP_6 --outputDir=output --genderFile=test.gender --celFiles=test.cels --noLsf --apt_probeset_summarize.force
If you omit the "=", Birdsuite's argument parser things you're telling it the value of the argument is true. So --genderFile test.gender would make birdsuite believe the value of --genderFile is true, and it's hard to parse true as a file.
Also note that there is no space between the argument name, the "=", and the value.
--genderFile =test.gender
will also break birdsuite.
Common causes of canary crashing:
Run the appropriate number of samples in birdsuite (see FAQ question #1)
Run the appropriate number of samples in birdsuite (see FAQ question #1)
This is an R error. We're not sure of how this bug is occurring, as we're unable to recreate it. It may revolve around how you have R installed in your enviornment, if you've installed the R libraries correctly, or if you override the R_LIBS path with your own path and prevent birdsuite from seeing the canary libraries. If you have your own R_LIBS path you can try installing the Birdsuite R libraries there and see if Birdsuite is able to come up with them.
If you're able to reproduce this bug on your system and find a solution, please let us know so we can add it here.