




| 1.1. | What is the latest version of GenePattern? |
| A. |
For information about the latest version of GenePattern and its components, see the Release Notes. |
| 1.2. | Where can I find the hardware and software prerequisites for GenePattern? |
| A. | The Release Notes list hardware requirements, supported operating systems, and supported browsers. |
| 1.3. | Where can I find the GenePattern release notes? |
| A. | Click here for the latest release notes. |
| 1.4. | How can I get help with GenePattern or provide feedback? |
| A. |
In addition to this FAQ, the GenePattern team provides the following online resources:
To provide feedback or ask a question not addressed by the online resources, send email to gp-help(at)broadinstitute.org. |
| 1.5. | How do I cite GenePattern? |
| A. |
To cite GenePattern, please use the following citation: To cite a GenePattern analysis or visualization module, cite the GenePattern software and the original paper or other source for the module as specified in the module documentation. Documentation for each module is available on the Modules page and in GenePattern (click Help when prompted to enter the module's parameters). |
| 1.6. | If I am a member of the press, how can I get more information? |
| A. |
If you are a member of the press and need additional information about GenePattern, please contact our Public Relations department. |
| 1.7. | What can I do if I my job keeps running out of memory or fails with no error on the public server? |
| A. |
Sometimes jobs with large datasets or parameter settings which cause greater computational load on the system can fail because they ran out of memory. Usually you will get an error message stating that the module ran out of memory, though sometimes this can cause a "silent" failure; meaning that you see that your job failed, but there are no error files. In either case, GenePattern administrators can track down and resolve the problem. GenePattern administrators can assign memory settings on a per user basis, allowing your jobs to run with the required amount of memory for your analysis. To have an administrator look into your errors and adjust the memory settings for your jobs please contact us at gp-help(at)broadinstitute.org, making sure to provide your username and job id. |
| 1.8. | Are there other public GenePattern servers? |
| A. |
The public server maintained by the GenePattern development team can be found at http://genepattern.broadinstitute.org. There are also other GenePattern servers maintained by other organizations that have been made publicly available. These organizations include: SMD:Stanford Microarray Database NuGO: NBX Garvan Institute
|
| 1.9. | How can I find out about upcoming workshops or new module releases? |
| A. |
You can follow GenePattern on Twitter, FaceBook, or our RSS feed. Check our Twitter feed in the news items at genepattern.org, join our mailing list, or add yourself to the Workshops notification list. |
| 1.10. | Is there a version of GenePattern that can run in the cloud? |
| A. |
Yes. The GenePattern team is doing final testing on a GenePattern Amazon Machine Instance (AMI) which is expected to be available soon. |
Q. A. |
| 2.1. | How do I install (uninstall) GenePattern? |
| A. |
To install GenePattern, go to GenePattern Download and follow the instructions for your operating system. To uninstall GenePattern, use the utility provided as part of the GenePattern installation. If the GenePattern uninstall utility is unavailable, deleting the GenePattern installation folder removes all GenePattern files other than the desktop icons. Mac users: If R2.5 is not already installed, GenePattern installs it in the /Library/Frameworks/R.framework/Versions/2.5 folder. Uninstalling GenePattern does not uninstall R. To uninstall R, use the utility provided by R. |
| 2.2. | How do I upgrade to the latest version of GenePattern without losing my modules/pipelines/suites? |
| A. |
Simply install the new version of GenePattern into the same directory as your previous version. Do not uninstall first. It is unnecessary and will delete your existing modules, pipelines and suites. When you overwrite the previous version:
User groups: The userGroups.xml file for GenePattern 3.2 omits the group named Public. In GenePattern 3.2, all users are now in a predefined group named Public. To avoid confusion, do not recreate the group named Public. R versions: Installing GenePattern 3.1 (or later) installs R2.5 and sets the full path to R2.5. See Using Different Versions of R for information on how to create and/or use GenePattern modules written for other versions of R. |
| 2.3. | Is the source code available? |
| A. |
Yes. Source code for GenePattern and its modules is available under the GenePattern software license. For access to the source code, email the GenePattern team. |
| 2.4. | I already have R/Perl/Java on my machine. Will the versions of R/Perl/Java that GenePattern installs interfere with these? |
| A. |
No. The R, Perl, and Java installations that come with the GenePattern are installed within the GenePattern directory and do not affect any other versions that you may currently have. |
| 2.5. | Can I configure GenePattern to work with versions of R/Perl/Java other than those installed by GenePattern? |
| A. |
You can configure GenePattern to work with other versions of R/Perl/Java; however, the versions of R, Perl, and Java bundled with GenePattern are the ones that have been fully tested. We cannot guarantee that other versions will work. Java VM: If you install a GenePattern server without the Java VM, choosing instead to use a Java VM that you have already installed, ensure that the file tools.jar (provided by SUN seperately from the JRE and JDK) is on your classpath. When you install a GenePattern server with an included VM, the GenePattern installation does this for you. If this file is not on your classpath, when you attempt to install a module that requires the MatlabComponentRuntime (MCR) Installer, the MCR Installer fails. R versions: GenePattern modules can be written for any version of R. For details on how to specify which version to use, see Using Different Versions of R. |
| 2.6. | Does GenePattern support the international settings on my computer? |
| A. |
GenePattern supports the Basic Latin character set. Characters other than those in the Basic Latin character set may not be displayed correctly. Asian character sets are not currently supported. All analysis and visualization modules support the decimal point (.) as the separator between the integral and fractional parts of a decimal number. Using a decimal comma (,) may cause unexpected behavior in some modules. |
| 2.7. | I have no modules or pipelines installed. Why? How do I get them? |
| A. |
The most likely reason is that when the installer finished, you missed the step to install the modules that came up in a browser window after the installer closed. To install the modules:
|
| 2.8. | I am behind a web proxy/firewall and my GenePattern server says it cannot connect to the module repository to load the modules. What do I do? |
| A. |
If you did not indicate that you were behind a web proxy/firewall when you installed GenePattern, you must update the proxy settings for your server before you can install the modules:
If you still cannot connect to the repository, email us at gp-help(at)broadinstitute.org. |
| 2.9. | I want to install GenePattern into our corporate/departmental/other Web server and not have GenePattern run in its own Web server. How do I install it? |
| A. |
You need to use the war file installation. Instructions are available here. |
| 2.10. | When should I choose to install the GenePattern server on a different port than the default 8080? |
| A. |
If you already have a server such as Tomcat running on this port, you need to install the GenePattern server on a different port to avoid conflicts. |
| 2.11. | How do I install GenePattern on a 64-bit Windows machine if I want to use a version before 3.2.2? |
| A. |
When GenePattern is installed on Windows 64-bit systems in the default C:Program Files (x86) directory, modules fail because of some code that is expecting only "C:Program Files" and then truncates that location to "C:Progr ~1". There is similar bug in ComparativeMarkerSelection. These errors are corrected in the 3.2.2 release of GenePattern. However, if you do not upgrade to release 3.2.2 or after, the work around is to re-install GenePattern in a directory that has no spaces in the name. |
| 2.12. | Why don't MATLAB modules work on my Windows 64 bit GenePattern server? |
| A. |
Installing MATLAB to "Program Files" or "Program Files (x86)" may cause the MATLAB installation to be incomplete. The installation will not fail, but modules will throw an exception, reporting a missing file and then fail. To fix this manually, re-run the MATLAB installer and choose a new installation directory that contains no spaces. To do this you'll need to first uninstall MATLAB, else it will only let you "repair" the installation in the current directory. Steps to uninstall MATLAB are as follows:
2. Rerun the .msi, found in GP/patches/...../*.msi, and choose a new install directory with no spaces - like the default GenePattern install directory at Windows: C:GenePatternServer. If you've uninstalled GP and now need to uninstall the MATLAB that was installed via the now deleted GenePattern:
2. Install MATLAB module 3. Repair installation 4. Run the module to be sure MATLAB installed correctly 5. Use Add/Remove programs to uninstall MATLAB You might need to reboot to get the system to let you uninstall the MCR files. Or you can email us at gp-help(at)broadinstitute.org and we can send you the .msi which you can then point the uninstaller to, through Add/Remove Programs. |
| 2.13. | Why can't I connect to my GenePattern server on Windows 7 or Vista? |
| A. |
On Windows 7 and Vista, the StartGenePatternServer and StopGenePatternServer applications must be run as an administrator. To start or stop the GenePattern server, right-click on StartGenePatternServer.exe or StopGenePattnerServer.exe and select Run as administrator. To launch GenePattern in your browser, you can double-click the GenePatternHome.html icon located with the StartGenePatternServer and StopGenePatternServer icons. |
| 2.14. | How can I set up a GenePattern server for others to use remotely? |
| A. |
There are two useful sections of the GenePattern User Guide that explain how to do this: |
| 2.15. | Why doesn't clicking StartGenePatternServer launch GenePattern? |
| A. |
The StartGenePatternServer application only starts the server. To access the web client interface for your GenePattern server, click the GenePatternHome.html shortcut icon, or, if you did not install icons in your task bar or on your desktop, GenePatternHome.html can be found at the top level of your GenePattern install directory. |
| 2.16. | How can I get R to install correctly on my Mac? |
| A. |
Some Mac users have found that the R library is not installing correctly when they try to install GenePattern. Even after making sure that the folder into which GenePattern is installing R has write permissions, upon running a module, they receive the following error message: java.io.IOException: Cannot run program "/Library/Frameworks/R.framework/Versions/2.5/Resources/bin/R": error=2, No such file or directory while running R command [/Library/Frameworks/R.framework/Versions/2.5/Resources/bin/R, --no--save, --quiet, --slave, --no-restore] This may be a simple GenePattern server configuration problem. First, check that something is installed at that path. Open the Terminal.app and run the following commands: ls /Library/Frameworks/R.framework/Versions/2.5 If there is something installed at this path, then check that the path to R is correctly configured in your GenePattern server. Go to the Administration>Server Settings>Programming Languages GenePattern page and verify that: R 2.5 Home: /Library/Frameworks/R.framework/Versions/2.5/Resources If this is configured correctly, you may be able to correct the problem by manually downloading and installing R 2.5. If you have further issues installing R, contact us at gp-help(at)broadinstitute.org. |
Q. A. |
| 3.1. | How do I increase the memory allocated to the GenePattern server or client? |
| A. | |
| 3.2. | How do I increase the memory allocated to a module? |
| A. | |
| 3.3. | Can I run more than one instance of the GenePattern server on a machine? |
| A. |
Yes. If you are running more than one installation of GenePattern on the same machine, you must make sure that the port numbers for the GenePattern server and the HSQL server are unique to each installation. The Tomcat server listens on two ports, 8080 (requests) and 8005 (shutdown) by default, and the HSQL server listens on port 9001. All 3 ports need to be modified on the second copy of Tomcat. For example, you can set the GenePattern server port to 8080 and 8005 on one install and 8081 and 8086 on the other, and set the HSQL port to 9001 on one and 9002 on the other. You can configure these port numbers when you are installing the server. |
| 3.4. | How do I configure GenePattern to use a grid engine? |
| A. |
See Using a Queuing System for information on integrating GenePattern with queuing systems such as the Load Sharing Facility (LSF) and the Sun Grid Engine (SGE). |
| 3.5. | How do I configure the GenePattern server on a machine with multiple IP addresses? Can I keep the GenePattern URL from changing when the server hostname changes? |
| A. |
Choose one hostname for the GenePattern server; for example, http://servername.domainname.edu:8080/gp/. Edit the genepattern.properties file and set the following properties:
|
| 3.6. | How do I modify the GenePattern session timeout interval? |
| A. |
Session timeout is set in the Tomcat configuration file of the GenePattern server. To modify this setting for a local GenePattern server:
On the public GenePattern server, session timeout is set to four hours and cannot be modified by a user. |
| 3.7. | How do I configure GenePattern to work with a queuing system? |
| A. |
Queuing systems such as the Load Sharing Facility (LSF) and the Sun Grid Engine (SGE) allow computational resources to be used effectively. If you have such a queuing system installed at your site and you have installed a local GenePattern server, you can configure the GenePattern server to work with the queuing system. For instructions on how to do so, see Using a Queuing System in the GenePattern User Guide. |
| 3.8. | How do I modify how often result files are deleted from my GenePattern server? |
| A. |
From your GenePattern server, go to Administration>Server Settings and click File Purge in the menu at the left. From here you can specify when anaylsis result files are deleted from the server:
Click Save to save your changes. Click Restore to return to the values set at the installation. Note: This setting can only be modified on local GenePattern servers for which you have administrative rights. You cannot change this setting on the Public Server. |
| 3.9. | I'm getting 'error "connection refused"': what is the problem? |
| A. |
A refused connection is most likely due to a proxy issue. If you are behind a proxy or firewall, verify that you have correctly configured GenePattern and/or talked with your local SysAdmin allow GenePattern access to your machine. To configure a proxy connection in GenePattern please do the following:
2. Click Proxy to display the proxy settings. 3. On the Proxy Settings page, enter the hostname and port of your web proxy server. If you do not know them, contact your IT help desk to get the values. If you need to log into the proxy server, also enter your username and password (these will NOT be saved to a file and will need to be reentered following a server restart next time you want to connect). 4. Click Save to update the proxy settings. If this does not resolve the issue, please contact us at gp-help(at)broadinstitute.org. |
| 3.10. | Can I use a file path as input for a GenePattern module? |
| A. |
If you install your own GenePattern server, the default setting is not to allow input file paths. To change this, if you have administrator privileges on the server, add or edit the following in your genepattern.properties file: allow.input.file.paths=true Then restart your server. This will allow users to input an arbitrary network file path (such as file:///server/directory/file.gct) as the value for an input file parameter. When input file paths are allowed, you can use the server.browse.file.system.root property to set a root directory where the GenePattern server begins browsing for the specified network file path. Note: On the Broad public server, we prevent users from entering an input file path (file://urls) as an input file for a module in order to better secure the machine running the public server. |
| 3.11. | How can I work around a LaunchAnywhere error? |
| A. |
If you tried to install GenePattern on Ubuntu, you may have received an installation error: "An internal LaunchAnywhere application error has occurred and this application cannot proceed. (LAX)" with "java.lang.IllegalArgumentException: Malformed \uxxxx encoding." in the stack trace. LaunchAnywhere can interfere with the prompt string formatter PS1. In order to work around this problem, you need to use the following command: $ export PS1=">" This is not only important for installing GenePattern on Ubuntu, but also launching GenePatternServer. Use the command before the GenePatternServer startup command, like so: $ export PS1=">" |
| 3.12. | Why aren't my memory intensive modules working on my 64-bit Windows machine? |
| A. |
32-bit Windows machines only allow you to allocate up to 1.2GB of RAM to processes. 64-bit Windows will allow for more (depending on how much you have installed), but to run memory-intensive Java modules, you must install 64-bit Java and update your GenePattern server's maximum memory allocation (Refer to this information on increasing memory allocation). First, install 64-bit Java. Then, to configure GenePattern to use 64-bit Java:
|
| 3.13. | What does an "unknown program" error mean? |
| A. |
If you tried to launch a visualizer with Internet Explorer and received the following error: User Access Control, unknown program, file origins: downloaded from Internet. this is a Java error. You can manually install the latest version of Java from java.com (Windows) or from Apple (Macintosh), or you can try another browser, such as Google Chrome. |
| 3.14. | Why does the Specify File Path or URL option not work in Internet Explorer? |
| A. |
This is a known issue: when users click the Browse Server File System button, the Internet Explorer web browser window (instead of a pop-up window) becomes the file system browser. If you want to continue using Internet Explorer, you can copy and paste or manually enter the server file path rather than clicking the Browse Server File System button. We recommend using another browser for full functionality. |
Q. A. |
| 4.1. | Does GenePattern support cDNA and other 2-channel microarray data? |
| A. |
Yes. Most GenePattern analyses can run on 2-channel or ratio-based data as easily as on single channel or absolute value data. To run 2-channel data in GenePattern, do the following:
Your data is now in a GCT file that can be analyzed by most GenePattern modules. (If you want to use non-negative matrix factorization (NMF) and your data contains negative values, see the NMF note in the Modules & Pipelines section below.) Ratio values for cDNA data can be computed using a variety of methods. How the ratios are computed determines whether it is possible to create a class (CLS) file for the cDNA ratio data. For example:
If you cannot create a CLS file, you can analyze your data using modules that do not require class files (such as ConsensusClustering), but will not be able to use modules that require the CLS file (such as ComparativeMarkerSelection). |
| 4.2. | Where can I find information about file formats used by GenePattern? |
| A. |
Information on file formats supported by the modules currently in GenePattern is available in File Formats. |
| 4.3. | How can I convert between RES, GCT, and ODF formats? |
| A. |
Run your file through PreprocessDataset. Select the desired output format for your file. If you only want to convert the file type without filtering, select "no filter" as the choice for the "filter flag" parameter. |
| 4.4. | How do I convert a file to GenePattern format? |
| A. |
File Formats describes the file formats used in GenePattern and, where applicable, suggests methods for converting files to these formats. |
| 4.5. | How can I use CEL, MAGE-ML, and MAGE-TAB files in GenePattern? |
| A. |
The ExpressionFileCreator module converts a set of individual CEL files into an expression data set that is usable by GenePattern modules. The MAGEMLImportViewer module imports data in MAGE-ML format into GenePattern, and similarly, the MAGETABImportViewer module imports data in MAGE-TAB format into GenePattern. |
Q. A. |
| 5.1. | Can I keep my existing modules, pipelines, and suites when reinstalling or upgrading GenePattern to a new version? |
| A. |
Yes. Simply install the new version of GenePattern into the same directory as your previous version. See FAQ 2.2 for more information. |
| 5.2. | I have installed a module/pipeline/suite, but I do not see it. What's wrong? |
| A. |
This generally occurs for one of two reasons:
|
| 5.3. | My pipeline requires an input file, but displays a file-not-found error when I enter a file name. What's wrong? |
| A. |
Pipeline input files with spaces in their names may give file-not-found errors. If this happens, use DOS' "dir /x" command to get the 8.3 version of the directory and filename and use that instead of the long filename. If you are using a Unix-based platform, you may need to quote the filename parameters on the command line definition. |
| 5.4. | What are the pipelines whose names start with Lu.Getz.Miska.Nature.June.2005? |
| A. |
These pipelines are the actual analyses performed in the Nature paper, "MicroRNA Expression Profiles Classify Human Cancers" by Lu, Getz, Miska, et. al. Running these pipelines will exactly reproduce their results. You can also tweak the parameters used to see for yourself how their results change if they had performed any step in their analyses differently. Additional information about the paper is available at http://www.broadinstitute.org/cancer/pub/miGCM. |
| 5.5. | How can I run non-negative matrix factorization NMF on data that contains negative values, such as log-ratio or unthresholded Affymetrix data? |
| A. |
To run NMF on data that contains negative values, you must do the following (using the method of Kim, P. M. & Tidor, B. (2003) Genome Res. 13, 1706-1718):
To do this in MATLAB, you can execute the following:
|
| 5.6. | When I do a Hierarchical Clustering analysis, two files are produced, but the Hierarchical Cluster Viewer (JavaTreeView) looks like it needs three files. Do I need another one? |
| A. |
No, you can use the two files that are created and leave the remaining input box blank. HierarchicalClustering creates a cdt file and one or two additional files: an atr file if you clustered by samples (columns), a gtr file if you clustered by genes (rows), or both atr and gtr files if you clustered by both samples and genes (columns and rows). The JavaTreeView module accepts the two or three files created by HierarchicalClustering. |
| 5.7. | How can I export a Heat Map image with gene annotations? |
| A. |
The HeatMapViewer module currently does not include gene annotations with the saved image. Use the HeatMapImage module to include gene annotations. |
| 5.8. | Why do the scores from ComparativeMarkerSelection and ClassNeighbors differ? |
| A. |
When computing the t-test or signal to noise ratio, ClassNeighbors thresholds the standard deviation to ensure that it is at least twenty percent of the mean. Additionally, if the standard deviation is zero, ClassNeighbors sets it to 0.1. |
| 5.9. | I have used ComparativeMarkerSelection to construct gene lists representing different experimental conditions. Is there a GenePattern module that can determine if there are upstream non-coding motifs over represented in those gene lists? |
| A. |
Yes. You can use the GSEA module with the c3 (motif) gene sets. The GSEA module is documented on the Modules page. |
| 5.10. | How do I view the 3D visualization in the PCAViewer? |
| A. |
You must install Java 3D (https://java3d.dev.java.net/binary-builds.html). |
| 5.11. | How do I resolve GISTIC errors? |
| A. |
Most errors reported by users running the GISTIC module are caused by a mismatch between the segmentation and markers files. If an error occurs, verify that all markers indicated in the segmentation file appear in the markers file and only those markers indicated by the segmentation file appear in the markers file. The CBS and GLAD segmentation methods produce GISTIC-friendly marker positions. Partek's latest beta version also produces GISTIC-friendly marker positions. However, if you used an earlier version of the Partek algorithm to create the segmentation file, the algorithm did not report the exact physical position of the first and last markers of the segments. If you run GISTIC on a segmentation file generated using the earlier version of the algorithm, the physical positions of the marker file will not agree with the start or stop positions of the segmentation file. Note that Partek also uses the control probes in the generation of the CN/segmentation. |
| 5.12. | What does the GISTIC MATLAB error "Matrix dimensions must agree." mean? |
| A. |
??? Error using ==> plus If you are running GISTIC and get the error above in your stderr.txt file, you should verify that your segmentation file and markers file are exactly matched. Only the markers from the markers file should be indicated in the segmentation file and only those markers indicated by the segments should be in the markers file. IE seg file should be
5-6 and markers file should be
2 3 4 5 6 |
| 5.13. | How can I see the Color Scheme Legend in HeatMapViewer or HiearchicalClusteringViewer? |
| A. |
To see the Color Scheme Legend in either HeatMapViewer or HierarchicalClusteringViewer, select View>Color Scheme Legend. This legend also coordinates with HeatMapImage if you use the same parameters for HeatMapViewer and HeatMapImage. Note that the issue of legends for row-normalized HeatMaps will be addressed in upcoming releases of HeatMapViewer, and HierarchicalClusteringViewer. |
| 5.14. | ComparativeMarkerSelectionViewer is not launching, but HeatMapViewer is. What is wrong? |
| A. |
You may have a corrupted copy of the ComparativeMarkerSelectionViewer directory. To fix this, do the following:
2. Attempt to launch ComparativeMarkerSelectionViewer again 3. Look in the console for the directory where your viewer is being downloaded and executed. Look for a line like: downloading URL ftp://ftp.broadinstitute.org/pub/genepattern/datasets/protocols/all_aml_test.preprocessed.gct to directory /var/folders/+8/+8VwuO5ZH1S3hKW-uC7XEk0cTCY/-Tmp-/ComparativeMarkerSelectionViewer2436495742394955925.tmp as all_aml_test.preprocessed.gct 4. Remove the "to" directory. So, in this example we would delete /var/folders/+8/+8VwuO5ZH1S3hKW-uC7XEk0cTCY/-Tmp-/ComparativeMarkerSelectionViewer2436495742394955925.tmp 5. Try launching the viewer again. This should download a new copy of the executable and the viewer should display correctly. (Note that bash shells do not display "-" files and directories (like "-Tmp-") So, you may need to use a different shell to find the file path.) If this does not resolve the issue, please contact us at gp-help(at)broadinstitute.org. |
| 5.15. | I get "??? Attempted to access rl(:,2); index out of bounds because size(rl)=[0,1]" in my stderr file when running GISTIC, what does this mean? |
| A. |
If your run of GISTIC fails with the error below in the stderr.txt file, check your segmentation file format. Please see the sections on the segmentation file format in the GISTIC documentation for more details and examples. ??? Attempted to access rl(:,2); index out of bounds because size(rl)=[0,1]. If this does not resolve the issue, please contact us at gp-help(at)broadinstitute.org. |
| 5.16. | I get "??? Index exceeds matrix dimensions." in my stderr file when running GISTIC, what does this mean? |
| A. |
If your run of GISTIC fails with the error below in the stderr.txt file, check your markers file format. Please see the sections on the markers file format in the GISTIC documentation for more details and examples. ??? Index exceeds matrix dimensions. If this does not resolve the issue, please contact us at gp-help(at)broadinstitute.org. |
| 5.17. | Does GISTIC support SNP 6.0 data? |
| A. |
Yes, GISTIC supports the Affymetrix Human SNP 6.0 array. If you have further questions please contact us at gp-help(at)broadinstitute.org. |
| 5.18. | Why can't I run the GenePattern visualizers? |
| A. |
1. Do you have Java installed on your computer? If you're not sure, try Java Tester to find out. If not, you need to install the current version of Java, as the visualizers need it to run. 2. What version of Java do you have installed? If you're not sure how to find this on your computer, try Java Tester to find out. The visualizers require Java 1.5 or later. We have tested them with Java 1.5 and Java 1.6. If you have an earlier version, you will need to update it. (Note that there is a bug in the Java 1.6.0_15 build on Macintosh, so we recommend updating to _17 or later.) 3. Was there a pop-up window as you started the visualizer? The pop-up would have asked if you wanted to allow the applet to access your computer (on Macintosh) or if you wanted to run the application (Windows). Did you click Deny or Cancel? If so, restart your browser, relaunch the viewer, and when the pop-up appears, click Allow (Macintosh) or Run (Windows). 4. Are you running your own server behind a firewall? Make sure your proxy is configured correctly. 5. Set your Java Preferences to display the Java console.
6. Attempt to launch the visualizer again. 7. Look at the console output. If you see a line like this: |
| 5.19. | Why is my module taking so long to run? |
| A. |
Some computationally-intense modules can take a day or more to run. Some examples are FLAMEMetacluster, NMFConsensusClustering, GISTIC, and GLAD. In addition, server load can affect queuing times on the Broad public server, and this can affect the length of time a module can take to complete. If your job does not use a computationally-intense module or a large data set, and it takes longer than about 4 hours to complete, please contact us at gp-help(at)broadinstitute.org. |
| 5.20. | Why does Safari crash whenever I run a Java applet? |
| A. |
A recent update in Safari 5 (we observed the problem with 5.0.1 [5533.17.8]) has caused this problem for some users. The Java applets interfered with include the GenePattern visualizers and the GenePattern installer. The solution is to:
If you continue to experience problems, please contact us at gp-help(at)broadinstitute.org. |
| 5.21. | What does a missing value error for ComparativeMarkerSelection mean? |
| A. |
If you receive the following errors while performing an analysis with ComparativeMarkerSelection: Error in if (min(p) < 0 || max (p) > 1) \{: missing value where TRUE/FALSE needed Execution halted
or ERROR: The estimated pi0 <=0. Check that you have valid p-values or use another lambda method. then a gene in your data has insufficient variation in its expression values. Use the PreprocessDataset module with a filter that is more stringent than you have previously used on your data set before running ComparativeMarkerSelection. If you continue to experience problems, please contact us at gp-help(at)broadinstitute.org. |
| 5.22. | Why does ExpressionFileCreator fail? |
| A. |
If ExpressionFileCreator fails on your server, but works on the public server, you need to install version 8 or version 9 of ExpressionFileCreator. Only version 7 (and earlier versions) is available in the public repository (via Pipelines & Modules>Install from repository) because versions 8 and 9, which support the updated CEL file formats, require R 2.8, and GenePattern installs with R 2.5. To update to versions 8 or 9, use the following steps:
R2.8=// |
| 5.23. | What does "Could not obtain CDF environment" mean? |
| A. |
If ExpressionFileCreator gives you the following error when you try to convert Affymetrix CEL files to GCT format: Error in getCdfInfo(object) : Then the CDF for your array was not found in the Broad-hosted CDF library. You need to use a custom CDF to support the conversion. CDF files are available here. For instance, if you were analyzing the Mouse Gene 1.0 ST Array, you could type in that search term on the Affymetrix page. The result page opens, where you could find your CDF file under the Library Files section. Provide this CDF file as the input for the cdf file parameter in ExpressionFileCreator. |
| 5.24. | What does the GISTIC error, "Invalid file identifier" mean? |
| A. |
The usual cause of this error is spaces in any of the input file names. |
| 5.25. | Why does the "no such module" error occur for a module on the server? |
| A. |
If you run an imported pipeline on your own GenePattern server, and you get the error, "No such module [module name]", when you know you have that module on your server, then the pipeline requires a version of the module that is not on your server. If you return to the pipeline page and click Properties, you can view the modules that are required but not installed. If you install these module versions from the repository, the pipeline will run. |
| 5.26. | How can I properly view my GCT or RES file in IGV? |
| A. |
In order to properly view a GCT or RES file in IGV, some pre-processing is required.
The default display option for a GCT or RES file is the Heatmap. For the heatmap to make sense, the data must be row-centered, scaled and possibly have a threshold applied. Currently the workflow for this is as follows. 1. Run PreprocessDataset in GenePatternIf the data contains negative (non-log transformed) values, run it through PreprocessDataset. The default threshold there is 20. 2. Run data through igvtoolsCurrently igvtools is a stand-alone utility providing a set of tools for pre-processing data files. For the preprocessing of unscaled GCT and RES files an option called "formatExp" is provided. It takes a non-log expression file and performs the following steps. (Note that these are the steps used for our internal expression data prior to viewing in IGV.)
You can download the beta version of igvtools containing this option. After unzipping, igvtools can be used on the command line to transform the RES or GCT file as described above. Use following command line (if you are on a Windows platform, use "igvtools.bat" instead of "igvtools"): ./igvtools formatExp inputFile outputFile To run this on your preprocessed dataset, save the resulting .preprocessed file, and provide it as "inputFile" in the command line. 3. Run IGV with Scaled and Centered DataTake the output from igvtools and provide it as input to IGV. |
| 5.27. | Why is nothing happening when I try to upload my large file? |
| A. |
There are limitations on file upload size. Files uploaded via the Browse button on the module input page must be under 1.2 GB. To use larger files, there are a few options:
|
| 5.28. | What does "Error in subfiles: subscript out of bounds" mean? |
| A. |
This error can be produced if there are hidden files or directories in the ZIP archive. This usually occurs on a Mac when using the "Compress" option from the right-click pop-up menu. If this is the case, you may want to use the zip command from the terminal window to zip files instead. If you didn't Compress on a Mac, then you should check that there are no hidden files in the ZIP archive. |
| 5.29. | How do I pre-process my raw expression data so I can view it as a heat map in IGV? |
| A. |
Run your data file as the input file in IGVTools.formatexp, which scales and centers your data. Use the output file of IGVTools.formatexp as input for the IGV module. IGVTools.formatexp is currently in beta. If you would like to use this module, please contact gp-help(at)broadinstitute.org. |
| 5.30. | How can I pre-process my RNA-seq data for IGV? |
| A. |
The recommended format for RNA-seq data in IGV is the BAM file. If you run your SAM or BAM file as the input file for the SortSam module, you can sort and index it, and can convert a SAM file to BAM. In addition, the IGVTools.sort and IGVTools.index modules can sort and index a SAM file. These modules are currently in beta. If you would like to use them, please contact us at gp-help(at)broadinstitute.org. |
| 5.31. | Does GenePattern support SNP 6.0? |
| A. |
No. The GenePattern team is presently working to support SNP 6.0. |
| 5.32. | Why did the module I tried fail to run with my ZIP file as input? |
| A. |
If your ZIP file has a directory in it, GenePattern cannot resolve it. Unfortunately, if you generated your ZIP archive using the Finder on the Macintosh OS, the Mac builds a directory structure into your ZIP archive and GenePattern cannot resolve it. To zip on a Mac, use the zip command from a terminal window; for example, if you wanted to create a ZIP archive called "all_foo" that contains the files all_foo.cls and all_foo.gct, you could use the following command: zip all_foo all_foo.cls all_foo.gct Some other reasons that your ZIP file may fail include spaces in the names of the files or hidden files. If you cannot locate the issue with your ZIP file, please contact us at gp-help(at)broadinstitute.org. |
| 5.33. | Why did my GenePattern job fail? |
| A. |
The first place to look for the reason is the stderr.txt file, which should be available in the job summary or job status page. This file often contains plain text indicating what went wrong with a job, such as formatting or filtering errors. If you find that this file does not help you resolve the error, please contact us at gp-help(at)broadinstitute.org. |
| 5.34. | How can I use the RNA-seq modules available in GenePattern? |
| A. |
You can use the RNA-seq modules either on the Broad public server or by installing them on a GenePattern server installed on your machine or a network-accessible server. If You Choose to Run RNA-seq Modules on Your Own GenePattern ServerIf you have not installed GenePattern on your local machine, instructions for installing a local GenePattern server are provided on the Download GenePattern page. If you have already installed a GenePattern server, select Modules & Pipelines>Install from repository. The page will present all available modules. You only need to select the checkboxes for the modules you want and click Install Checked. Note: The main analysis RNA-seq modules (Bowtie, BWA, Cufflinks, TopHat, and Scripture) currently only run on Macintosh and Linux. If you do not have access to machines with these operating systems, you can use the modules on the Broad public server. The conversion/utility modules that are related to the RNA-seq modules are available for Macintosh, Linux, and Windows. You may find it helpful to enable your GenePattern server to accept file paths in order to handle large input files that are already present on the system where your local server is installed. To do this, edit genepattern.properties (located in the resources directory under your GenePattern server directory) and make allow.input.file.paths=true. This allows users to input a network file path (such as file:///server/directory/file.gct) as the value for an input file parameter. When this value is set to true, you can define a root directory where the GenePattern server begins browsing for network files by setting server.browse.file.system.root to the root directory you want to specify. Example: In genepattern.properties, setting server.browse.system.root=/Users/mydata/ngs will cause the browser window to open to /Users/mydata/ngs when a user chooses Specify File Path or URL. |
| 5.35. | Why is my GenePattern job stuck in the PENDING state? |
| A. |
There are a few reasons why this might occur. Jobs are often PENDING because GenePattern is a shared resource. When your job is in the PENDING state, it means that it is waiting in the queue behind other jobs for the GenePattern server to submit the job to the server farm. Jobs that use large files and access them via an external URL may hold up the line while those files are transferred to the GenePattern server, even keeping jobs that normally take a few seconds in PENDING. The job will run when the queue clears up. If this is a common issue on your GenePattern server, it is possible to configure it to help reduce the wait. If you want to reconfigure your GenePattern server in this way, please contact us at gp-help(at)broadinstitute.org. |
| 5.36. | Why do I receive an error when running my preprocessed GCT file and CLS file in ComparativeMarkerSelection? |
| A. |
If you tried to run your preprocessed GCT file and CLS file in ComparativeMarkerSelection, but it gives you the following error: An error occurred while reading the file ClassFile.cls.Make sure your CLS file is space delimited and not tab-delimited. This is the most common cause of this error. If this does not stop the error, please contact us at gp-help(at)broadinstitute.org. |
| 5.37. | Why did I get a warning stating that my index is older than my BAM file? |
| A. |
If you try to run an indexed BAM file through a module and receive a warning that your index file (BAI) is older than your BAM file, it means that the timestamps for these files are out of sync. If you receive this warning, you should index your BAM file by using the SortSam module. |
| 5.38. | How do I run several files through a set of modules in parallel? |
| A. |
You can do this by creating a pipeline for the jobs you want to run in parallel. Then you can submit your set of data files to the pipeline as batch job. This in-depth article discusses working with batch processing: Batch Processing in GenePattern 3.3.3 There are additional features that make it easier to work with large input files and to run batches of jobs in parallel:
|
| 5.39. | How do I zip my files for use in GenePattern? |
| A. |
On Windows, you need to select the files to be added to the ZIP archive (hold down the Control or Shift key while selecting to select a group). Then right-click on the group and select WinZip (or whichever zip application you have on your machine). Do not select a folder and zip it – that will create a directory inside the ZIP archive; if your ZIP archive has a directory in it, GenePattern cannot resolve it. On Macintosh, if you generate your ZIP archive using the Finder, Mac builds a directory structure into your ZIP archive and GenePattern cannot resolve it. To zip files on a Mac, use the zip command from a terminal window (launched from Applications/Utilities); for example, if you wanted to create a ZIP archive called "all_foo" that contains the files all_foo.cls and all_foo.gct, you could use the following command: zip all_foo all_foo.cls all_foo.gct If you follow these instructions and find that GenePattern does not accept your ZIP file, check for spaces in the names of the files or hidden files in the ZIP archive. If you cannot locate the issue with your ZIP file, please contact us. The GenePattern team plans to develop a ZIP module to help users with creating ZIP archives. |
| 5.40. | How can I easily run the same analysis on many different data files? |
| A. |
As of GenePattern 3.3.3, GenePattern supports batch jobs. To use this feature:
The module will be run once for each file selected. All the job results for the batch will be listed under a single batch ID. If you have difficulties with the batch upload function in GenePattern 3.3.3, please contact us at gp-help(at)broadinstitute.org. |
| 5.41. | Why can't I use a directory as input for all modules? |
| A. |
While as of GenePattern 3.3.3, GenePattern supports the use of directories as input for modules, not all modules support this function. A few quick ways to tell if a module does accept directories are:
|
| 5.42. | Does ExpressionFileCreator support Human Exon 1.0 ST arrays? |
| A. |
ExpressionFileCreator does not currently support Human Exon 1.0 ST arrays. The GenePattern module development team is working on a module for this. |
| 5.43. | How do I format my GenePattern output for submission to GEO? |
| A. |
There are currently no modules in GenePattern for submitting data to GEO. The NCBI has webtools for this purpose, such as GEOarchive. |
| 5.44. | How do I get a heat map with a high enough resolution for publication? |
| A. |
To generate a new heat map image at a resolution near 300 dpi, you can:
If you already have a heat map image that you cannot for some reason recreate that is at 72 dpi, you can use an image manipulation application that can scale images (like Adobe Photoshop or GIMP) to increase the resolution to 180 dpi. This will shrink the image by half, but 180 dpi is usually the minimum resolution necessary for print publication. |
| 5.45. | I am running a large number of RNA sequencing jobs, and I'd like to be able to look at the quality of the data. Is there a tool I could use for this? |
| A. |
Yes: the RNAseQC module in GenePattern calculates standard RNA-seq related metrics, including depth of coverage, ribosomal RNA contamination, continuity of coverage and GC bias. See the module documentation for the recommended data processing workflow for optimal use of this QC analysis. |
| 5.46. | Is there an easier way to create a CLS file than creating it by hand in a text editor? |
| A. |
Try the ClsFileCreator module in GenePattern. The ClsFileCreator is a wizard-based tool that can be used to create class label (CLS) files from array data in the GCT or RES file formats. |
Q. A. |
| 6.1. | How can I retrieve external database information from GenePattern? |
| A. |
The GenePattern server itself does not connect to any database, but modules can and have been written to connect to databases and retrieve data from them including caArray (caArrayImportViewer) and Gene Expression Omnibus (GEOImporter). To connect to any database of your choice, write a simple command-line program to connect to the database and retrieve data into a file format and install this program as a module into GenePattern (see Creating Modules). |
| 6.2. | My MATLAB figures are not appearing in the MATLAB visualizer I created. Why? |
| A. |
When creating a matlab visualizer using matlab 7.0 compiled m-code (any release before 7.4), any figures that you create in MATLAB must have the value visible set to on or they will not be drawn to the screen. |
| 6.3. | Can my module use a different version of R than GenePattern? |
| A. |
GenePattern modules can be written for any version of R. For details on how to specify which version to use, see Using Different Versions of R. |
Q. A. | |
| 6.4. | How can I share a GenePattern module? |
| A. |
You can submit your module to the GenePattern Archive (GParc). You will need to register to use all the features of GParc. Check the resources for module developers for best practices in testing and documentation before submitting your module.
For an introduction to GParc, see the Submitting your module to GParc video tutorial. |
Q. A. | |
| 6.5. | Why can't I specify a 64-bit platform for a module? |
| A. |
GenePattern does not have a valid CPU Type for 64-bit platforms. So if you try to specify a 64-bit CPU Type, the module will fail on 64-bit platforms, whether or not they are running compatibility mode. You will have to set the CPU Type to 'any' and add more information on the appropriate platforms in your documentation. If this does not stop the module from failing on appropriate platforms, contact us at gp-help(at)broadinstitute.org. |
Q. A. |
| 7.1. | Where can I find out more about how to launch GenePattern modules from other programming languages? |
| A. |
The reference guide for accessing GenePattern modules from Java, MATLAB, and R is the GenePattern Programmer's Guide. |
| 7.2. | Can I use the GenePattern APIs to create a web service that programmatically accesses the GenePattern server? |
| A. |
GenePattern is based on a web services API already so you may not need to create a new web service for this purpose. The WSDL for the GenePattern server is available at
http://your_server:your_port/gp/services
To get an easy start on creating a web services client to GenePattern:
You can then modify the pipeline code to do what your application needs. For more information about the programming libraries, see the GenePattern Programmer's Guide. |
| 7.3. | Why can't I call my pipeline/module from MATLAB? |
| A. |
A pipeline or module with a period in its name cannot be called from MATLAB. |
Q. A. |
If you haven't found what you are looking for, please send an email to gp-help(at)broadinstitute.org.

