Per-sample parallelism
Posted in Ask the GATK team | Last updated on


Comments (2)

Queue allows for "per-region" parallelism using scatter-gather. However, not all GATK tools support this (e.g. RealignerTargetCreator), and not all tools in a pipeline are GATK tools (e.g. BWA).

What I would like to do in the 1st phase of the best-practice pipeline is "per-sample" parallelism, that is, process each sample in parallel on a separate cluster node. Is there a recommended way to do this?


Return to top Comment on this article in the forum