I have been trying to use ReadBackedPhasing, but its is producing an output with no phasing tags (HP tag). It works with a vcf file that is been called on a single bam file, but does not produce any output from vcf files generated from vcf-subset. In short, I would like to know what INFO/FORMAT fields it looks for while assigning HP tag.
Hello folks. I am using the last GATK version (3.1) and OsX Mavericks (with java 1.7).
When a use the ReadBackedPhasing routine, it is always reporting the same original unphased VCF. Curiously, ReadBackedPhasing is reporting that all sites were phased :
--- Phasing summary [minimal haplotype quality (PQ): 20.0, maxPhaseSites: 10, cacheWindow: 20000] --- Sample: sampleSM Sites tested: 40 Sites phased: 40 Phase-inconsistent sites: 0 [phased: 0, unphased:0]
But the output VCF is always unphased. Do you have any idea of what am I doing wrong? Thank you in advance.
Hi all, Just to give some context: I have filtered my trio data with some scripting to only heterozygous (hets) variants that may constitute compound hets (i.e., if phase could be accurately inferred). This is essentially phasing the child data by transmission - for all the het variants seen in the child I looked at the father and mother vcfs and filtered relevant sites as follows:
My question is: can I use this filtered child vcf as my input for ReadBackedPhasing? For each of my genes that feature in the child vcf after the above filtering, I want to determine whether the variants seen within the gene are in the same haplotype or not. I am just not sure if I can do the phasing at this stage - is this alright? If I had to do the phasing early on with the raw vcf, I am not sure how would I maintain the correct phasing information when applying this filtering downstream to the phased vcf (i.e., as the phasing of a het variant is relevant to the previous PASS-ing het variant in the vcf?).
Help would be appreciated! Thanks a lot, Eva