-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix IntegrateGQ.sh errors due to presence of variants of just one type #760
base: main
Are you sure you want to change the base?
Conversation
@@ -30,41 +30,57 @@ zcat $RD_melted_genotypes \ | |||
|gzip \ | |||
>rd_indiv_geno.txt.gz | |||
|
|||
##Deletions, need to PE-SR genotypes to match RD format (2==ref)## |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think you should keep this comment in and the one about duplications below
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated accordingly.
@mwalker174 Going to mark this as a draft PR until I've got more clarity into why some of the genotypes are being changed. I did take a quick look already, and it seems like the specific outputs from |
This PR addresses Issue #759.
Description
Addresses an issue in the
IntegrateGQ.sh
script where the combination ofset -o pipefai
l and{ fgrep ... || true; }
patterns could cause pipelines to fail prematurely if no matches were found. By leveragingawk
withARGIND
, the script no longer fails silently and continues processing regardless of theSVTYPE
makeup of the shard.Testing
Test 1: Changes circumvent WDL failure.
IntegrateGQ.sh
, wherein each variant in its depth & PESR VCFs are of type- note that the run fails.IntegrateGQ.sh
- note that the run completes successfully.Test 2: Changes do not change the existing logic.
IntegrateGQ.sh
, including all types of variants.IntegrateGQ.sh
- note that the output depth & PESR VCFs are identical.Test 3: Changes produce expected results when using DEL-only inputs.
Conducted local tests where I used the
IntegrateGQ.sh
inputs for a run that contained all variants, then removed all non-DEL records from this. After runningIntegrateGQ.sh
with these DEL-only inputs, the outputs were identical for the matched counterparts in the case where the inputs contained all variants.Note: If it's preferred to reflect this in Terra, let me know - I did it locally for the time being as there is no dedicated WDL file to test
IntegrateGQ
in isolation, though I could make one.Test 4: Changes produce expected results when using only non-DEL inputs.
Same as Test 3, but this time removed all DEL records from the inputs. The outputs were identical for the matched counterparts in the case where the inputs contained all variants.