bcftools annotate labels transferred FILTER column as single value "Number=1" in header #2335

Closed
toddajohnson opened this issue Dec 13, 2024 · 1 comment

Comments

@toddajohnson

I annotated a VCF with another VCF of TOMMO allele frequency data, updating the FILTER field and several INFO tags.
Sites in the VCF have FILTER set to ".", "PASS", or one or more semicolon-delimited FILTER flags.
I then ran "bcftools annotate -c INFO/TOMMO_FILTER:=FILTER -Ov -o test_anno.vcf test.vcf" to copy the FILTER field into an INFO field. Semicolons in the FILTER field were correctly replaced with commas when creating the TOMMO_FILTER field, but the header line was:
##INFO=<ID=TOMMO_FILTER,Number=1,Type=String,Description="Transferred FILTER column">

So, instead of "Number=.", which records that the tag can hold any number of values, the header contains "Number=1". I don't know whether that affects how bcftools parses the field in downstream analyses, but the R VariantAnnotation package treats TOMMO_FILTER as having a single value and therefore parses only the first value in TOMMO_FILTER.

Is the assignment of "Number=1" expected behavior?
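
To illustrate the downstream effect described above, here is a minimal Python sketch of a parser that trusts the header's Number declaration. It is not how VariantAnnotation or bcftools is implemented; the function name `parse_info_value` and the filter names in the usage note are hypothetical, and only "." and integer Number values are handled (not A, R, or G).

```python
def parse_info_value(raw: str, number: str) -> list[str]:
    """Split a raw INFO value according to the header's Number declaration.

    Hypothetical helper for illustration only: it handles "." and integer
    counts, and ignores the A/R/G codes defined by the VCF specification.
    """
    values = raw.split(",")
    if number == ".":
        # Unbounded: keep every comma-separated value.
        return values
    # Fixed count: a parser that trusts the header keeps only that many.
    return values[: int(number)]
```

With a value such as "LowQual,HighMissing" (made-up filter names), `parse_info_value(value, "1")` keeps only the first entry, while `parse_info_value(value, ".")` keeps both.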

@pd3 pd3 closed this as completed in c58d522 Jan 1, 2025
@pd3
Member

pd3 commented Jan 1, 2025

It should be Number=. as there can be multiple FILTER values. This is now fixed, thank you.
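
For files already written with the old header, one possible workaround is to rewrite the header line yourself. The following is a minimal sketch, not part of bcftools: the function name `set_info_number_unbounded` is hypothetical, and it assumes the header line starts with `##INFO=<ID=<tag>,` as in the example from this issue.

```python
import re


def set_info_number_unbounded(line: str, tag_id: str) -> str:
    """Return the header line with Number=. if it declares INFO tag `tag_id`.

    Hypothetical helper: any other line is returned unchanged.
    """
    prefix = f"##INFO=<ID={tag_id},"
    if not line.startswith(prefix):
        return line
    # Replace only the first Number=... key in the declaration.
    return re.sub(r"Number=[^,>]+", "Number=.", line, count=1)


header = (
    '##INFO=<ID=TOMMO_FILTER,Number=1,Type=String,'
    'Description="Transferred FILTER column">'
)
fixed = set_info_number_unbounded(header, "TOMMO_FILTER")
```

The corrected header text could then be written to a file and applied with `bcftools reheader -h`, assuming the rest of the header is left as-is.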
