Ingesting the same sample in two different projects #4893
-
Hello, hope you're doing well! I'm wondering whether seqr (with the Hail backend) supports ingesting the same sample into multiple projects? For instance, can I upload a VCF to Project A today, and then upload the same VCF to Project B a month later? Would that also lead to duplication of allele counts in seqr for the samples present in both projects A and B? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
Hi Anajli, There is nothing that will prevent you from ingesting the same sample into multiple projects. However, this would lead to duplication of allele counts and thus we strongly discourage doing this, as it adversely affects your searches and the ability to assess variants. Can you explain why this is something you would be considering in the first place? The primary reason for splitting data into different projects is usually access control, but if it is all the same samples then it would seem more logical to keep one project and grant everyone who needs access to it. Understanding the use case may help me propose better alternatives Best |
Beta Was this translation helpful? Give feedback.
Linking data from one project to another is not possible in seqr. It seems like your options are to either make a project just for that single family, or you can duplicate the data. There is nothing that will detect or prevent data duplication, its just something you will need to keep in mind going forward as you do your analysis. While it is not recommended, a single family duplicated should not be too terrible for your allele counts. That said, I do think having a project with a single family is also a valid solution, but the researchers will be able to see each others tags and notes.