Question #3

@awata-hiroko

Hi,

I tried to estimate the PCR duplicate ratio of my samples using this tool. I ran the "extract_duplicates" command with a VCF file that I generated from my samples using "samtools mpileup".
The output files from "extract_duplicates" look strange to me: the contents are almost identical across multiple samples, and they contain "PCR duplicates marked 0 total-reads 0 frac -nan discarded 49244807" and "#clusters
total reads (PE=1) 0 unique-reads 0 duplicates:0, duplication rate -nan".
Judging by "samtools rmdup", my samples seem to contain many PCR duplicates (approx. 90% of the total reads). So I don't trust the output of "extract_duplicates" (it says there are no PCR duplicates in my samples, right?).

What should I do to resolve this issue? Should I use VCF files from a public database such as ENSEMBL?
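
A note on the "-nan" values: they probably do not mean "no duplicates". A rate printed as NaN is what a 0/0 division produces, so it seems that no reads were counted at all (total-reads 0) while 49244807 were discarded. The sketch below only illustrates that arithmetic. It is not the tool's code, `duplication_rate` is a hypothetical helper, and it assumes the rate is simply duplicates divided by total reads.

```python
import math


def duplication_rate(duplicates: int, total_reads: int) -> float:
    """Hypothetical helper: duplicates / total_reads, or NaN if no reads were counted."""
    if total_reads == 0:
        # 0/0 is undefined; a C program would typically print this as "nan" or "-nan".
        return math.nan
    return duplicates / total_reads


# Counts taken from the reported output line: 0 marked, 0 total reads.
reported = duplication_rate(duplicates=0, total_reads=0)
print(math.isnan(reported))  # the rate is undefined, not zero

# For comparison, made-up counts matching the ~90% seen with "samtools rmdup".
print(duplication_rate(duplicates=90, total_reads=100))
```

In other words, the thing to investigate is why every read was discarded, rather than why the duplicate count is low.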
