-
Notifications
You must be signed in to change notification settings - Fork 335
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not reproducible #562
Comments
And even worse, if I change the number of threads to 1 I get very different results:
|
Examining thread dependency a bit more:
|
If I revert to version 0.20.1 then things are repreoducible (and give me a different result than any of those above: 42341 reads are removed for being too short). That is on par with what trimmomatic returns and is probably the correct result. |
I stumbled upon the same problem with version 0.23.4, but found that the results are reproducible when I run using 1 thread. It's interesting that version 0.23.0 claims to have fixed the reproducibility problem... |
could you please give me a piece of sample data, along with the command ? |
My test dataset is too large to share, but here is the exact fastp call I used I'm running in a modified version of this docker container which is based on ubuntu "mambaorg/micromamba:1.5.8-jammy" fastp installed with micromamba I used Honestly, fastp is so fast that 1 thread is still usable. Love the program and thanks for following up! EDIT: |
You can download my sample dataset here: https://bis180ldata.s3.amazonaws.com/downloads/Illumina_Assignment/GH.lane67.fastq.gz My commands are in my earlier posts in this thread. |
Version 0.23.4
I get a different number of reads and bases after filtering every time I run fastp
The text was updated successfully, but these errors were encountered: