-
Notifications
You must be signed in to change notification settings - Fork 74
Remove R script/notebook package installs from remaining modules (Part 4 of 4) #704
Remove R script/notebook package installs from remaining modules (Part 4 of 4) #704
Conversation
PCA numbers also look slightly different. I am rerunning:
In the Docker container on the branch for #675 right now and I've gotten through some of the steps that are showing changes here and nothing is showing up for me as altered in the scores TSV files. Hardcoded seed in shell script OpenPBTA-analysis/analyses/transcriptomic-dimension-reduction/01-dimension-reduction.sh Line 14 in 608f7c5
Here's that seed getting passed to OpenPBTA-analysis/analyses/transcriptomic-dimension-reduction/01-dimension-reduction.sh Line 28 in 608f7c5
Here's were we run t-SNE via a custom function in OpenPBTA-analysis/analyses/transcriptomic-dimension-reduction/scripts/run-dimension-reduction.R Line 221 in 608f7c5
Here's that seed getting set in the custom function: OpenPBTA-analysis/analyses/transcriptomic-dimension-reduction/util/dimension-reduction-functions.R Line 32 in 608f7c5
Wondering if something else is going on. |
Ran the same thing on my |
@jaclyn-taroni I've checked out AlexsLemonade master and reran on the docker image and am still getting changed data. So I think we've narrowed down that it is due to changes in the docker image/container. Can you tell me more about what docker container/image you used? Specifically what version of Rtsne is it using? I'm gonna double check that I'm using the most updated Docker image as well. |
The versions for Rtsne are unchanged as far as I know (the last release was end of 2018, I believe). I have a version of the Docker image that is post-refactor/change of the Bioconductor install process. |
Upon digging into this PR, I was unable to pinpoint a specific issue and/or solution but I did determine the following:
In other words, the mystery, unfortunately, still continues... |
Can someone file an issue to track the t-SNE results not being reproducible (including how we might test if it's the underlying |
Filed issue #716 to track this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@cansavvy, everything in this PR looks LGTM (once merge conflicts are fixed with what is being tracked in issue #716 in mind).
I did notice that the analyses/survival-analysis/results/cox_regression_tmb.tsv
output file also changed in this PR, with the conf.high variable within containing a change of 1.026 -> 7.275 (the most significant change I noticed).
It appears that the input file pbta-snv-consensus-mutation-tmb-coding.tsv
was last updated in the v14 data release, while what's in master was last ran 6 months ago so this makes sense.
That being said, I am approving these changes (once the results also make sense to you, someone who would be more familiar with the survival-analysis
module than I am) 👍 .
Yes! Because of changes to tmb calculations (#564) I would expect the regression to have different results. I'll get those conflicts resolved. |
Purpose/implementation Section
What was your approach?
I'm removing R package installations in the scripts in accordance with #690
Here I've addressed what should be the last of the changes to wrap up and closes #690.
01-plot-oncoprint.R
and reranrun-oncoprint.sh
run-dimension-reducion.R
and reran./dimension-reduction-plots.sh
util/survival_models.R
and reransurvival-analysis_template.Rmd
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
t-SNE numbers look different. Are we able to set the seed for that?
Everything else look okay? Results that should be there are there?
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
Yes, but the results shouldn't really be different.
Reproducibility Checklist
No changes are needed here.