-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Report of MOAs sizes/values per dose analysis (ranking) on consensus datasets #2
Comments
@AdeboyeML and I inspected one of the cases and found that the dose remapping can be flawed. E.g. |
@AdeboyeML uses Also consider computing the correlation matriix of the subsetted dataframe corresponding to the replicates you care about and then take the median of the lower (or upper triangular matrix). This is an implementation detail, but the logic is otherwise correct. |
just an update to #2 (comment)
- MOAS that do not have the same number of compounds in all Doses |
@gwaygenomics @shntnu - Results from Null DistributionMajor points:
- Visualization: non-parametric p-value vs median pairwise correlation score (for each MOA) per dose- Median Consensus- Median DMSO Consensus- MODZ Consensus- MODZ DMSO Consensus
- Median Consensus -MOAs with p-values <0.05 in all doses- Median DMSO Consensus -MOAs with p-values <0.05 in all doses- MODZ Consensus -MOAs with p-values <0.05 in all doses- MODZ DMSO Consensus -MOAs with p-values <0.05 in all doses |
This comment has been minimized.
This comment has been minimized.
- In addition to the figures shown in #2 (comment), I included the density distribution of the p-values vs median scores in the same plots:- - Median Consensus- - Median DMSO Consensus- MODZ Consensus- MODZ DMSO Consensus |
@AdeboyeML quick question: In this figure (where each data points is an MOA, I assume), if two MOA's have the same number of compounds, and the same x-axis value (median pairwise correlation between compounds), do they also have the same y-axis value? Or phrased more simply, are you computing the null distribution once for each MOA size (by size I mean number of compounds in the MOA classs)? Or are you doing it once per MOA? Both are fine, but the former is preferred to remove y-axis variance that's not informative. |
@AdeboyeML kudos for making the data so easy to peek into. I was curious to see if one could see a dose-response in some of the MOAs. Looks like we do in a few. More during profiling check-in! Codemoa_consistency <-
read_csv(
"https://raw.githubusercontent.com/broadinstitute/lincs-profiling-comparison/9bc5db8167674e2c8bec5cee3fcc043117acfbf6/1.Data-exploration/moa_sizes_consensus_datasets/median_dmso_moa_median_scores.csv"
)
moa_consistency %<>% rename(moa = X1)
moa_consistency %<>% pivot_longer(-moa, names_to = "dose", values_to = "score")
moa_consistency %<>% mutate(dose = as.integer(str_remove(dose, "dose_")))
moa_consistency %<>%
inner_join(
moa_consistency %>%
group_by(moa) %>%
summarize(score_median = median(score)) %>%
filter(score_median > 0.30)
)
p <-
ggplot(moa_consistency, aes(dose, score)) +
geom_line() +
facet_wrap( ~ round(score_median, 2) ~ moa,
ncol = 5,
scales = "free_y")
ggsave("~/Desktop/moa_score_dose_response.png",
width = 10,
height = 10) |
@shntnu In regards to question in #2 (comment)
|
@AdeboyeML thanks for clarifying. Everything looks good, but the one change I recommend is to use the same null distribution for all MOAs of the same size. There is no upside to having different null distributions for each unique MOA (of the same size), while it has the downside of adding uninformative variance to the p-value estimates. cc @gwaygenomics |
@shntnu @gwaygenomics Results from the Null distribution, based on using the same null distribution for all MOAs of the same size.
Median ConsensusDistribution of the median pairwise correlation scoresP-values distribution across dosesIncrease in MOAs with values below the significant level (0.05) as dose increasesMOAs with p-values <0.05 in all dosesThese MOAs dose responses:The above results and distributions are similar for the Modz Consensus datasets |
That's really strange – sounds like a bug to me |
This comment has been minimized.
This comment has been minimized.
@shntnu @gwaygenomics Results from the Null distribution, based on using the same null distribution for all MOAs of the same size. (L1000 & Cell Painting)
Cell paintingMODZ ConsensusL 1000MODZ Level-5 data- The above results and distributions are similar for both median and rank level-5 (Consensus data) in cell painting and L1000. |
@gwaygenomics @shntnu
MOAs with only one compound were excluded. Out of 601 MOAs, 369 were excluded.
Results: I will be showing only the first 10 MOAs values per dose for all the consensus datasets and the heatmap plots of the 232 MOAs for each of the consensus datasets. Doses 0 (has only dmso) and 7 (has only two MOAs) were excluded from the analysis.
1a. Median Aggregation Consensus dataset - consensus_median (whole plate normalization).
Heatmap plots -- I split the dataframe into 3 parts for easier visualization:
1b. Median Aggregation Consensus dataset - consensus_median_dmso (dmso normalization).
heatmap plots
2a. Modified Z Score Aggregation (MODZ) dataset - consensus_modz (whole plate normalization).
heatmap plots
2b. Modified Z Score Aggregation (MODZ) dataset - consensus_modz_dmso (dmso normalization).
heatmap plots
The text was updated successfully, but these errors were encountered: