Comparing the distribution of median scores between L1000 and Lincs Cell painting Consensus datasets #3

gwaybio · 2020-11-18T18:36:20Z

I am pasting @AdeboyeML's analysis performed in #2 (comment) so that we can continue a targeted discussion in a clean github issue:

@shntnu @gwaygenomics

213 MOAs (Mechanism of actions) present in both Cell painting and L1000 Level-5 data are compared based on the distribution of their median scores.
During alignment of MOAs in L1000 with the MOAS in Cell painting, I realized that MOAs found in the same broad sample in both L1000 & Cell painting data are partly named differently i.e. the naming of same MOAS in both are not consistent.

Median scores in cell painting data are more spread out and have more extreme median values than L1000 data.

gwaybio · 2020-11-18T18:58:54Z

@AdeniyiML - this is beautiful.

I see a couple things from this analysis:

A couple things to keep in mind:

we know that MOA annotations are wrong - thinking that they aren't will hurt us trying to interpret these results
It makes sense to me that morphology will group things more tightly than gene expression. There are many ways to alter a gene expression profile that can manifest into similar morphology.
I am most interested in following up on some groups of MOAs that are bad in one tech and good in the other (the off-diagonal)👇

AdeboyeML · 2020-11-19T16:36:12Z

@gwaygenomics

I am most interested in following up on some groups of MOAs that are bad in one tech and good in the other (the off-diagonal)

Provide feedback