-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Difference in the data between the CellProfiler data and the Efficientnet data #9
Comments
Yea this is confusing because those are two different stages of data. Level 5 above and level 3 below. See: Top: Bottom: |
@MattiasSehlstedt Hope that makes it clear. Also, if you use @ symbols, I will respond faster next time :) |
@michaelbornholdt So I guess that would mean that I would either have to modz your efficientnet data or work with https://github.com/broadinstitute/lincs-cell-painting/blob/master/profiles/2016_04_01_a549_48hr_batch1.dvc if I want a one-to-one row relation between some CellProfiler data and your Efficientnet data? |
Yes correct! Also. You should be using the Spherized CP data instead of the non spherized. |
What is the reason for there existing several replications across several different "Metadata_Plate" in the efficientnet data, while this doesn't exist within the CellProfiler data?
If one loads each of the two datasets, and runs the query
display(df[df.Metadata_broad_sample == 'BRD-K05804044-001-06-0']))
then the CellProfiler data will return 6 lines, where the difference between them is the dose concentration.
The Efficientnet data will return 5 lines, where the difference between the lines is their "Metadata_Plate" value and their "Metadata_Treatment_Replicate" value.
How come there seems to exist replicates within the Efficientnet data when the data is aggregated based on wells? And if the Efficientnet values are aggregations themselves, then how does these tie into the CellProfiler data and its lone row?
The text was updated successfully, but these errors were encountered: