MOA Multi-label Classification Predictions #8

Open
AdeboyeML opened this issue Feb 16, 2021 · 3 comments

AdeboyeML commented Feb 16, 2021

MOA Multi-label Classification Predictions

@gway @shntnu

  • A compound profile can have multiple MOAs, i.e. more than one target label (multi-label classification)
  • All MOAs found in only one compound are grouped under a single MOA label, "unknown moa"
  • Train / test split: ~78% / ~22%
  • 271 MOA labels
  • Level 4 profiles
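
The label setup described in the bullets above can be sketched roughly as follows. This is a minimal illustration, not the code used for these results: the compound names, the MOA names, and the helper functions `collapse_singletons` and `multi_hot` are all hypothetical.

```python
from collections import Counter

# Hypothetical annotations (compound -> list of MOAs); not the issue's data.
compound_moas = {
    "cpd_a": ["hdac inhibitor"],
    "cpd_b": ["hdac inhibitor", "cdk inhibitor"],
    "cpd_c": ["cdk inhibitor"],
    "cpd_d": ["rare moa x"],
    "cpd_e": ["rare moa y", "hdac inhibitor"],
}


def collapse_singletons(annotations, unknown="unknown moa"):
    """Replace every MOA found in exactly one compound with a shared label."""
    # Count the number of distinct compounds annotated with each MOA.
    counts = Counter(m for moas in annotations.values() for m in set(moas))
    collapsed = {}
    for cpd, moas in annotations.items():
        labels = {m if counts[m] > 1 else unknown for m in moas}
        collapsed[cpd] = sorted(labels)
    return collapsed


def multi_hot(annotations):
    """Build one multi-hot target vector per compound over the sorted labels."""
    labels = sorted({m for moas in annotations.values() for m in moas})
    index = {m: i for i, m in enumerate(labels)}
    targets = {}
    for cpd, moas in annotations.items():
        row = [0] * len(labels)
        for m in moas:
            row[index[m]] = 1
        targets[cpd] = row
    return labels, targets


collapsed = collapse_singletons(compound_moas)
labels, targets = multi_hot(collapsed)
print(labels)
print(targets["cpd_e"])
```

In this toy example, a compound with two MOAs gets two 1s in its target vector, which is what makes the task multi-label rather than multi-class.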

image

- Interpretation of the figure above

  • ~ 187 MOAs are found in 2 - 5 compounds
  • ~ 67 MOAs are found in 6 - 15 compounds
  • ~ 10 MOAs are found in 16 - 25 compounds
  • ~ 5 MOAs are found in 26 - 42 compounds
  • 1 MOA (unknown moa) is found in 248 compounds

Overall Model Prediction results

image

- Model predictions with respect to MOA distribution among the 1,398 distinct compounds

image

Top predicted MOAs based on Precision-Recall AUC score

- Cell painting

image

- L1000

image

- Cell painting & L1000

image

30 different MOAs with a higher ROC-AUC score in one profiling assay than in another

image

image

image

- 76 MOAs have a higher ROC-AUC score in Cell painting than in L1000 and integrated Cell painting & L1000 level 4 data

- 49 MOAs have a higher ROC-AUC score in integrated Cell painting & L1000 than in L1000 and Cell painting level 4 data

- 53 MOAs have a higher ROC-AUC score in L1000 than in Cell painting and integrated Cell painting & L1000 level 4 data
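
A tally like the one above could be produced along these lines. This is a sketch under assumptions: the scores are invented, the function name `count_best_assay` is hypothetical, and leaving ties uncredited is one possible convention rather than a statement of how the counts in this issue were computed.

```python
# Hypothetical per-MOA ROC-AUC scores; the values are made up for illustration.
roc_auc = {
    "moa_1": {"Cell painting": 0.81, "L1000": 0.74, "Cell painting & L1000": 0.79},
    "moa_2": {"Cell painting": 0.62, "L1000": 0.88, "Cell painting & L1000": 0.80},
    "moa_3": {"Cell painting": 0.70, "L1000": 0.71, "Cell painting & L1000": 0.90},
    "moa_4": {"Cell painting": 0.70, "L1000": 0.70, "Cell painting & L1000": 0.65},
    "moa_5": {"Cell painting": 0.95, "L1000": 0.60, "Cell painting & L1000": 0.85},
}


def count_best_assay(scores):
    """Count the MOAs for which one assay scores strictly above all others."""
    wins = {}
    for per_assay in scores.values():
        best = max(per_assay.values())
        leaders = [assay for assay, s in per_assay.items() if s == best]
        if len(leaders) == 1:  # assumption: a tie is credited to no assay
            wins[leaders[0]] = wins.get(leaders[0], 0) + 1
    return wins


wins = count_best_assay(roc_auc)
print(wins)
```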

shntnu commented Feb 17, 2021

Thanks @AdeboyeML

  • Please clarify how you did the train/test split: was it 80/20 per MOA (which then works out to be ~78/22)? What do you do with MOAs with few compounds? How do you handle compounds with multiple MOAs?
  • Please plot a graph similar to the first one, reporting the "polypharmacology", i.e. number of MOAs per compound (X = number of MOAs that a compound is annotated with; Y = number of compounds)
  • How do you assign a label? Presumably not a max because that would give you only one label.
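
To illustrate the concern in the last bullet: taking the max of the predicted probabilities yields exactly one label per profile, whereas a per-label threshold can yield several. The sketch below is generic; the probabilities, the 0.5 threshold, and the function names `assign_by_threshold` and `assign_by_max` are assumptions, not the method used in this issue.

```python
def assign_by_threshold(probs, labels, threshold=0.5):
    """Return every label whose predicted probability reaches the threshold."""
    return [label for label, p in zip(labels, probs) if p >= threshold]


def assign_by_max(probs, labels):
    """Return only the single most probable label."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best]


# Hypothetical per-label probabilities for one compound profile.
labels = ["cdk inhibitor", "hdac inhibitor", "unknown moa"]
probs = [0.9, 0.7, 0.1]

multi = assign_by_threshold(probs, labels)  # can contain several labels
single = assign_by_max(probs, labels)       # always exactly one label
print(multi, single)
```
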

We can discuss over a call if that's easier.

@AdeboyeML

@shntnu Yes, we can clarify some of the above bullet points over a Zoom call.

AdeboyeML commented Feb 17, 2021

@shntnu @gwaygenomics

- Polypharmacology

image

| Number of MOAs | Number of compounds |
|---|---|
| 1 | 1176 |
| 2 | 157 |
| 3 | 39 |
| 4 | 14 |
| 5 | 4 |
| 6 | 3 |
| 7 | 3 |
| 10 | 1 |
| 11 | 1 |