You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+88-4
Original file line number
Diff line number
Diff line change
@@ -12,12 +12,89 @@ In `/training/index` you can find all the index files for the different subsets
12
12
The `/training/runs` folder holds all profiles and training output from each experiment.
13
13
14
14
## Folder structure
15
+
```commandline
16
+
.
17
+
├── LINCS_example_data
18
+
│ ├── inputs
19
+
│ │ ├── config
20
+
│ │ ├── images
21
+
│ │ ├── locations
22
+
│ │ └── metadata
23
+
│ └── outputs
24
+
├── baseline
25
+
│ ├── 01_data
26
+
│ │ ├── level_3_data
27
+
│ │ └── level_5_data
28
+
│ ├── 02_analysis
29
+
│ └── thesis
30
+
├── chtc
31
+
│ ├── DP_0.3.0
32
+
│ │ ├── aggregate
33
+
│ │ ├── checking
34
+
│ │ ├── profile
35
+
│ │ ├── sampling
36
+
│ │ └── train
37
+
│ ├── helper_functions
38
+
│ └── old_DP
39
+
│ ├── aggregate
40
+
│ ├── checking
41
+
│ ├── exporting
42
+
│ ├── profile
43
+
│ └── train
44
+
├── docker
45
+
│ ├── 0.3.0
46
+
│ └── old_versions
47
+
├── hit_k
48
+
├── pre-trained
49
+
│ ├── ResNet50v2
50
+
│ │ ├── aggregated
51
+
│ │ └── post_processing
52
+
│ ├── data-prep
53
+
│ │ ├── 01_location_extraction
54
+
│ │ └── 02_index_preperation
55
+
│ ├── efficient_net
56
+
│ │ ├── aggregated
57
+
│ │ └── post_processing
58
+
│ └── thesis
59
+
├── training
60
+
│ ├── aggregation
61
+
│ ├── index
62
+
│ │ └── sc-metadata
63
+
│ ├── prediction_analysis
64
+
│ │ └── 819
65
+
│ ├── results
66
+
│ │ └── accuracy
67
+
│ └── runs
68
+
│ ├── 1003
69
+
...
70
+
│ └── 931
71
+
└── utils
72
+
```
15
73
74
+
## Description of the repository content
75
+
### `basline/`
76
+
The first part of the project gathers CellProfiler profiles from the LINCS repository and compares them.
77
+
A general overview of the data and of the subselection is found here!
78
+
If you want to compare metrics with my data, you need to follow the steps in `baseline/02_analysis/02_clean_data.ipynb`.
16
79
17
-
## Important random things
18
-
- Some information in this repository may be old since the DeepProfiler versions changed midway through the project
19
-
- The single cell crops of all 18 million cells within the LINCS subsection can be found on S3: `s3://jump-cellpainting/projects/2015_10_05_DrugRepurposing_AravindSubramanian_GolubLab_Broad/workspace/deep_learning/outputs/1017_sc/`
20
-
- If you can't reach Michael Bornholdt, try to reach Shantanu Singh.
80
+
### `pre-trained/`
81
+
The two pre-trained nets are compared here and create the baseline for the trained neural networks.
82
+
The best pipeline for deep learning features is determined.
83
+
84
+
### `training/`
85
+
All experiments live here.
86
+
The experiments are different models trained with different hyperparameters and data.
87
+
A full analysis of the resulting profiles can be found in the `training/results/` folder.
88
+
89
+
### `chtc/` and `docker/`
90
+
These folders hold important scripts for setting up and running DeepProfiler on a server.
91
+
92
+
### `hit_k`
93
+
This folder contains the development code of the hit@k metric. Now on Cyto-eval/
94
+
95
+
### `LINCS_example_data/`
96
+
A small subsection of the LINCS data allows to test and learn DP.
97
+
Alternatively used example data from DP Github.
21
98
22
99
23
100
## Experimental data on S3
@@ -32,3 +109,10 @@ The `/training/runs` folder holds all profiles and training output from each exp
- Some information in this repository may be old since the DeepProfiler versions changed midway through the project
116
+
- The single cell crops of all 18 million cells within the LINCS subsection can be found on S3: `s3://jump-cellpainting/projects/2015_10_05_DrugRepurposing_AravindSubramanian_GolubLab_Broad/workspace/deep_learning/outputs/1017_sc/`
117
+
- If you can't reach Michael Bornholdt, try to reach Shantanu Singh.
0 commit comments