Description.

Author: Mathilde Koch, Faulon's group, INRA.

Description.

The aim of this folder is to regroup all necessary scripts for generating data and models presented in Large scale active-learning-guided exploration to maximize cell-free production by Olivier Borkowski*, Mathilde Koch*, Agnès Zettor, Amir Pandi, Angelo Cardoso Batista, Paul Soudier and Jean-Loup Faulon. Currently available at https://doi.org/10.1101/751669. *: authors contributed equally.

Active learning loop:

This folder contains data both for

generating the active learning loop (learn_and_suggest script, Figures 1c and 1d)
generating model statistics at each iteration (generate_model_statistics, Figure 1e)
genarting first plate for active learning (initial_plate_generation, Figure 1c)

Compound effect analysis:

Contains scripts to analyse the effect of compounds in different lysates with a linear regression or mutual information. (Figure 1g, Supplementary Figure 3)

Controls analysis plates:

This folder contains the data and scripts used to extract absolute yields (ie: compared to lysate of origin) for all other lysates. (Supplementary Figure 4)

ECHO handling scripts:

This folder contains scripts for handling the ECHO machine, from a file of concentrations to test to the instructions to the machine to data extraction and quality control. (No specific Figure)

Multiple extracts analysis:

This folder contains scripts to extract the most informative 20 points to predict 102 other points, as well as various analyses than were ran using those 20 points. (Figure 2 and Supplementary Figure 2e)

Predict for new lysate:

This folder contains scripts to predict yield on unseen lysates to optimise them. (No specific Figure, but same data as Figure 2)

Whole lysate most informative points:

The aim of this folder is to extract the most informative 102 (or 20) points that predict the full 1017 points dataset (Supplementary Figure 2). Functions are similar to the ones that do the same thing in multiple_extract analysis. The difference is the wrapping around the input concentrations to test, or data to compare to.

Requirements:

To run those functions, required packages are:

nb_conda_kernels (for running the jupyter notebook in the correct environment)
jupyter_contrib_nbextensions (tools for better Jupyter notebooks): conda install -c conda-forge jupyter_contrib_nbextensions
numpy (for array handling)
matplotlib (for visualisation)
scikit-learn (for machine learning). Version 0.19.1 has to be used.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
ECHO_handling_scripts		ECHO_handling_scripts
active_learning_loop		active_learning_loop
compounds_effect_analysis		compounds_effect_analysis
controls_analysis_plates		controls_analysis_plates
multiple_extracts_analysis		multiple_extracts_analysis
predict_for_new_lysate		predict_for_new_lysate
whole_lysate_most_informative_points		whole_lysate_most_informative_points
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description.

Active learning loop:

Compound effect analysis:

Controls analysis plates:

ECHO handling scripts:

Multiple extracts analysis:

Predict for new lysate:

Whole lysate most informative points:

Requirements:

About

Releases

Packages

Languages

brsynth/active_learning_cell_free

Folders and files

Latest commit

History

Repository files navigation

Description.

Active learning loop:

Compound effect analysis:

Controls analysis plates:

ECHO handling scripts:

Multiple extracts analysis:

Predict for new lysate:

Whole lysate most informative points:

Requirements:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages