Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
143 commits
Select commit Hold shift + click to select a range
943e48c
BYOModel Refactor: Proposal for dataset and task changes (#269)
mlgill Jun 20, 2025
e2804c4
feat: Remove Models and Docker code
saugupta1 Jul 3, 2025
5a69cc8
removed cli and runner
saugupta1 Jul 7, 2025
f34ff6f
removed cli test cases
saugupta1 Jul 7, 2025
4b40c90
remove build and docs for model
saugupta1 Jul 8, 2025
92d3626
removed scripts
saugupta1 Jul 8, 2025
0acfbcc
Merge branch 'main' into v1.0-byomodel
atolopko-czi Jul 9, 2025
f2b3c22
more removal of model-related docs (#304)
atolopko-czi Jul 9, 2025
115d577
Addressed review comments and code formatting
saugupta1 Jul 9, 2025
0c5a3d6
restore CLI as is - not working
saugupta1 Jul 9, 2025
7685352
CLI list working. Model execution commented
saugupta1 Jul 9, 2025
29a1c9c
cli - removed model execution related commented code
saugupta1 Jul 10, 2025
b8d169c
Merged v1.0-byomodel-remove-models into v1.0-byomodel, resolved confl…
saugupta1 Jul 10, 2025
75f66f9
minor updates in index.rst
saugupta1 Jul 10, 2025
425fd2d
feat: 308 improve usability of datasets (#311)
saugupta1 Jul 11, 2025
ff48346
feature: datasets write task inputs to files (#299)
atolopko-czi Jul 12, 2025
3e48818
feat: Remove Base prefixes from classes and rename files (#318)
atolopko-czi Jul 15, 2025
97b2c4f
task: v1.0 byomodel dataset and task improvements (#313)
mlgill Jul 15, 2025
b3c5f76
Align model inputs and outputs (#327)
saugupta1 Jul 16, 2025
68ef972
chore: reverted #307 (#336)
saugupta1 Jul 21, 2025
c662dc0
docs: examples updated, added model fine tuning (#330)
saugupta1 Jul 21, 2025
ae22852
docs: verified and updated docstrings for accuracy and better API doc…
saugupta1 Jul 21, 2025
f2ef311
tests: e2e task regression tests (#331)
atolopko-czi Jul 28, 2025
428b55b
feat: Added few test cases. Validated existing test cases for new cod…
saugupta1 Jul 28, 2025
84efc5a
feat: updated cli (#328)
saugupta1 Jul 31, 2025
6dfa716
docs: docs updated for byomodel release (#337)
saugupta1 Jul 31, 2025
5b34f8f
docs: 340 add update notebook examples (#341)
saugupta1 Jul 31, 2025
43e42df
feat: Updates to single cell perturbation datasets and related classe…
mlgill Aug 19, 2025
94be5f4
Remove other perturb datasets and split key
mlgill Aug 4, 2025
5f3078a
Temporarily deprecate older perturbseq datasets
mlgill Aug 6, 2025
c82caf3
Ready to start testing
mlgill Aug 7, 2025
2444be3
Dataset directory update
mlgill Aug 7, 2025
6c6be46
Debugging
mlgill Aug 7, 2025
a3f220f
Dataset load_data runs to completion
mlgill Aug 7, 2025
d702ca0
nit
mlgill Aug 7, 2025
3e45fb0
Formatting and improving pandas efficiency
mlgill Aug 7, 2025
ebe13b7
Update
mlgill Aug 8, 2025
4e28b5a
Fix utils tests
mlgill Aug 8, 2025
40a7e82
No more split columns
mlgill Aug 8, 2025
4e9c89a
store_task_inputs works
mlgill Aug 8, 2025
49ffef9
10 genes
mlgill Aug 8, 2025
fd9e630
Fixing test
mlgill Aug 8, 2025
0997cad
Joblib runs
mlgill Aug 8, 2025
d998fc1
Remove dual condition, validation works
mlgill Aug 11, 2025
525d136
Tests pass
mlgill Aug 11, 2025
c8fafe1
Linting
mlgill Aug 11, 2025
26023c3
Add example script
mlgill Aug 11, 2025
88ccd7f
update
mlgill Aug 11, 2025
b75b414
nit
mlgill Aug 11, 2025
df770a2
Docstring updates for errors
mlgill Aug 11, 2025
8c5630c
Fix dimension
mlgill Aug 11, 2025
022020c
Fixed docstring formatting in API
mlgill Aug 12, 2025
32c82f1
Remove joblib since it doesn't accelerate results
mlgill Aug 13, 2025
5d5716f
nit
mlgill Aug 13, 2025
2b6ce1f
Add gene masking as an input param
mlgill Aug 13, 2025
a71f3c2
Cleanup joblib imports
mlgill Aug 13, 2025
46a9bdf
Update filters
mlgill Aug 13, 2025
4e2462c
WIP updates to stored inputs
mlgill Aug 13, 2025
3431015
Fixed tests
mlgill Aug 13, 2025
1a4b36b
Clean up FIXMEs
mlgill Aug 14, 2025
04c77fc
Bug fix and expose min_de_genes
mlgill Aug 14, 2025
2dc57bc
Fix
mlgill Aug 14, 2025
be2a6ef
Expose them all
mlgill Aug 14, 2025
095be3a
FIXME
mlgill Aug 14, 2025
a62c76d
Fix path
mlgill Aug 14, 2025
c381111
adding a simple perturbation task
polinabinder1 Aug 4, 2025
714ad1f
adding metric + code cleanup
polinabinder1 Aug 4, 2025
47ae03b
files that were mistakenly added
polinabinder1 Aug 4, 2025
a2ab68b
adding new metrics
polinabinder1 Aug 4, 2025
c18655d
fixing variable naming
polinabinder1 Aug 4, 2025
2522c7c
adding a test case
polinabinder1 Aug 4, 2025
b1069e9
addressing some of the PR comments
polinabinder1 Aug 5, 2025
92e3d8c
accepting any name
polinabinder1 Aug 5, 2025
6d27580
redoing the task
polinabinder1 Aug 7, 2025
05c6ccb
adding a test to compare output to perturbation_expression_prediction
polinabinder1 Aug 7, 2025
e5f5083
adding a test to compare output to perturbation_expression_prediction
polinabinder1 Aug 7, 2025
8357731
completing test case for comparing to Maria's notebook
polinabinder1 Aug 8, 2025
f0bfc22
renaming print to logging
polinabinder1 Aug 11, 2025
57dc305
making sure tests are properly merged
polinabinder1 Aug 11, 2025
e900c2b
Reset file to match michelle/v1.0-byomodel-replogle_dataset
polinabinder1 Aug 11, 2025
e98b0a5
removing some file changes
polinabinder1 Aug 11, 2025
9ca8784
works with new peturbseq
polinabinder1 Aug 13, 2025
c4c3bb1
merge and some code clean up
polinabinder1 Aug 13, 2025
00bf14b
adding baseline and tests for perturbation prediction
polinabinder1 Aug 13, 2025
9d2f7c5
adding loading from dataset
polinabinder1 Aug 13, 2025
e2b37ad
Update metric type
mlgill Aug 13, 2025
f1780b1
Clean up path
mlgill Aug 13, 2025
e0426e3
adding test of dataset equivelancy
polinabinder1 Aug 14, 2025
478fef1
slightly changing naming + fix to test
polinabinder1 Aug 14, 2025
736c1f2
adding t-test
polinabinder1 Aug 14, 2025
bb031e0
t-test works for dataset, small fix to perturbation dataset
polinabinder1 Aug 14, 2025
c2857a7
changes to perturbation equivelancy
polinabinder1 Aug 14, 2025
687ca89
Fix column name
mlgill Aug 15, 2025
b1733c8
making sure tests are properly merged
polinabinder1 Aug 11, 2025
d585819
Reset file to match michelle/v1.0-byomodel-replogle_dataset
polinabinder1 Aug 11, 2025
9c73881
t-test works for dataset, small fix to perturbation dataset
polinabinder1 Aug 14, 2025
c5110c0
chore: CCIE-4332 use GH_ACTIONS_HELPER_* (#310)
wontonst Jul 14, 2025
bf2f3e1
chore(main): release 0.10.1 (#291)
czi-github-helper[bot] Jul 18, 2025
67e9cfc
fix: fix UCE docker image build by switching to miniforge due to cond…
steveherrin Jul 18, 2025
f6a2b6e
chore(main): release 0.10.2 (#334)
czi-github-helper[bot] Jul 18, 2025
d97cdcc
Duplicated lines
mlgill Aug 15, 2025
b1a731f
fixing a small bug
polinabinder1 Aug 15, 2025
27b3634
fixes
polinabinder1 Aug 15, 2025
3913827
addressing PR + tests
polinabinder1 Aug 19, 2025
91d3be1
update latest equivalency dataset script
mlgill Aug 19, 2025
74173f1
changes to cell perturbation task
polinabinder1 Aug 19, 2025
ae43040
test perturb task
polinabinder1 Aug 19, 2025
d08b591
Two paths we don't need to fix. :)
mlgill Aug 19, 2025
3eb44c0
Fix rebase issue
mlgill Aug 19, 2025
e413b8b
removing dataset + print metrics
polinabinder1 Aug 19, 2025
bdf18bc
undoing rebase silliness
polinabinder1 Aug 19, 2025
bbb7763
tests pass
polinabinder1 Aug 19, 2025
5a618b1
adding save/load functionality
polinabinder1 Aug 19, 2025
9766c19
Fix scipy version undone during merge issues
mlgill Aug 19, 2025
5f8195c
WIP
mlgill Aug 19, 2025
3557998
array fix
mlgill Aug 19, 2025
83a7c8d
fixes
polinabinder1 Aug 20, 2025
0a9d0b3
deleting file
polinabinder1 Aug 20, 2025
59759d6
Merge branch 'feat/perturbation_task' into feat/perturbation_task_val…
polinabinder1 Aug 20, 2025
1f11a16
lint
polinabinder1 Aug 20, 2025
01efdcf
lint fix2
polinabinder1 Aug 20, 2025
beac74c
remove extra tests
polinabinder1 Aug 20, 2025
0e47ed8
adding text to example
polinabinder1 Aug 20, 2025
dac36b9
small fix
polinabinder1 Aug 20, 2025
ce0da99
dataset comparisons
polinabinder1 Aug 20, 2025
3e56862
small test fix
polinabinder1 Aug 20, 2025
31467eb
Merge branch 'feat/perturbation_task' into feat/perturbation_task_val…
polinabinder1 Aug 20, 2025
e6ffe0e
lint
polinabinder1 Aug 20, 2025
7cbe8de
Fix uv.lock
mlgill Aug 20, 2025
1cc7f4d
fixes to test file names
polinabinder1 Aug 20, 2025
311c0e4
some fixes
polinabinder1 Aug 20, 2025
299b817
remove uv.lock
polinabinder1 Aug 20, 2025
19c6c7c
Merge branch 'feat/perturbation_task' into feat/perturbation_task_val…
polinabinder1 Aug 20, 2025
d8fafa5
metric changes
polinabinder1 Aug 20, 2025
8a243cb
Merge branch 'feat/perturbation_task' into feat/perturbation_task_val…
polinabinder1 Aug 20, 2025
4363e1f
tests run
polinabinder1 Aug 20, 2025
ae45525
tests run
polinabinder1 Aug 20, 2025
b8a36c3
Integration test for `SingleCellPerturbationDataset` and `Perturbatio…
mlgill Aug 22, 2025
3cb9ccc
Update examples/example_perturbation_expression_prediction.py
polinabinder1 Aug 22, 2025
72e8a2a
Update src/czbenchmarks/tasks/single_cell/perturbation_expression_pre…
polinabinder1 Aug 22, 2025
de6961a
addressing PR
polinabinder1 Aug 22, 2025
d4a1349
Merge branch 'feat/perturbation_task' into feat/perturbation_task_val…
polinabinder1 Aug 23, 2025
c833140
fixing task
polinabinder1 Aug 23, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
91 changes: 0 additions & 91 deletions .github/workflows/docker-build.yml

This file was deleted.

10 changes: 5 additions & 5 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -43,11 +43,11 @@ cover/
*.ipynb_checkpoints/

# Environments
.env
.venv
env/
venv/
ENV/
.env*
.venv*
env*/
venv*/
ENV*/
env.bak/
venv.bak/

Expand Down
45 changes: 4 additions & 41 deletions Makefile
Original file line number Diff line number Diff line change
@@ -1,43 +1,8 @@
# Default target
# Default target to run tests and code checks before commit.
.PHONY: all
all: scvi uce scgpt scgenept geneformer transcriptformer aido
all: test lint mypy-checkmake

# Build the scvi image
.PHONY: scvi
scvi:
docker build -t cz-benchmarks-models:scvi -f docker/scvi/Dockerfile .

# Build the uce image
.PHONY: uce
uce:
docker build -t cz-benchmarks-models:uce -f docker/uce/Dockerfile .

# Build the scgpt image
.PHONY: scgpt
scgpt:
docker build -t cz-benchmarks-models:scgpt -f docker/scgpt/Dockerfile .

# Build the scgenept image
.PHONY: scgenept
scgenept:
docker build -t cz-benchmarks-models:scgenept -f docker/scgenept/Dockerfile .

# Build the geneformer image
.PHONY: geneformer
geneformer:
docker build -t cz-benchmarks-models:geneformer -f docker/geneformer/Dockerfile .

# Build the geneformer image
.PHONY: aido
aido:
docker build -t cz-benchmarks-models:aido -f docker/aido/Dockerfile .

# Build the transcriptformer image
.PHONY: transcriptformer
transcriptformer:
docker build -t cz-benchmarks-models:transcriptformer -f docker/transcriptformer/Dockerfile .

# Clean up images
# Clean up model images generated by the czbenchmarks in version 0.9
.PHONY: clean
clean:
docker rmi cz-benchmarks-models:scvi || true
Expand All @@ -46,9 +11,7 @@ clean:
docker rmi cz-benchmarks-models:scgenept || true
docker rmi cz-benchmarks-models:geneformer || true
docker rmi cz-benchmarks-models:transcriptformer || true
# Helper target to rebuild everything from scratch
.PHONY: rebuild
rebuild: clean all
docker rmi cz-benchmarks-models:aido || true

# Run all unit tests
.PHONY: test
Expand Down
5 changes: 2 additions & 3 deletions README-pypi.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,10 @@
⚠️ **Warning:** Repository under active development and is in the alpha phase of development, subject to major refactors as outlined in the public-facing [roadmap](https://chanzuckerberg.github.io/cz-benchmarks/roadmap.html).

### What is cz-benchmarks?
cz-benchmarks is a package for standardized evaluation and comparison of machine learning models for biological applications (first, in the single-cell transcriptomics domain, with future plans to expand to additional domains). The package provides a toolkit for running containerized models, executing biologically-relevant tasks, and computing performance metrics. We see this tool as a step towards ensuring that large-scale AI models can be harnessed to deliver genuine biological insights -- by building trust, accelerating development, and bridging the gap between ML and biology communities.
cz-benchmarks is a package for standardized evaluation and comparison of machine learning models for biological applications (first, in the single-cell transcriptomics domain, with future plans to expand to additional domains). The package provides a toolkit for running containerized models, executing biologically-relevant tasks, and computing performance metrics. We see this tool as a step towards ensuring that large-scale ML Models can be harnessed to deliver genuine biological insights -- by building trust, accelerating development, and bridging the gap between ML and biology communities.

### Why benchmarking? Why now?
Last year, CZI hosted a workshop focused on benchmarking and evaluation of AI models in biology, and the [insights gained](https://virtualcellmodels.cziscience.com/micro-pub/benchmarking-workshop) have reinforced our commitment to supporting the development of a robust benchmarking infrastructure, which we see as critical to achieving our Virtual Cell vision.
Last year, CZI hosted a workshop focused on benchmarking and evaluation of ML Models in biology, and the [insights gained](https://virtualcellmodels.cziscience.com/micro-pub/benchmarking-workshop) have reinforced our commitment to supporting the development of a robust benchmarking infrastructure, which we see as critical to achieving our Virtual Cell vision.

### 💬 Community Feedback & Contributions
We're working to get the alpha version of cz-benchmarks stable to build with the community. In the meantime, for issues you may identify, feel free to open an issue on GitHub or reach out to us at [[email protected]](mailto:[email protected]).
Expand All @@ -22,7 +22,6 @@ To get started with `cz-benchmarks`, refer to the [Quick Start Guide](https://

- [How To Guides](https://chanzuckerberg.github.io/cz-benchmarks/how_to_guides/index.html)
- [Add a Custom Dataset](https://chanzuckerberg.github.io/cz-benchmarks/how_to_guides/add_custom_dataset.html)
- [Add a Custom Model](https://chanzuckerberg.github.io/cz-benchmarks/how_to_guides/add_custom_model.html)
- [Developer Guides](https://chanzuckerberg.github.io/cz-benchmarks/developer_guides/index.html)
- [API Reference](https://chanzuckerberg.github.io/cz-benchmarks/api_reference.html)
- [Assets](https://chanzuckerberg.github.io/cz-benchmarks/assets.html)
Expand Down
6 changes: 2 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,10 @@
⚠️ **Warning:** Repository under active development and is in the alpha phase of development, subject to major refactors as outlined in the public-facing [roadmap](docs/source/roadmap.md).

### What is cz-benchmarks?
cz-benchmarks is a package for standardized evaluation and comparison of machine learning models for biological applications (first, in the single-cell transcriptomics domain, with future plans to expand to additional domains). The package provides a toolkit for running containerized models, executing biologically-relevant tasks, and computing performance metrics. We see this tool as a step towards ensuring that large-scale AI models can be harnessed to deliver genuine biological insights -- by building trust, accelerating development, and bridging the gap between ML and biology communities.
cz-benchmarks is a package for standardized evaluation and comparison of machine learning models for biological applications (first, in the single-cell transcriptomics domain, with future plans to expand to additional domains). The package provides a toolkit for running containerized models, executing biologically-relevant tasks, and computing performance metrics. We see this tool as a step towards ensuring that large-scale ML Models can be harnessed to deliver genuine biological insights -- by building trust, accelerating development, and bridging the gap between ML and biology communities.

### Why benchmarking? Why now?
Last year, CZI hosted a workshop focused on benchmarking and evaluation of AI models in biology, and the [insights gained](https://virtualcellmodels.cziscience.com/micro-pub/benchmarking-workshop) have reinforced our commitment to supporting the development of a robust benchmarking infrastructure, which we see as critical to achieving our Virtual Cell vision.
Last year, CZI hosted a workshop focused on benchmarking and evaluation of ML Models in biology, and the [insights gained](https://virtualcellmodels.cziscience.com/micro-pub/benchmarking-workshop) have reinforced our commitment to supporting the development of a robust benchmarking infrastructure, which we see as critical to achieving our Virtual Cell vision.

### 💬 Community Feedback & Contributions
We're working to get the alpha version of cz-benchmarks stable to build with the community. In the meantime, for issues you may identify, feel free to open an issue on GitHub or reach out to us at [[email protected]](mailto:[email protected]).
Expand All @@ -21,7 +21,6 @@ We're working to get the alpha version of cz-benchmarks stable to build with the

### How-To Guides
- [Add a Custom Dataset](docs/source/how_to_guides/add_custom_dataset.md)
- [Add a Custom Model](docs/source/how_to_guides/add_custom_model.md)
- [Add a New Metric](docs/source/how_to_guides/add_new_metric.md)
- [Add a New Task](docs/source/how_to_guides/add_new_task.md)
- [Interactive Mode](docs/source/how_to_guides/interactive_mode.md)
Expand All @@ -30,7 +29,6 @@ We're working to get the alpha version of cz-benchmarks stable to build with the
### Developer Guides
- [Datasets](docs/source/developer_guides/datasets.md)
- [Metrics](docs/source/developer_guides/metrics.md)
- [Models](docs/source/developer_guides/models.md)
- [Tasks](docs/source/developer_guides/tasks.md)
- [Debugging](docs/source/developer_guides/debugging.md)
- [Release Process](docs/source/developer_guides/release_process.md)
Expand Down
65 changes: 0 additions & 65 deletions docker/README.md

This file was deleted.

39 changes: 0 additions & 39 deletions docker/aido/Dockerfile

This file was deleted.

1 change: 0 additions & 1 deletion docker/aido/config.yaml

This file was deleted.

Loading