You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Install git either through conda (“conda install git”) or through your CLI, e.g. sudo apt-get git.
8
8
9
9
## Repositories
10
-
* Create a directory for the project and initialize git and dvc.
11
-
* As you work on the code, continually commit changes. Generated models you want to keep must be committed to dvc.
10
+
* Create a directory for the project and initialize git.
11
+
* As you work on the code, continually commit changes. Trained models you want to use in production must be committed to GitHub.
12
12
* Connect your local git repo to GitHub.
13
13
* Setup GitHub Actions on your repo. You can use one of the pre-made GitHub Actions if at a minimum it runs pytest and flake8 on push and requires both to pass without error.
14
14
* Make sure you set up the GitHub Action to have the same version of Python as you used in development.
15
-
* Set up a remote repository for dvc.
16
15
17
16
# Data
18
17
* Download census.csv and commit it to dvc.
19
18
* This data is messy, try to open it in pandas and see what you get.
20
19
* To clean it, use your favorite text editor to remove all spaces.
21
-
* Commit this modified data to dvc (we often want to keep the raw data untouched but then can keep updating the cooked version).
22
20
23
21
# Model
24
22
* Using the starter code, write a machine learning model that trains on the clean data and saves the model. Complete any function that has been started.
0 commit comments