Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create prediction pipeline #93

Open
5 tasks
Tracked by #91
emmamendelsohn opened this issue Jun 28, 2024 · 0 comments
Open
5 tasks
Tracked by #91

Create prediction pipeline #93

emmamendelsohn opened this issue Jun 28, 2024 · 0 comments

Comments

@emmamendelsohn
Copy link
Collaborator

emmamendelsohn commented Jun 28, 2024

Some of these steps can use/adapt existing functions from the training pipeline.

  • Download all data for past three months, modifying the existing data download functions to download new data only.
  • Transform data, including steps to scale to 0.1 degrees, calculate lagged anomalies against stored historical values.
  • Augment and aggregate into ADM regions
  • Run predictions using stored model object
  • Return a shapefile

See REPEL2 for the logic we used to create a data processing pipeline for prediction purposes, separate from the model training pipeline: https://github.com/ecohealthalliance/repel2/blob/dev/tar_plans/_targets_prediction_livestock.R

Note that the REPEL pipeline does not provide an example of any logic around downloading only new data, because the downloads were quick enough and the update period infrequent enough (every 6 months) that it was worth just doing a full download. For OpenRVF, we have bigger data and more frequent (monthly) updating needs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant