-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[AIRFLOW DAG] Update covid ch dashboard #148
Comments
@eduardocorrearaujo could you provide more information about the DAG, for example:
|
what would be the expected result? are the modules/methods for the dag already written? if so, where can it be found? when should the DAG run? daily? weekly? triggered by another DAG? |
I've created a template with the start and end tasks (triggered by |
Another possible issue concerning this DAG is that I train the models in my machine before applying the models. So, we should define a time to retrain the models periodically with new data. Do you think 2 months is a good time interval?? |
I think if you wanna use the CSV with the data that is going to DB before being deleted, just delete the task |
Sorry, I'm not quite sure if I understood the problem here |
Yes. The models don't need to be re-trained very often. Only the prediction has to be generated every week. |
The DAG could still be triggered by a external task and have a 2 months interval at the same time, the version would be the timestamp of the dag run, something lilke: |
@fccoelho @luabida the data collection code is saved into the path: |
From my point of view, |
With apps I refer to the fact that this code is related to a dashboard application created by epigraphhub. So this code is not general. The functions used in this script already come from epigraphhub.analysis.forecast_models.ngboost_models (which is general) |
Any code that is specific to a single dashboard does not belong in the library. Keep that in mind. |
In this case, I don't know where I should put this code, since in the |
Code that is specific to the dashboard and nothing more can stay in the dashboard repo. |
But, the code should be used by the airflow dags. How could I import it if the code is saved in the COVID-CH-dashboard? |
In that case, it can live in a standalone executable script that is run by the DAG. For that, the Airflow container may need to mount an external directory with this and other such scripts |
ok |
DAG Description
It's necessary to create dags to weekly update the results of the dashboard:
https://epigraphhub.org/covidch/
Basic Workflow
what would be the expected result?
This dag would pull the data from the epigraphhub database and apply some machine learning models in this data to forecast it. After making the forecast the dag should upload the data frame with the forecasted values in the database.
are the modules/methods for the dag already written? if so, where can it be found?
This method uses some functions in the epigraphhub_py package. But I already have the scripts written on my personal machine.
when should the DAG run? daily? weekly? triggered by another DAG?
This DAG should be run after the dag that uploads the foph tables.
More info
No response
The text was updated successfully, but these errors were encountered: