How to deal with streaming data? #37
Unanswered
davidgasquez
asked this question in
Q&A
Replies: 1 comment
-
Real time datasets might be better suit for other projects. If we're speaking about public data (science, governments, climate, ...), updating datasets once per day seems reasonable. In theory, we could try doing microbatching each 5 minutes or so for datasets that need that recency. E.g: a GitHub action that scrapes a website every 5 minutes. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Some datasets will need to be updated every few minutes or every hour.
Beta Was this translation helpful? Give feedback.
All reactions