Skip to content

Latest commit

 

History

History
23 lines (20 loc) · 802 Bytes

README.md

File metadata and controls

23 lines (20 loc) · 802 Bytes

safepaths-datascience

Repository with various data science projects in the branches (More coming soon)

Data files

Links to several repositories of publicly available data are stored in the data_sources.json with the following structure:

data_sources.json:
    |--Global:
        |--source:link
    |--Nations:
        |--Regions:
            |--Provinces:
                |--source:link

Ingesting algorithms

Ingesting pipelines need to check:

  • file format: csv, json, etc.
  • based on the file format, select the longitude, latitude and timestamp values (where present) (in progress)
  • logging errors (in progress)
  • build a harmonized file containing only those three columns
  • harmonize values (need to avoid doubling of data qhen present from different sources)