Skip to content

wmcz/wmcz-analytics-service

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 

Repository files navigation

wmcz-analytics-service

Welcome to (an early version of) the WMCZ Analytics service! This repository has backend code which builds and releases various WMCZ related datasets to the world. The code is executed on Wikimedia Toolforge (under a tool called "wmcz").

Implementation details

This repository consists of two parts:

  • generator: has logic for generating datasets that can be generated on Wikimedia Toolforge (from data available directly to Toolforge, or data that can be downloaded to Toolforge)
  • orchestrator: orchestrates the data generation and updates the public page with all the datasets

Both are covered in more detail below.

generator

This part generates datasets using data that are available to Wikimedia Toolforge. Right now, there is only one task implemented in this way:

  • dashboard-data: gathers data about WMCZ's courses from the Outreach Dashboard

orchestrator

The orchestrator orchestrates the process of gathering all WMCZ-related data and publishing them via https://wmcz.toolforge.org/datasets/. List of available datasets is available below.

Available datasets

dashboard-data

Data about Wikimedia Czech Republic's courses, as contained in the Outreach Dashboard. This does not include data from the education program extension era yet.

Aggregated data about individual courses are gathered by generator (contained in this repo; see above for details). Data about individual edits are computed in internal Wikimedia Foundation's analytics cluster, and the code is available in wmcz-analytics-wmf-cluster.

About

Code of future WMCZ analytics service

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published