Skip to content
Van Pham edited this page Jan 31, 2024 · 2 revisions

Welcome to the LinkedMusic Data Lake wiki!

Currently there are 2 main components of the pipeline

  1. Data Processing Pipeline
  2. Virtuoso

Data Processing Pipeline

This is the code in DDMAL/linkedmusic-datalake. The code to retrieve and convert data from individual databases into JSON-LD format. Currently including SIMSSADB, CantusDB, and TheSession.

Virtuoso

The JSON-LD processed previously is loaded into Virtuoso Quad Store. We can perform SPARQL queries and text search with Virtuoso. See the Virtuoso Wiki page for detailed Virtuoso setup instructions and usage guide.

We have a staging Virtuoso instance running at http://virtuoso.staging.simssa.ca. A McGill VPN is needed.