Skip to content

nodestream pipelines for ingesting ldbc finbench data

Notifications You must be signed in to change notification settings

aaronWass-neo/ldbc_finbench

Repository files navigation

ldbc_finbench

This nodestream project defines ingestion for the LDBC Financial Benchmark (LDBC FINBENCH). Learn more here: https://ldbcouncil.org/benchmarks/finbench/

Getting Started

To get started, first install the required dependencies:

poetry install

Note that this project uses poetry for dependency management. If you don't have poetry installed, you can install it following the instructions here.

Development

If you don't want to keep typing poetry run before every command, you can run the following command to enter a shell with the correct environment:

poetry shell

The samples below assume you are in a poetry shell.

Running Migrations

It is important to run a migration to prepare your database before ingesting any data. To run a migration against your database run the following command: This will run the migration against the aura-db target, which is defind in the nodestream.yaml file

nodestream migrations run -t aura-db

Running Pipelines

To run a pipeline locally, use the following command:

nodestream run $PIPELINE_NAME 

To run all of the pipelines in this project, use the following command: This will run all the pipelines, and send them to the target aura-db, which is defined in the nodestream.yaml file

nodestream run -t aura-db 

Running Tests

To run tests, use the following command:

pytest

About

nodestream pipelines for ingesting ldbc finbench data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published