-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unclear example of export to datahub #198
Comments
Thanks for the kind words. You’ll have to specify “path” which has to
contain the path to the SQLite file.
…On Tue, 1 Nov 2022 at 21:29, Ian Lewis ***@***.***> wrote:
Loving PiiCatcher but I'm having some trouble pushing PII data to Datahub.
I'm following the instructions here
https://tokern.io/docs/catalog/export/#installation and here
https://tokern.io/docs/catalog/export/#basic-recipe.
The problem seems to be when reading my Sqllite database (which definitely
has data).
Original Scan
$ piicatcher scan mysql --name mands_test_cat --database testdb --port
3306 --uri ***.rds.amazonaws.com --username *** --password ****
The sqllite file is called None and has the following structure:
None
-> main
-> alembic_version
-> column_lineage
-> columns
-> default_schema
-> job_executions
-> jobs
-> schemata
-> sources
-> sqlite_master
-> tables
-> tasks
This is my current export.yml file which partially works (talks to
Datahub as expected)
type: dbcat.datahub.CatalogSource
config:
database: main
source_names:
- mands_prod_cat
sink:
type: "datahub-rest"
config:
server: "http://localhost:8080"
On running $ datahub ingest -c ./export.yml I receive the following error
message.
sqlite3.OperationalError: no such table: sources
Do you have any suggestions for how I can approach this?
—
Reply to this email directly, view it on GitHub
<#198>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAMP7GURSEXOIQI4SO7EJUTWGE47RANCNFSM6AAAAAARUGG7MQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Hi @vrajat, My understanding from the Datahub engineering team is that they do not support piicatcher input. Obviously, if you have a working solution it would be great to see it. I am now able to run the However the import to Datahub is unsuccessful and tags are not applied. Here is an example of the errors I see:
|
@yjagdale will you be able to help? |
Sure @vrajat will have look at it! |
Loving PiiCatcher but I'm having some trouble pushing PII data to Datahub.
I'm following the instructions here https://tokern.io/docs/catalog/export/#installation and here https://tokern.io/docs/catalog/export/#basic-recipe.
The problem seems to be when reading my Sqllite database (which definitely has data).
Original Scan
$ piicatcher scan mysql --name mands_test_cat --database testdb --port 3306 --uri ***.rds.amazonaws.com --username *** --password ****
The sqllite file is called
None
and has the following structure:This is my current
export.yml
file which partially works (talks to Datahub as expected)On running
$ datahub ingest -c ./export.yml
I receive the following error message.Do you have any suggestions for how I can approach this?
The text was updated successfully, but these errors were encountered: