-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP Iceberg integration #47
base: main
Are you sure you want to change the base?
Conversation
Generic Table source integration
|
Test catalog based readstream in Iceberg reads:
|
For catalog compatibility we can use Iceberg Session Catalog to add compatibility with Delta:
|
We have to find a way to standarize tests in local environment between this two options:
For the moment, we are going to test it directly in glue dev environment with the following Spark configuration:
Since we are using iceberg 1.6.1, it's important to take into account this configuration options and not include iceberg in the |
To make the iceberg integration work, we needed to add the iceberg libraries as --extra-jars.
In the previous configuration we received an error of missing warehouse location. Seems like the warehouse is mandatory for Flue to be able to write in a s3 folder as the main catalog:
This property must be configured in Metabolic Spark configuration:
|
Added two changes not related with this PR but needed for test integrity:
|
Add Iceberg support to Metabolic