-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Update the table_type field to table_format #729
Comments
@amychen1776 some historical context on why we used table_type. There is another paramter called:
We dindn't want the final user to be confused about format vs table_format, I do believe that using table_format and having still another parameter called format is really confusing. Also to be clear, table_type specify the type of table in the context of Athena, that can be Hive (external) or Iceberg(open table format) - the name come from AWS itself actually, and it's an extention of it, see here
The config format, on the other hand, specify how the data is going to be written in S3: parquet, avro, json and so on. IMHO using the naming table_format is not exactly correct, and leads to ambiguity - I just want to give my user prospective on this. |
Thank you @nicor88 - I appreciate the input! I think the standardization should look like this: table_format = iceberg, hive, delta This would align with the broader industry standards on what to call all of these. My goal with this issue is create a shared language across all the adapters so that we have a standard way of referring to this. This also improves adapter maintenance for someone who oversees multiple. We can also look into aliasing which we have done for past things like BQ with projects rather than database so that it's the same on the backend but the user has something different. In this situation - I'm not quite seeing the same exactly validation that we should do this |
Using After this refactor is in, we should aim to bump the minor version to "signal" the breaking change. |
Good - that's exactly why we want to make this more properly supported so it's more clear. |
Is this your first time submitting a feature request?
Describe the feature
Right now the iceberg table configuration calls the table format field as table_type. Let's standardize to
table_format
Describe alternatives you've considered
N/A
Who will this benefit?
No response
Are you interested in contributing this feature?
No response
Anything else?
No response
The text was updated successfully, but these errors were encountered: