-
Notifications
You must be signed in to change notification settings - Fork 227
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Complex types are truncated during describe extended
#1107
Comments
@benc-db This is the issue we were talking about yesterday about the issues with the Databricks Metadata API. Is this just a Databricks specific issue? |
It is Databricks specific, but may affect dbt-spark as well. |
lol, I didn't see where I was commenting. So, I do not know the extent to which describe extended is standard Spark vs Databricks, which is probably what you're asking here. |
@benc-db yup :) @mikealfare did you find this bug running on Databricks then? |
@amychen1776 Apologies for the late reply; my GH notifications have been out of control. I believe this was reported by a Cloud customer that was running |
I'll summarize here what I'm doing in dbt-databricks: in 1.9 I'm introducing a behavior flag to use information schema to get column types for UC tables. The reason I'm guarding with a flag is because I learned in testing that information schema is not always synced up with reality, and to ensure that it is, I have run a repair table operation before gathering columns. This adds overhead. I'm hopeful that I can remove the flag when sync gets better for information schema, because in my testing, I hit columns missing between successive dbt runs that took on the order of minutes...too long for me to feel comfortable about trusting it for this. |
Hi, not sure if I encountered the same issue. I got runtime error when adding a struct column to an incremental model on dbt-spark. Here's the error.
It seems the data type read in |
This impacts unit testing as well. I can't provide test values for my complex type because the I'm not using Databricks. |
Is this a new bug in dbt-spark?
Current Behavior
Complex types are truncated when running this macro:
dbt-spark/dbt/include/spark/macros/adapters.sql
Lines 281 to 286 in 3fc624c
This happens due to
DESCRIBE EXTENDED
, which truncates the results before returning them.Expected Behavior
The types should be complete.
Steps To Reproduce
DESCRIBE EXTENDED my_model
Relevant log output
No response
Environment
Additional Context
No response
The text was updated successfully, but these errors were encountered: