Redact values from logs due 'duplicate key' error #773

jaypeedevlin · 2023-09-14T05:11:37Z

resolves #772
Docs N/A

Assuming this is merged, it would be amazing if this could be backported into 1.6.x so we can take advantage of it in our org without having to go through a minor version upgrade (which involves a lot of work at our scale).

Problem

We are leaking raw data into logs in certain cases.

Solution

Reuses the existing approach, placing the code in a loop so that any future cases that arise can be handled the same way.
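
A minimal sketch of the looping approach described above, not the code from this PR. The pattern table, the sample error text, and the names `REDACTION_PATTERNS` and `redact` are assumptions made for illustration; the reviewed change itself is in dbt/adapters/snowflake/connections.py.

```python
import re

# Hypothetical pattern table: each compiled regex maps to its redacted replacement.
# Handling a future case means adding one entry here, not another copy of the method.
REDACTION_PATTERNS = {
    re.compile(r"Row Values: \[.*\]", re.DOTALL): "Row Values: [redacted]",
    re.compile(r"Duplicate field key '.*'", re.DOTALL): "Duplicate field key '[redacted]'",
}


def redact(message: str) -> str:
    """Apply every redaction pattern in turn to an error message."""
    for pattern, replacement in REDACTION_PATTERNS.items():
        message = pattern.sub(replacement, message)
    return message


# Illustrative error text only; real database messages may be worded differently.
raw = "Error parsing JSON: Duplicate field key 'customer_secret'"
clean = redact(raw)
print(clean)
```

Keeping the patterns in one mapping means the loop body never changes; only the table grows when a new leak is found.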

Checklist

I have read the contributing guide and understand what's expected of me
I have run this code in CI and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
This PR has no interface changes (e.g. macros, cli, logs, json artifacts, config files, adapter interface, etc) or this PR has already received feedback and approval from Product or DX

dbeatty10

@jaypeedevlin thanks for raising this PR 🤩 !

Three things to get this passing CI:

Code suggestion to expect dbt run to fail when it raises the error
Code suggestion to apply the expected code format
Add a changelog entry via changie new

Just let @dataders or me know if you want an assist with any of the above.
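
The first suggestion could look roughly like the sketch below: the test expects the run to fail and then checks the error text. This is a self-contained illustration, not the test from this PR; `fake_dbt_run`, `FakeDatabaseError`, `SENSITIVE_VALUE`, and the error wording are hypothetical stand-ins for a real dbt invocation against Snowflake.

```python
import re

SENSITIVE_VALUE = "customer_secret"  # hypothetical raw value that must not be logged


class FakeDatabaseError(Exception):
    """Stand-in for the database error a failing run would surface."""


def fake_dbt_run() -> None:
    """Hypothetical stand-in for a dbt run that hits a duplicate key error."""
    raw = f"Duplicate field key '{SENSITIVE_VALUE}'"
    # Redact before the message can reach logs or exceptions.
    redacted = re.sub(r"Duplicate field key '.*'", "Duplicate field key '[redacted]'", raw)
    raise FakeDatabaseError(redacted)


def test_duplicate_key_not_in_exceptions() -> None:
    # The run is expected to fail; a silent pass would mean the error was never triggered.
    try:
        fake_dbt_run()
    except FakeDatabaseError as exc:
        message = str(exc)
    else:
        raise AssertionError("expected the run to fail")

    # The failure is the specific expected error, with the sensitive value replaced.
    assert "Duplicate field key '[redacted]'" in message
    assert SENSITIVE_VALUE not in message


test_duplicate_key_not_in_exceptions()
```

Asserting on both the redacted marker and the absence of the raw value guards against a test that passes only because the error never occurred.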

tests/functional/duplicate_key_unlogged/test_duplicate_key_not_in_exceptions.py

dbt/adapters/snowflake/connections.py

Co-authored-by: Doug Beatty <[email protected]>

jaypeedevlin · 2023-09-15T00:58:39Z

@dbeatty10 thanks for the review. I believe this should now be good to go. Assuming the merge goes through, if there's any possibility of getting this backported to 1.6.x, that would be a massive help for us!

mikealfare

Thanks for the PR @jaypeedevlin! I just had two suggestions, otherwise I like the approach.

dbt/adapters/snowflake/connections.py

tests/functional/redact_log_values/test_duplicate_key_not_in_exceptions.py

* Add failing test
* Fix model test naming
* Add redaction looping logic
* Apply suggestions from code review
* Add changelog
* Fix test case
* Rename test class
* Colocate redaction tests
* Ignore if dbt run passes or fails
* Materialize as a table instead of a view to trigger an error
* Expect the run to fail with a specific error message
* Reverse order of dict, assert that sensitive data is replaced
* Add newline

Co-authored-by: Doug Beatty <[email protected]>
Co-authored-by: Mike Alfare <[email protected]>
(cherry picked from commit 9fa8de2)
Co-authored-by: Josh Devlin <[email protected]>

jaypeedevlin added 3 commits September 14, 2023 14:22

Add failing test

dfd5940

Fix model test naming

98b40a9

Add redaction looping logic

c686d4a

cla-bot bot added the cla:yes label Sep 14, 2023

jaypeedevlin changed the title ~~Jaypeedevlin redact values~~ Redact values from logs due 'duplicate key' error Sep 14, 2023

jaypeedevlin marked this pull request as ready for review September 14, 2023 05:13

jaypeedevlin requested a review from a team as a code owner September 14, 2023 05:13

jaypeedevlin requested a review from colin-rogers-dbt September 14, 2023 05:13

dataders added ok to test test snowflake labels Sep 14, 2023

dataders closed this Sep 14, 2023

dataders reopened this Sep 14, 2023

dbeatty10 reviewed Sep 14, 2023

tests/functional/duplicate_key_unlogged/test_duplicate_key_not_in_exceptions.py Outdated

dbt/adapters/snowflake/connections.py Outdated

jaypeedevlin and others added 5 commits September 15, 2023 09:09

Apply suggestions from code review

3d8b846

Co-authored-by: Doug Beatty <[email protected]>

Add changelog

1ea98a3

Fix test case

d88a187

Rename test class

dc85148

Colocate redaction tests

5759ec7

dataders requested a review from dbeatty10 September 15, 2023 13:17

dbeatty10 added 4 commits September 15, 2023 07:36

Merge branch 'main' into jaypeedevlin-redact-values

d3878e5

Ignore if dbt run passes or fails

c644545

Materialize as a table instead of a view to trigger an error

909c1cd

Expect the run to fail with a specific error message

8aa5d8b

dbeatty10 added the ready_for_review label ("Externally contributed PR has functional approval, ready for code review from Core engineering") Sep 15, 2023

Merge branch 'main' into jaypeedevlin-redact-values

616087c

mikealfare self-assigned this Sep 15, 2023

mikealfare reviewed Sep 15, 2023

dbt/adapters/snowflake/connections.py

tests/functional/redact_log_values/test_duplicate_key_not_in_exceptions.py

Reverse order of dict, assert that sensitive data is replaced

fd71023

jaypeedevlin requested a review from mikealfare September 18, 2023 06:06

dbeatty10 added 2 commits September 18, 2023 06:50

Merge branch 'main' into jaypeedevlin-redact-values

2cf2258

Add newline

11f2ce9

colin-rogers-dbt approved these changes Sep 18, 2023


colin-rogers-dbt merged commit 9fa8de2 into dbt-labs:main Sep 18, 2023

dataders added backport 1.5.latest and removed backport 1.5.latest labels Sep 19, 2023

This was referenced Sep 19, 2023

[Backport 1.5.latest] Redact values from logs due 'duplicate key' error #781

Merged

[Backport 1.6.latest] Redact values from logs due 'duplicate key' error #782

Merged
