Add Redshift Iam Idc token authentication method with an eye towards future supported Idps #970

VersusFacit · 2024-12-17T00:13:42Z

resolves #898

Problem

Add refresh_token-based authentication method. refresh_token allows for use of Iam Idc tokens that we generate adhoc for the redshift_connector.connect call. We had originally sought to provide a single access token and reuse it until TTL reached, but this is impossible -- tokens are only good for one use in this integration. Hence we must adhoc generate one each time and refresh tokens are an expedient and industry compliant method for doing so

Solution

Provide a refresh token endpoint and necessary information to enable this.

Additional testing

End to end test using the following profile which points to a test redshift cluster on an AWS account with an integrated Okta <> Iam idc <> Redshift token authentication suite.

class TestMyTest:
    @pytest.fixture(scope="class")
    def models(self):
        return {
            "base_table.sql": "{{ config(materialized='table') }} select 1 as id",
        }

    @pytest.fixture(scope="class")
    def dbt_profile_target(self):
        return {
            "type": "redshift",
            "host": "...",
            "port": 5439,
            "dbname": "dev",
            "threads": 1,
            "token_endpoint": {
                "request_url": "https://....oktapreview.com/oauth2/default/v1/token",
                "idp_auth_credentials": ...,
                "request_data": 'grant_type=refresh_token&redirect_uri=http%3A%2F%2Flocalhost%3A8080%2Flogin%2Foauth2%2Fcode%2Fokta&refresh_token=...'

            },
            "method": "oauth_token_identity_center",
            "schema": "dbt_mila",
        }


    def test_my_test(self, project):
        run_dbt()

where ... is some credential I've used for testing.

Checklist

I have read the contributing guide and understand what's expected of me
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
This PR has no interface changes (e.g. macros, cli, logs, json artifacts, config files, adapter interface, etc) or this PR has already received feedback and approval from Product or DX

github-actions · 2024-12-17T00:13:59Z

Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the dbt-redshift contributing guide.

We expect users of this method to provide a YAML-structured set of params including a uri, an authentication string, and whatever paramters might be needed to construct the correct payload equivalent to data in a curl request. There is an all-important under the hood POST which needs a set of params unique to each identity provider to generate access tokens for use with TokenAuthIdpPlugin.

colin-rogers-dbt · 2025-01-14T16:39:56Z

dbt/adapters/redshift/connections.py

+            normal request failures.
+            """
+            # Handle the 429 rate-limiting case first
+            if response.status_code == 429:


if we're hitting the rate limit could we add an exponential backoff/retry strategy here?

haha, I kind of knew you were going to prompt me for that. We're actively determining that customer experience. Let me filter this up to the wider team to see if it makes sense for now

What are the scenarios where we're hitting the rate limit? I don't see a retry on this, so is this using some generalized retry? Does this get run on each model? Each run?

If we actually do a retry strategy -- the necessity of which is undetermined and trending towards 'not needed' based on @jenniferjsmmiller 's current research -- it'd be used on every connection. The 429 is specifically a rate limiting error status from okta

Since we're reusing connections where possible, this feels like a low occurrence event. In other words, if we hit the rate limit, my guess is that something else is actually going wrong or the user is simply abusing the application (e.g. running a lot of threads of dbt-redshift in parallel via something like airflow, which isn't officially supported).

We don't reuse redshift connections do we? And also there was concern about 100 threads being set and this occuring at startup but even that was something Jenn didn't seem to trigger in their research 🤞

dbt/adapters/redshift/connections.py

VersusFacit · 2025-01-14T18:45:07Z

Was an internal request to check in a version of this integration test without the creds that we skip for now. We can't support this on our current infra. I've made a Jira ticket to track this for our backlog though.

dbt/adapters/redshift/connections.py

mikealfare · 2025-01-14T22:31:53Z

dbt/adapters/redshift/connections.py

+            normal request failures.
+            """
+            # Handle the 429 rate-limiting case first
+            if response.status_code == 429:


What are the scenarios where we're hitting the rate limit? I don't see a retry on this, so is this using some generalized retry? Does this get run on each model? Each run?

dbt/adapters/redshift/connections.py

VersusFacit · 2025-01-15T09:39:33Z

Okay team, I've generalized the framework out to include an Entra option which may work (may need some slight adjustment but I've tested this pretty well considering our infra options imo). Moreover, the okta framework is better:tm: now.

A profile that wants to use this method will thus specify at least:

method: oauth_token_identity_center
token_endpoint:
    type: okta|entra|... 
    request_url: <https url to host with api endpoint>
    request_data: <data params needed>
   ...specific fields for each individual Idp...

I've added a bunch more unit tests and re-tested this using my refresh token end to end model build test :D

dbt/adapters/redshift/connections.py

…on some Idp

Fix tests and add token authentication method to auth flow

28949aa

cla-bot bot added the cla:yes label Dec 17, 2024

VersusFacit self-assigned this Dec 18, 2024

VersusFacit linked an issue Dec 18, 2024 that may be closed by this pull request

Support IAM Identity Center Authentication - browser and token based #898

Closed

VersusFacit added 3 commits December 17, 2024 18:14

Add changelog.

1dbfd76

Merge branch 'main' into ADAP-898/add_token_iam_authentication

5ad2a73

VersusFacit force-pushed the ADAP-898/add_token_iam_authentication branch from 829426f to afd9d13 Compare January 14, 2025 05:08

Add unit tests for current codepaths.

4d85bd4

VersusFacit marked this pull request as ready for review January 14, 2025 05:46

VersusFacit requested a review from a team as a code owner January 14, 2025 05:46

VersusFacit requested review from jenniferjsmmiller and ericbaumann January 14, 2025 05:50

Make test a bit more specific.

f091132

VersusFacit changed the title ~~Fix tests and add token authentication method to auth flow~~ Add Redshift Iam Idc token authentication method with an eye towards future supported Idps Jan 14, 2025

colin-rogers-dbt reviewed Jan 14, 2025

View reviewed changes

dbt/adapters/redshift/connections.py Outdated Show resolved Hide resolved

colin-rogers-dbt reviewed Jan 14, 2025

View reviewed changes

dbt/adapters/redshift/connections.py Outdated Show resolved Hide resolved

Add skeleton of test case I've been used.

5f5d090

mikealfare reviewed Jan 14, 2025

View reviewed changes

jenniferjsmmiller reviewed Jan 15, 2025

View reviewed changes

dbt/adapters/redshift/connections.py Show resolved Hide resolved

Code review comments and adapt for Entra + future providers.

dba6a01

VersusFacit requested review from colin-rogers-dbt, mikealfare and jenniferjsmmiller January 15, 2025 09:43

colin-rogers-dbt reviewed Jan 15, 2025

View reviewed changes

dbt/adapters/redshift/connections.py Outdated Show resolved Hide resolved

colin-rogers-dbt approved these changes Jan 15, 2025

View reviewed changes

ericbaumann approved these changes Jan 15, 2025

View reviewed changes

jenniferjsmmiller approved these changes Jan 15, 2025

View reviewed changes

VersusFacit added 2 commits January 15, 2025 14:59

Improve comment.

55f555a

Better error handling for missing access_token since it could happen …

54678b6

…on some Idp

VersusFacit enabled auto-merge (squash) January 15, 2025 23:43

VersusFacit merged commit de078b8 into main Jan 16, 2025
23 checks passed

VersusFacit deleted the ADAP-898/add_token_iam_authentication branch January 16, 2025 00:42

jenniferjsmmiller mentioned this pull request Mar 4, 2025

Implement support for External OAuth to Bigquery via GCP's Workload Identity Federation and Entra dbt-labs/dbt-adapters#867

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Redshift Iam Idc token authentication method with an eye towards future supported Idps #970

Add Redshift Iam Idc token authentication method with an eye towards future supported Idps #970

VersusFacit commented Dec 17, 2024 •

edited

Loading

github-actions bot commented Dec 17, 2024

colin-rogers-dbt Jan 14, 2025

VersusFacit Jan 14, 2025 •

edited

Loading

mikealfare Jan 14, 2025

VersusFacit Jan 14, 2025 •

edited

Loading

mikealfare Jan 15, 2025

VersusFacit Jan 15, 2025

VersusFacit commented Jan 14, 2025

mikealfare Jan 14, 2025

VersusFacit commented Jan 15, 2025 •

edited

Loading

Add Redshift Iam Idc token authentication method with an eye towards future supported Idps #970

Add Redshift Iam Idc token authentication method with an eye towards future supported Idps #970

Conversation

VersusFacit commented Dec 17, 2024 • edited Loading

Problem

Solution

Additional testing

Checklist

github-actions bot commented Dec 17, 2024

colin-rogers-dbt Jan 14, 2025

Choose a reason for hiding this comment

VersusFacit Jan 14, 2025 • edited Loading

Choose a reason for hiding this comment

mikealfare Jan 14, 2025

Choose a reason for hiding this comment

VersusFacit Jan 14, 2025 • edited Loading

Choose a reason for hiding this comment

mikealfare Jan 15, 2025

Choose a reason for hiding this comment

VersusFacit Jan 15, 2025

Choose a reason for hiding this comment

VersusFacit commented Jan 14, 2025

mikealfare Jan 14, 2025

Choose a reason for hiding this comment

VersusFacit commented Jan 15, 2025 • edited Loading

VersusFacit commented Dec 17, 2024 •

edited

Loading

VersusFacit Jan 14, 2025 •

edited

Loading

VersusFacit Jan 14, 2025 •

edited

Loading

VersusFacit commented Jan 15, 2025 •

edited

Loading