Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Docs host on S3 cannot redirect to other pages due to access denied #10775

Open
marxaem opened this issue Dec 13, 2024 · 1 comment
Open

Comments

@marxaem
Copy link

marxaem commented Dec 13, 2024

Describe the bug
I am trying to host and share Data Docs on AWS S3. After checkpoint.run(), an index.html file on S3 was generated as expected. I followed the instruction to configure bucket policy as guided in Host and share Data Docs but when I opened the index.html from my bucket and clicked on any run record, it couldn't redirect to the detailed validation results page, it gave an Access Denied error. This problem is similar to an old issue Data Docs > S3 > Links to runs are access Denied but in a different version of GX.

To Reproduce
The data docs site configurations are as follows

data_docs_sites:
  S3_site:
    class_name: SiteBuilder
    store_backend:
      class_name: TupleS3StoreBackend
      bucket: bucket-name
      prefix: data-docs/
    site_index_builder:
      class_name: DefaultSiteIndexBuilder

By clicking index.html in S3 bucket, the Data Docs link will look like:
https://bucket-name.s3.ap-southeast-1.amazonaws.com/data-docs/index.html?X-Amz-Algorithm=xxx&...
However, when I clicked the run record, those keys generated from AWS were not following resulting in an Access Denied.
https://bucket-name.s3.ap-southeast-1.amazonaws.com/data-docs/expectations/my_suite.html

Expected behavior
Data Docs hosted on AWS S3 can be clicked to redirect to each page inside.

Environment (please complete the following information):

  • Operating System: MacOS
  • Great Expectations Version: 1.1.0
  • Data Source: Spark dataframe
  • Cloud environment: AWS
@marxaem marxaem changed the title Data Docs host on S3 cannot redirect to other page due to access denied Data Docs host on S3 cannot redirect to other pages due to access denied Dec 13, 2024
@ekeras
Copy link

ekeras commented Dec 13, 2024

I am having the same issue on GCP, one way of solving it is to make your bucket/directory publicly available but it's not secure.

It would be nice to have a way to pre-sign these generated urls with a key, so that private bucket could be accessed having an url.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants