Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Will local environment eventually support all software steps of EMR? #33

Open
vishalovercome opened this issue May 16, 2023 · 2 comments

Comments

@vishalovercome
Copy link

This extension can really help validate the correctness of the runtime environment. That said, an EMR cluster creation involves various steps (software, configuration, bootstrap actions, etc.) each of which could be error prone.

It will be interesting to know what all EMR features can be incorporated. Perhaps there are some MER automation tools that can be leveraged by this extension to support the software steps.

@dacort
Copy link
Contributor

dacort commented May 16, 2023

Interesting idea, for sure. For now, the focus of the tool is on local development.

That said, there is also the emr-cli that focuses on building reproducible packages.

@vishalovercome
Copy link
Author

EMR makes it easy to install softwares other than Spark. Configuration and bootstrap actions are the bare minimum.

To realize the true potential of this extension, I would suggest you try to map devcontainer concepts to EMR concepts and then use this mapping at the time of deploying to EMR:

  1. postCreateCommand / postStartCommand can be to generate bootstrap actions
  2. devcontainer features (software catalog) => bundled software
  3. container tags => emr tags
  4. ...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants