Split auxiliary processes out of the ScyllaDB container #1940

Open
2 tasks
tnozicka opened this issue May 24, 2024 · 2 comments
Labels
kind/epic Categorizes issue as an epic. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@tnozicka
Member

tnozicka commented May 24, 2024

We run multiple processes within the ScyllaDB container. This goes against best practices, but most importantly, a cgroup is assigned per container, so every process in the container shares it. The issues that stem from that are:

  • other processes steal CPU quota from ScyllaDB and affect its latency
  • lifetimes are coupled: if the sidecar fails, it kills ScyllaDB too
  • multiple unrelated logs are mixed into one container and none of them can be parsed when collected
  • zombie processes can be left around
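
To make the shared-cgroup point concrete, here is a minimal sketch (not part of the operator) that groups the PIDs visible on a Linux host by their cgroup v2 path, as read from `/proc/<pid>/cgroup`. The helper names `parseCgroupV2Path` and `groupByCgroup` are made up for illustration, and the sketch assumes the unified (v2) hierarchy. Run inside the ScyllaDB container, it would be expected to list ScyllaDB and the auxiliary processes under the same path.

```go
package main

import (
	"bufio"
	"fmt"
	"os"
	"path/filepath"
	"sort"
	"strconv"
	"strings"
)

// parseCgroupV2Path returns the cgroup v2 path found in the contents of
// /proc/<pid>/cgroup, or "" if there is no unified-hierarchy entry.
// (Hypothetical helper, for illustration only.)
func parseCgroupV2Path(content string) string {
	sc := bufio.NewScanner(strings.NewReader(content))
	for sc.Scan() {
		// Each line has the form "hierarchy-ID:controller-list:path";
		// the cgroup v2 entry is "0::<path>".
		parts := strings.SplitN(sc.Text(), ":", 3)
		if len(parts) == 3 && parts[0] == "0" && parts[1] == "" {
			return parts[2]
		}
	}
	return ""
}

// groupByCgroup maps each cgroup path to the sorted PIDs that belong to it.
// PIDs without a v2 entry end up under the empty path.
func groupByCgroup(cgroupFiles map[int]string) map[string][]int {
	groups := map[string][]int{}
	for pid, content := range cgroupFiles {
		p := parseCgroupV2Path(content)
		groups[p] = append(groups[p], pid)
	}
	for _, pids := range groups {
		sort.Ints(pids)
	}
	return groups
}

func main() {
	// Collect /proc/<pid>/cgroup for every numeric entry in /proc (Linux only).
	files := map[int]string{}
	matches, _ := filepath.Glob("/proc/[0-9]*/cgroup")
	for _, m := range matches {
		pid, err := strconv.Atoi(filepath.Base(filepath.Dir(m)))
		if err != nil {
			continue
		}
		data, err := os.ReadFile(m)
		if err != nil {
			continue // the process may have exited in the meantime
		}
		files[pid] = string(data)
	}
	for path, pids := range groupByCgroup(files) {
		fmt.Printf("%s: %v\n", path, pids)
	}
}
```

Any CPU quota set on that cgroup applies to all listed PIDs together, which is why an auxiliary process can eat into ScyllaDB's share.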

There are two classes of processes that run within the ScyllaDB container. The first comes from the scylla-operator binary, which is copied into the container through a shared filesystem (relying on mostly static linking) and provides the probes, the sidecar controller and the startup logic. The second comes from the ScyllaDB container image itself, which uses the supervisord anti-pattern of running multiple processes in one container.

Tasks

  1. 2 of 6
    kind/epic priority/important-soon triage/accepted
  2. 2 of 7
    kind/epic priority/important-longterm
@tnozicka tnozicka added priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. kind/epic Categorizes issue as an epic. labels May 24, 2024
@tnozicka tnozicka self-assigned this May 24, 2024
Contributor

The Scylla Operator project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 30d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out

/lifecycle stale

@scylla-operator-bot scylla-operator-bot bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 11, 2024
@tnozicka
Member Author

/remove-lifecycle stale
/triage accepted

@scylla-operator-bot scylla-operator-bot bot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 12, 2024
This was referenced Jul 22, 2024
@tnozicka tnozicka removed their assignment Nov 25, 2024