Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature] Expose more information about a job for debugging purposes #544

Open
fosterseth opened this issue Feb 9, 2022 · 1 comment
Open

Comments

@fosterseth
Copy link
Member

fosterseth commented Feb 9, 2022

What I'd like to be able to tell at a glance

  1. what was the exact work_submit command that started the work? this would be analogous to the job_args field in awx
  2. what ongoing threads/processes/file descriptors are open for a particular work unit?
    • i.e. is monitorRemoteUnit still running? we can't infer that from the status because monitorRemoteStatus and monitorRemoteStdout operate in separate threads
    • is GetResults spinning in a loop

Solutions:

  1. Logging data. This could be structured logging, similar to job_lifecycle in awx.
    Pros: doesn't require changes to work unit structs, or to the status file.
    Cons: Finding and parsing logs is tedious and not fun.

  2. Put this data in work unit structures. This can either be written to disk or kept just in memory.
    Pros: we can just narrow in on some culprit work unit by looking at its status file or by calling some receptorctl that gives us more information about the work unit
    Cons: more work to implement and get right

@fosterseth
Copy link
Member Author

fosterseth commented Feb 9, 2022

@AlanCoding maybe you have some other ideas of nice-to-haves when it comes to extra information about a receptor work unit

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants