Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Debugging MPI programs #4

Open
renatobellotti opened this issue Feb 23, 2021 · 4 comments
Open

Debugging MPI programs #4

renatobellotti opened this issue Feb 23, 2021 · 4 comments

Comments

@renatobellotti
Copy link

Are there plans to process the raw communication data and provide a high-level event history for MPI communication? I think the scientific open source community is in desperate need of a tool to debug distributed codes.

@liad-guardicore
Copy link
Contributor

Hey @renatobellotti can you please share some examples where you think IPCDump could help? How MPI Application usually communicate?

@renatobellotti
Copy link
Author

Thanks for your answer.

Well, the problem is that I have no clue how MPI works internally. It is more a standard for a communication API, in the style of "send-array-to-process-a", "wait-for-array-from-process-b" and other stuff in that direction. I guess the implementation can even use different kinds of mechanism depending on which cores/nodes the processes run, but I'm just a user, I don't know what happens behind the scenes. I was hoping that you knew about MPI and could add something like a filter to group messages semantically. :)

@liad-guardicore
Copy link
Contributor

As far as I know about MPI (which is not a lot) the mechanism on which it uses to pass information differs between implementations, but I do think it could be a nice feature to have a wrapper that knows how to identify MPI communication and present them in a nicer way than just random IPC events. (Not sure this is possible, but will be nice to check if it is and if it is, to implement it)

@renatobellotti
Copy link
Author

I'm very happy to hear that, I think the scientific community would love this feature!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants