Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GNI provider: Remote counters are not being incremented some of the times #1420

Open
tonyzinger opened this issue Nov 7, 2017 · 2 comments
Assignees

Comments

@tonyzinger
Copy link

I have a test that is doing an all-to-one communication pattern. On the receiving rank, the counter does not receive the correct number of transfers.
I use fi_cntr_read() to get the current value. I then increment the counter value by the number of transfers that I am receiving. This value is then used in the fi_cntr_wait() as the threshold value. In my program the wait value is 500 ms and I retry the fi_cntr_wait() 20 times before I return an error.
The sending ranks are using the fi_inject_write() and fi_inject_writedata() APIs.

I am attaching the debug output for the receiving rank.
RMA_gni_rank_0_pid_5606.log

@hppritcha
Copy link
Member

@tonyzinger could you supply a test case?

@tonyzinger
Copy link
Author

On jupiter in the directory: /home/users/ajz/issue_1420, read the README file and then execute the
test case via the execute_test_case.sh script.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants