-
Notifications
You must be signed in to change notification settings - Fork 859
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
accelerator: build components as dso's by default #12055
Conversation
bot:ompi:retest strange failure of make check shouldn't be impacted by this PR. rerunning |
bot:aws:retest |
bot:aws:retest |
60e1ba7
to
a055f11
Compare
okay, i've added smcuda, accelerator (so all components), and rcache-gpusm rcache-rgpusm but still see libcuda linked in to opal and ompi libs and hence to commands like ompi_info. digging... |
oh i had a typo in the config file. ignore last comment. |
7100bed
to
88d46f2
Compare
i still have no idea why the make check |
@jsquyres @edgargabriel please recheck |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can confirm that these components are now built as dso's by default.
You need to rebase your branch however, it is missing the Makefile.am fixes to btl/smcuda and the two rcache components, and hence doesn't compile.
88d46f2
to
db8b2c5
Compare
i'm going to push a temporary commit to undo the default dso module thing to see if that's somehow causing the make check to fail. |
5c937d8
to
e6ed21b
Compare
okay the problem is that opal_init call in the opal_conditional test fails in the make distcheck build. can't find an accelerator component. |
This is rather annoying. I've narrowed the problem down to the fact that when we try to select an accelerator component, if none are found, thoughts? |
e6ed21b
to
b5ca28d
Compare
i forgot that we can specify specific accelerator components to build as DSO's so trying that route. |
In theory there is no reason to have the null component as a dso, that could be part statically compiled, and that might resolve your problem |
that's what I'm trying. note this behavior in the accel base select component may need a better fix. For example, if someone configures with |
also need to switch rcache/gpsum and rcache/rgpusum to DSO by default. Fix a problem in opal_mca.m4 where the enable-mca-dso list wasn't being processed correctly starting with 5.0.0. related to open-mpi#12036 Signed-off-by: Howard Pritchard <[email protected]>
b5ca28d
to
8601eb5
Compare
@jsquyres please review again |
@jsquyres ping, i'd like to pr over to 5.0.x for the 5.0.1 release |
@jsquyres check now |
cb67b13
to
539a03a
Compare
Signed-off-by: Howard Pritchard <[email protected]>
539a03a
to
9580fd5
Compare
bot:nvidia:retest |
@open-mpi/cuda could someone see what's going on with ompi_NVIDIA CI? |
@jsquyres ping |
related to #12036