WeeklyTelcon_20191210
Geoffrey Paulsen edited this page Dec 16, 2019 · 1 revision
- Dialup Info: (Do not post to public mailing list or public wiki)
- Geoffrey Paulsen (IBM)
- William Zhang (AWS)
- Austen Lauria (IBM)
- Brian Barrett (AWS)
- Josh Hursey (IBM)
- Brendan Cunningham (Intel)
- Michael Heinz (Intel)
- Todd Kordenbrock (Sandia)
- Noah Evans (Sandia)
- Akshay Venkatesh (NVIDIA)
- Edgar Gabriel (UH)
- Harumi Kuno (HPE)
- Howard Pritchard (LANL)
- Matthew Dosanjh (Sandia)
- Thomas Naughton (ORNL)
- Artem Polyakov (Mellanox)
- Jeff Squyres (Cisco)
- George Bosilca (UTK)
- David Bernhold (ORNL)
- Brandon Yates (Intel)
- Charles Shereda (LLNL)
- Erik Zeiske
- Joshua Ladd (Mellanox)
- Mark Allen (IBM)
- Matias Cabral (Intel)
- Nathan Hjelm (Google)
- Ralph Castain (Intel)
- Xin Zhao (Mellanox)
- mohan (AWS)
Blockers All Open Blockers
Review v3.0.x Milestones v3.0.4
Review v3.1.x Milestones v3.1.4
- 3.0.5 and 3.1.5 have shipped
- Planning for no new fixes on 3.x, unless super critical
- BUT, looks like something was messed up with 3.1.5, not sure about 3.0.x branch
- Brian will read up on the issue and see if we need to release to address.
- May be just an issue with Fedora / RHEL 7.8 that we don't see on earlier RHEL.
Review v4.0.x Milestones v4.0.3
- v4.0.3 in the works.
- Schedule: End of January.
- There's a problem in Open MPI v4.0.2 that packagers will hit with UCX 1.7.
- PR 1752 may drive an earlier release if UCX is released sooner.
- PR 7116
- Ensure no backwards compat issues?
- Howard will send email to ARM.
- PR 7149 - Geoff go look at.
- A few new enhancements desirable.
- Added a Target v4.1.x label
- Many new enhancements / features would be useful
- 7151 - This is indeed a performance enhancement.
- 7173
- Should look into the amount of work involved in back-porting features to a release branch.
- It would be a major thing, but we always say we don't take features into a release branch that's already out there.
- People continue to open PRs with features.
- Two issues:
- One - we've really stalled out v5.0.0
- Two - are performance features really an issue to pull in?
- PR 7151 - seems to be borderline bugfix / feature / risky
- PR 7151 - enhancement
- Schedule: April 2020?
- Wiki - go look at items, and we should discuss a bit in weekly calls.
- Some items:
- MPI1 removed stuff.
- vader leaves SHM files lying around: https://github.com/open-mpi/ompi/issues/7220
- Verified in 3.1.x and v4.0.1, but not v4.0.2
- v4.0.x registers with PMIx
- vader doesn't use session dirs.
- Confusion about target labels on PRs.
- Some folks were adding release milestones to master PRs.
- Rules: Don't specify more than one target label on PR.
- Only set Milestone on release branch PRs???
- or on master PR needed for a particular release?
- Writeup here: https://github.com/open-mpi/ompi/wiki/GitHub-Robot-Tasks
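The one firm rule above (no more than one target label per PR) could be checked mechanically. The following is only a minimal sketch: the function names are hypothetical, and it assumes target labels all start with the word "Target" (as in the "Target v4.1.x" label mentioned earlier). It does not encode the Milestone question, which is still open.

```python
# Sketch only: check_pr_labels and target_labels are hypothetical helpers,
# not part of any existing Open MPI robot or the GitHub API.
TARGET_PREFIX = "Target"  # assumption: every target label starts with this word


def target_labels(labels):
    """Return the labels that look like target labels."""
    return [name for name in labels if name.startswith(TARGET_PREFIX)]


def check_pr_labels(labels):
    """Return a list of rule violations; an empty list means the PR is fine."""
    problems = []
    targets = target_labels(labels)
    if len(targets) > 1:
        problems.append("more than one target label: " + ", ".join(targets))
    return problems


if __name__ == "__main__":
    # A PR carrying two target labels breaks the rule.
    print(check_pr_labels(["bug", "Target v4.0.x", "Target v4.1.x"]))
```

A check like this would take the label names of a PR as plain strings, however they are fetched, and flag the PR for a human to fix rather than change labels itself.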
- Probably should get down to supporting only one runtime.
- Josh, Ralph, Jeff, Brian, and Tom
- Met one day to talk about PRRTE / ORTE and what to do.
- PRRTE probably makes the most sense
- git submodules much better than subversion external modules.
- Being part of the OMPI package is limiting.
- Boxes in the Runtime to prevent ORTE from taking off on its own.
- Not a huge operation.
- PMIx would be a first class citizen
- Still bundle PRRTE in tarballs, so could launch over ssh.
- Have to add additional Nightly testing to catch issues.
- Talked about not being a bash script.
- Ralph said he had most of this working on a branch.
- PRRTE only has external hwloc, pmix, and libevent.
- If you pull this in, will need to build PRRTE with the internal versions of
- May accelerate need to kill off internals in Open-MPI to simplify things.
- Release tarballs;
- Still drop these into tarball for convenience?
- Should discuss, perhaps a version of the tarball that has everything?
- Possibly do a survey again, to just have everything external?
- PRRTE Testing
- Can develop some PMIx Unit test(s) for PMIx library and for Resource managers
- To mimic the way that Open MPI uses PMIx.
- PMIx acceptance tests in Open MPI project
- Currently don't have many Runtime tests.
- Mapping, binding, output filename, etc.
- Use these tests to
- Questions, and discussion. Interested.
- It's official! Portland, Oregon, Feb 17, 2020.
- Safe to begin booking travel now.
- Please register on Wiki page, since Jeff has to register you.
- Date looks good. Feb 17th right before MPI Forum
- 2pm Monday, and maybe most of Tuesday
- Cisco has a Portland facility and is happy to host.
- about 20-30 min drive from MPI Forum, will probably need a car.
Review Master Master Pull Requests
- IBM's PGI test has NEVER worked. Is it a real issue or local to IBM?
- Austen is looking into it.
- Absoft 32-bit Fortran failures.
- No discussion this week.
- See older weekly notes for prior items.
- No discussion this week.