-
Notifications
You must be signed in to change notification settings - Fork 144
Conference call notes 20220202
Kenneth Hoste edited this page Feb 16, 2022
·
9 revisions
(back to Conference calls)
Notes on the 190th EasyBuild conference call, Wednesday 2 Feb 2022 (09:00 UTC)
Alphabetical list of attendees (12):
- Kenneth Hoste (HPC-UGent, Belgium)
- Kurt Lust (Univ. of Antwerpen - LUMI)
- Åke Sandgren (Umeå University, Sweden)
- Jurij Pečar (EMBL, Germany)
- Jörg Saßmannshausen
- Terje Kvernes (University of Oslo, Norway)
- Jorge Guerra (Universidad Politécnica de Madrid, Spain
- Sam Moors (Vrije Universiteit Brussel, Belgium)
- Mikael Öhman (Chalmers University of Technology, Sweden)
- Thomas Röblitz (Univ. of Bergen, Norway)
- Adam Huffman (Big Data Institute, Oxford, UK)
- Alex Domingo (Vrije Universiteit Brussel, Belgium)
- overview of recent developments
- OpenMPI + CUDA, OpenMPI 5.x
- Q&A
- release timeline
- latest release: EasyBuild v4.5.2 (24 Jan 2022)
- ETA next release: end of Feb'22?
- recent changes
-
framework
-
bug fixes
- only run GitHub tests when testing with Lua module syntax, to avoid hitting GitHub rate limit when running tests (PR #3938)
- fix
get_os_name
andget_os_version
to avoid reportingUNKNOWN
in output ofeb --show-system-info
(PR #3942) - take into account that patch files can also be zipped when checking filename extension for patches (PR #3936)
-
enhancements
- ...
-
changes
- ...
-
bug fixes
-
easyblocks
-
bug fixes
- convert version numbers to stricly numerical in Siesta easyblock (PR #2553)
- enhancements
-
changes
- ...
- new software
- ...
-
bug fixes
-
easyconfigs
- ~50 easyconfig PRs merged since last conf call!
-
bug fixes
- add patch for hard-coded checksum value of downloaded source file in the source code of RDKit 2021.03.4 (PR #14743)
- fix CVE-2021-23437 in Pillow (PR #14765) + Pillow-SIMD (PR #14792)
- add libXfont2 patch to fix build when libbsd is present (PR #14821)
- add missing UCX-CUDA dep to GROMACS for foss-2021a-CUDA-11.3.1 (PR #14859)
- add alternative checksum for MASS, class, nnet, spatial extensions in R v4.1.0 + v4.1.2 easyconfigs (PR #14873 + PR #14880)
- add patch to fix Kraken2 ncbi ftp/https check in
rsync_from_ncbi.pl
for versions 2.0.9-2.1.1 (PR #14889)
-
enhancements
- ...
-
(noteworthy) new software
- see also overview at https://github.com/easybuilders/easybuild-easyconfigs/issues?q=is%3Amerged+label%3Anew
- ...
-
noteworthy software updates
- ...
- changes
-
framework
-
reported bugs / bug fixes
- switch to using pip3 for installing EasyBuild in Singularity definition file generated by EasyBuild (PR #3945)
-
enhancements
- tolerate pre-existing edges in depgraph (PR #2784)
- extend framework to enable modules to ship RPATH wrappers (issue #3918)
- allow setting extension-specific envars in module file (PR #3948)
- add a "
clone_into
" field to git_config source specification (PR #3949) - add support for optional comment parameter for extensions (issue #3946)
- drop into shell with full environment in case of failure (issue #3950)
-
changes
- meaningful error output (and actual location of the log file on the line about log files) (issue #3915)
-
reported bugs / bug fixes
-
easyblocks
- bug fixes
- enhancements
- add support to NAMD easyblock to opt out of building with CUDA support even if CUDA is included as dependency (PR #2666)
- updates
- new software
-
changes
- set cairo as default bitmaptype for
R
inetc/Rprofile.site
(PR #2647)- see also https://github.com/easybuilders/easybuild/issues/772 for matplotlib
- revisit default use of
ld.gold
in binutils easyblock (PR #2657)
- set cairo as default bitmaptype for
-
easyconfigs
- bug fixes
- define environment variables to fix installing PyTorch-Geometric on a GPU-less node (PR #14361)
- update TALYS sources to (hopefully) ensure constant checksums (PR #14797)
- add missing dependencies on graphite2, HarfBuzz to Qt5 (PR #14802)
- add missing xxd build dependency for recent PLUMED versions (PR #14847)
- enhancements
- also install Python bindings for SHTns 3.5.1 (PR #14893)
-
new software
- see https://github.com/easybuilders/easybuild-easyconfigs/labels/new
- MPItrampoline (WIP) (PR #14607)
- noteworthy software updates
- changes
- ignore missing binutils dependency for binaries in easyconfigs test suite (PR #14718)
- bug fixes
-
framework
-
foss
+ CUDA + UCX-CUDA apparently doesn't fully work as expected... - can we avoid having to go back to
fosscuda
? - see easyconfigs issue #14801
- see upstream issue "
Specific OSU benchmarks segfault when non-CUDA aware OpenMPI 4.1.1 compiled with CUDA-aware UCX
" @ https://github.com/open-mpi/ompi/issues/9906 - Mikael: OpenMPI's OPAL has some custom code when built on top of CUDA
- custom
opal_cuda_memcpy
(viaMEMCPY_CUDA
macro) with additional check whether it's on a GPU
- custom
- things to change:
- enable OPAL CUDA support in OpenMPI (but without requiring CUDA)
- Åke: requires change to
--with-cuda
to add support for enabling CUDA-awareness by using--with-cuda=yes
- Åke: requires change to
- add OpenMPI-CUDA for CUDA mca bits (only install specific *.so` libraries)
- enable OPAL CUDA support in OpenMPI (but without requiring CUDA)
- OpenMPI 5.x
- recent change is problematic, essentially adds hard dependency on CUDA (cfr. https://github.com/open-mpi/ompi/commit/deb37ac03fff566b0ad235734f63213ec0775c72)
- see also https://github.com/open-mpi/ompi/issues/9762
- can we build a full OpenMPI with CUDA dep and use that to shadow OpenMPI without CUDA?
- may cause trouble, even depending on order in which things are loaded...
- foss + CUDA is a problem when using certain MPI collectives with CUDA buffers
- which is a quite specific use case
- we should try and fix this also for the
foss/2021a
/foss/2021b
toolchains, not just going forward forfoss/2022a
- Åke: same CUDA-aware problems will pop up with UCC library (see https://ucfconsortium.org/projects/ucc/)
- relevant: Alex' FOSDEM'22 talk: https://fosdem.org/2022/schedule/event/exascale_pmi
- is there an OpenMPI community call we can jump into to explain our problems and how we plan to fix it
- ...