remove and reinstall current libnccl each run
akiki-liang0 committed Dec 4, 2024
1 parent 91ab62f commit 8aa68bf
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions tools/prologs-epilogs/receive-data-path-manager-mega
@@ -39,6 +39,8 @@ if [[ ${SLURM_SCRIPT_CONTEXT} == "prolog_slurmd" ]]; then
 export PATH=${PATH}:/usr/local/lib/google-cloud-sdk/bin/
 gcloud auth configure-docker --quiet us-docker.pkg.dev 2>&1 &>/dev/null

+rm -rf /var/lib/tcpxo/lib64 || true
+
 # Install the nccl, nccl-net lib into /var/lib/tcpxo/lib64/.
 docker run --rm --name nccl-installer \
 --network=host \
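The change follows a clean-then-reinstall pattern: the target directory is deleted before the installer runs, so files left by an earlier run cannot survive into the current one. A minimal sketch of that pattern is shown below. It assumes a hypothetical `install_libs` helper standing in for the `docker run` installer (whose full command is not shown in this hunk) and uses temporary directories instead of `/var/lib/tcpxo/lib64`; the file names are illustrative only.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical stand-in for the real installer (the `docker run` step in the
# prolog). It simply copies the contents of "$1" into "$2".
install_libs() {
  local src="$1" dest="$2"
  mkdir -p "${dest}"
  cp -a "${src}/." "${dest}/"
}

# Remove the destination first so stale files from an earlier run are gone,
# then install the current set of libraries.
reinstall_libs() {
  local src="$1" dest="$2"
  rm -rf "${dest}"
  install_libs "${src}" "${dest}"
}

# Demo in a temporary directory (illustrative file names, not real artifacts).
workdir="$(mktemp -d)"
mkdir -p "${workdir}/src" "${workdir}/lib64"
touch "${workdir}/src/libnccl.so.new"     # shipped by the "current" install
touch "${workdir}/lib64/libnccl.so.old"   # stale file left by an earlier run

reinstall_libs "${workdir}/src" "${workdir}/lib64"
ls "${workdir}/lib64"
```

Note that `rm -rf` already exits successfully when the path does not exist, so the `|| true` in the commit only matters for other failures (for example a permission error) in a script where a non-zero exit status would abort the prolog.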
