Skip to content

Commit

Permalink
Update A3 Mega solution to use NCCL plugin 1.0.2
Browse files Browse the repository at this point in the history
  • Loading branch information
tpdownes committed May 31, 2024
1 parent e9614ad commit 8e41adf
Showing 1 changed file with 1 addition and 4 deletions.
5 changes: 1 addition & 4 deletions tools/prologs-epilogs/receive-data-path-manager-mega
Original file line number Diff line number Diff line change
Expand Up @@ -30,10 +30,7 @@ fi
# ensure that dmabuf-import-helper is loaded
modprobe import-helper

# Populate /etc/hostname for pyxis/enroot
hostname | tee /etc/hostname

NCCL_PLUGIN_IMAGE=us-docker.pkg.dev/gce-ai-infra/gpudirect-tcpxo/nccl-plugin-gpudirecttcpx-dev:v1.0.1
NCCL_PLUGIN_IMAGE=us-docker.pkg.dev/gce-ai-infra/gpudirect-tcpxo/nccl-plugin-gpudirecttcpx-dev:v1.0.2
RXDM_IMAGE=us-docker.pkg.dev/gce-ai-infra/gpudirect-tcpxo/tcpgpudmarxd-dev:v1.0.8
RXDM_CONTAINER=receive-datapath-manager-"${SLURM_JOB_ID}"
if [[ ${SLURM_SCRIPT_CONTEXT} == "prolog_slurmd" ]]; then
Expand Down

0 comments on commit 8e41adf

Please sign in to comment.