Skip to content

Commit 155beed

Browse files
authored
Merge pull request #53 from AI-Hypercomputer/raymondzou-collectives
Add more info to the README for running multislice
2 parents ccb1247 + ffd5971 commit 155beed

File tree

1 file changed

+7
-2
lines changed
  • microbenchmarks/trillium/collectives

1 file changed

+7
-2
lines changed

microbenchmarks/trillium/collectives/README.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ Please follow this [link](https://github.com/AI-Hypercomputer/tpu-recipes/blob/m
77

88
### Starting workload
99

10-
Launch the XPK workload:
10+
Launch the XPK workload, example to run on 1 slice of v6e-256:
1111
```
1212
python3 ~/xpk/xpk.py workload create \
1313
--cluster=${CLUSTER_NAME} \
@@ -20,6 +20,11 @@ python3 ~/xpk/xpk.py workload create \
2020
--workload=${WORKLOAD_NAME}
2121
```
2222

23+
To run on more than 1 slice, modify the `--num_slices` and `--config` flags to use the target number of slices and the corresponding yaml config file e.g
24+
```
25+
--num_slices=2 --config=configs/2x_v6e_256.yaml
26+
```
27+
2328
From your workload logs, you should start seeing benchmark logs:
2429
```
2530
psum_dcn: Matrix size: 17408x17408, dtype=<class 'jax.numpy.bfloat16'>, matrix_size_gbyte=0.606076928,achieved_bandwidth_gbyte_s=4.1130934137328214
@@ -31,4 +36,4 @@ Results will be printed out and also stored at `/tmp/microbenchmarks/collectives
3136
gsutil cp -r /tmp/microbenchmarks/collectives gs://<your-gcs-bucket>
3237
```
3338

34-
Check out the other scripts for running on more than 1 slice.
39+

0 commit comments

Comments
 (0)