Skip to content

MagriLab/YellowstoneTest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

YellowstoneTest

This repo contains some sample code to test gpu capabilities on the Yellowstone cluster.

Ideally, we will be able to run the multi_node.py file to access gpus on multiple nodes. This should be tested with four nodes, eight gpus on each node.

The jax.distributed.initialize should not require the coordinator address or process id as the cluster uses the slurm scheduler. Some care must be taken to ensure than slurm launches a single process per gpu, as this is what jax expects.

About

Quick code to test Yellowstone cluster

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published