Pynta restart for NERSC and Polaris #43

sakim8048 · 2024-04-03T20:37:26Z

Previous comment from Ray (October 2023):
I have updated the way we map tasks on each node for ALCF machines. Each task runs on a different FWorker, and each FWorker is associated with a node. This is available for multilauncher. The optimal approach is to set num_jobs to the number of nodes.

Additionally, I have added functionality to use PWDFT, including the calculator and related functions.

Updated comments from Shinae (Feb 2024)
I have updated the way we restart Pynta based on how Ray implemented the restart from Polaris.
From Pynta object, machine=<Machine type> should be specified to restart Pynta. Also for NERSC and any other machines, workflow id should be added to pyn.reset() to rerun the previous workflow.
pyn.reset() is tested in Perlmutter and was able to restart from queue=True mode.

This updates include Trevor's pull request (#33), which not yet merged.

Updated comments from Shinae (March 2024)
I rebased to current master, with recent changes. Added @rayhe88 as the author. I still need to add him as an author for previous commits.

sakim8048 and others added 10 commits April 3, 2024 09:48

polaris mapping info added in polaris.py

14ee869

Add machine keyword to pynta object

9633af3

Pickle slab info

9555e8e

add machine keyword and polaris mapping

a1b8967

Contibutor acknowledgement: Restart design contribution by RHE

598dc5b

copyDataAndSave added in util. Function called in pyn.reset()

2ceead7

import module

26a2cb6

multi_launcher.py for parallel environment. Required for reset()

8ef0994

add pickle to pynta object

f61a7a8

delete unnecessary variables

b3016d2

sakim8048 marked this pull request as draft April 15, 2024 04:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pynta restart for NERSC and Polaris #43

Pynta restart for NERSC and Polaris #43

sakim8048 commented Apr 3, 2024

Pynta restart for NERSC and Polaris #43

Are you sure you want to change the base?

Pynta restart for NERSC and Polaris #43

Conversation

sakim8048 commented Apr 3, 2024