Use `touch EXIT` to stop CP2K at walltime end #51

danieleongari · 2019-10-09T14:51:05Z

CP2K allows for a soft kill, by creating in the running folder an empty file called EXIT.
This solution can be implemented in the plugin to stop the code.

When using EXIT, CP2K stops at the current SCF step (returning a warning as if it did not converge) but still prints the standard program termination, which allows for smoother parsing of the output.
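As a minimal sketch of the soft-kill mechanism described above, the following creates an empty `EXIT` file in a given directory, the equivalent of running `touch EXIT` there. The function name `request_soft_exit` and the `workdir` argument are hypothetical; how the plugin would reach the remote working directory is not covered here.

```python
from pathlib import Path


def request_soft_exit(workdir):
    """Create an empty EXIT file in `workdir` (equivalent to `touch EXIT`).

    `workdir` is assumed to be the folder in which CP2K is running.
    """
    exit_file = Path(workdir) / "EXIT"
    # touch() creates an empty file, or leaves an existing one in place.
    exit_file.touch(exist_ok=True)
    return exit_file
```

Calling `request_soft_exit("/path/to/run")` on the machine where the job runs should have the same effect as the shell command in the issue title.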


yakutovicha · 2019-10-18T11:25:23Z

Another possibility would be to set the walltime parameter in the CP2K input slightly smaller than the one set in the batch file.

yakutovicha · 2019-10-18T11:27:37Z

I am not sure, though, if the input plugin can do something while the calculation is running. I am actually pretty sure it can't. @ltalirz do you think it is possible?

yakutovicha · 2020-03-26T11:34:17Z

Made an issue upstream: aiidateam/aiida-core#3868. Let's see how this evolves.

sphuber · 2020-03-26T12:34:12Z

> Another possibility would be to specify the walltime parameter in the cp2k input slightly smaller than it is set in the batch file.

If CP2K provides this functionality, I would definitely go this route. This is what I do in Quantum ESPRESSO as well, and it works relatively well. I implement this directly in the PwBaseWorkChain, where I always take the metadata.options.resources.walltime_seconds input and set a fraction of it in the input parameters of the PwCalculation. This way, all other workflows automatically inherit this behavior.
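For CP2K, the same idea could be sketched as below. This assumes the `WALLTIME` keyword of CP2K's `GLOBAL` section and a nested-dictionary representation of the input parameters; the function name `with_soft_walltime` and the default `fraction` are hypothetical choices, not something taken from the plugin.

```python
import copy


def with_soft_walltime(parameters, max_wallclock_seconds, fraction=0.95):
    """Return a copy of `parameters` with GLOBAL/WALLTIME set below the scheduler limit.

    `parameters` is assumed to be a nested dict mirroring the CP2K input sections.
    `fraction` is a hypothetical safety margin, leaving time for a clean shutdown.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in the interval (0, 1]")
    updated = copy.deepcopy(parameters)  # leave the caller's dict untouched
    # CP2K accepts the walltime in seconds; truncate to a whole number.
    updated.setdefault("GLOBAL", {})["WALLTIME"] = int(max_wallclock_seconds * fraction)
    return updated
```

Doing this in a base workchain rather than in each calculation means every workflow built on top of it gets the reduced walltime without extra code.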

I responded to the issue Sasha opened. Although the other possibility could in principle be implemented, I would say it is quite challenging.

dev-zero · 2021-04-08T07:32:24Z

Would this also mean that if I'm going to kill a process the base workchain could possibly react to it by first trying to write an EXIT file to terminate the process on remote gracefully?

sphuber · 2021-04-08T07:58:25Z

> Would this also mean that if I'm going to kill a process the base workchain could possibly react to it by first trying to write an EXIT file to terminate the process on remote gracefully?

Not really. The workchain execution is blocked until the child process (the CalcJob in this example) is terminated. That means the workchain cannot retake control and perform an action before the CalcJob has finished running. You could of course manually write an EXIT file in the working directory of the CalcJob which would cause the code to stop gracefully. The daemon will then realize the job is done and start retrieval and parsing. If the parser properly recognizes the graceful shutdown and sets an appropriate exit code, and the base restart has a handler for that exit code to simply start a new calcjob, restarting from the output of the last, then that works. That is exactly what the PwBaseWorkChain and PwCalculation do.
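The restart logic described above can be sketched in plain Python, without any AiiDA API. The exit code value `SOFT_EXIT_CODE`, the function `run_with_restarts`, and the `run_calculation` callable are all hypothetical stand-ins: the callable plays the role of "submit a CalcJob, wait for it, parse the output" and returns an exit code together with something to restart from.

```python
SOFT_EXIT_CODE = 400  # hypothetical code the parser sets after a graceful shutdown


def run_with_restarts(run_calculation, max_iterations=5):
    """Rerun a calculation until it finishes, restarting after graceful shutdowns.

    `run_calculation(restart_from)` must return `(exit_code, output)`.
    Returns `(iterations_used, final_output)`.
    """
    restart_from = None
    for iteration in range(1, max_iterations + 1):
        exit_code, output = run_calculation(restart_from)
        if exit_code == 0:
            return iteration, output
        if exit_code == SOFT_EXIT_CODE:
            # Graceful stop (EXIT file or walltime): continue from the last output.
            restart_from = output
            continue
        raise RuntimeError(f"unhandled exit code {exit_code}")
    raise RuntimeError("maximum number of iterations reached")
```

The loop only regains control after each calculation has returned, which mirrors the point made above: the workchain cannot act while the child process is still running.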
