`Parser.parse_from_node`: Validate outputs against process spec #6159

sphuber · 2023-10-21T18:21:58Z

The parse_from_node clasmethod is a utility function to call the parse method of a Parser implementation for a given CalcJobNode. It automatically calls parse in the correct manner, passing the retrieved output node, and wrapping it in a calcfunction to optionally store the provenance.

However, since the calcfunction by default has a dynamic output namespace and so accepts all outputs, it would not perform the same output validation that the original CalcJob would have done since that most likely will have defined specific outputs. Especially given that the parse_from_node method would often be used in unit tests to test Parser implementations, having the output validation be different would make it possible to mis bugs. For example, if the parser assigns an invalid output node, this would go unnoticed.

The parse_from_node is updated to patch the output specification of the calcfunction with that of the process class that was used to created the CalcJobNode which is being passed as an argument. As long as the process can of course be loaded. This ensures that when the calcfunction return the outputs returned by Parser.parse they are validated against the output specification of the original CalcJob class. If it fails, a ValueError is raised.

The `parse_from_node` clasmethod is a utility function to call the `parse` method of a `Parser` implementation for a given `CalcJobNode`. It automatically calls `parse` in the correct manner, passing the `retrieved` output node, and wrapping it in a calcfunction to optionally store the provenance. However, since the `calcfunction` by default has a dynamic output namespace and so accepts all outputs, it would not perform the same output validation that the original `CalcJob` would have done since that most likely will have defined specific outputs. Especially given that the `parse_from_node` method would often be used in unit tests to test `Parser` implementations, having the output validation be different would make it possible to mis bugs. For example, if the parser assigns an invalid output node, this would go unnoticed. The `parse_from_node` is updated to patch the output specification of the `calcfunction` with that of the process class that was used to created the `CalcJobNode` which is being passed as an argument. As long as the process can of course be loaded. This ensures that when the `calcfunction` return the outputs returned by `Parser.parse` they are validated against the output specification of the original `CalcJob` class. If it fails, a `ValueError` is raised.

sphuber · 2023-10-21T18:23:04Z

@mbercx this would have caught the bug in the Q2rParser for which we just merged the fix

mbercx

Thanks @sphuber, LGMT!

sphuber requested a review from mbercx October 21, 2023 18:22

mbercx approved these changes Oct 23, 2023

View reviewed changes

sphuber merged commit d16792f into aiidateam:main Oct 23, 2023
17 checks passed

sphuber deleted the fix/parse-from-node-output-spec branch October 23, 2023 08:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`Parser.parse_from_node`: Validate outputs against process spec #6159

`Parser.parse_from_node`: Validate outputs against process spec #6159

sphuber commented Oct 21, 2023

sphuber commented Oct 21, 2023

mbercx left a comment

Parser.parse_from_node: Validate outputs against process spec #6159

Parser.parse_from_node: Validate outputs against process spec #6159

Conversation

sphuber commented Oct 21, 2023

sphuber commented Oct 21, 2023

mbercx left a comment

Choose a reason for hiding this comment

`Parser.parse_from_node`: Validate outputs against process spec #6159

`Parser.parse_from_node`: Validate outputs against process spec #6159