Replies: 4 comments 11 replies
-
Can you clarify what exactly you mean with
As you say, this is not in the official MTZ format description nor in the C API or Fortran API. At the moment I can't find it in either cmtzlib.c or cmtzlib.h - at least not with the phrase "appendix". It would be great if you could provide explicit pointers to the actual part of the CCP4 library that deals with this. I'm mainly concerned by this part in the MTZ format specification
Does that mean that one could have less than 30 history records? And what about those optional orientation data blocks: could one have just a batch title without orientation data? This seems imprecise enough that I wonder if the fact that some programs seemingly pass additions at the end of a MTZ file into output - while others don't - could be due to unclear handling of the data after the END record within those programs. Would it make sense to define a new section explicitely, e.g.
to (1) have section delimiters and (2) provide a format definition. The latter could allow storage of other formats - JSON, XML, CBF etc. There are probably better ways of doing this, but you get the idea. Small addition: all this obviously applies not just to synchrotron pipelines, but to any (processing) program producing MTZ files. A stable and portable method of storing additional information and/or helping with provenance tracking is valuable in all environments. :-) |
Beta Was this translation helpful? Give feedback.
-
xia2 writes scaled, unmerged mmCIF under the assumption that this will be used for deposition dials/dials#1457 (comment). As such the information is already there. Adding it to the MTZ as well implies that users won't necessarily use the mmCIF during deposition, but will take the scaled, unmerged MTZ. But why then are we writing the mmCIF? |
Beta Was this translation helpful? Give feedback.
-
We are planning to implement the mtz appendix stuff in KAMO for the use at SPring-8. A complexity is that we need a mechanism to track the information when merging multi-crystal data. Moreover, I don’t know what to put when data from multiple beamlines are merged. Another worry just came to my mind - is there any risk this appendix may be transferred undesirably? (e.g. when transferring test flags) |
Beta Was this translation helpful? Give feedback.
-
It might be useful if gemmi provided a parameterised function that would add an appendix to an MTZ. Something like
Then developers of data processing software who use this won't have to worry about the format of the appendix block, while you will be able to define this more easily, making use of the close relationship between gemmi and the PDB. You also then get control over what to do in corner cases, NULL values and so on. |
Beta Was this translation helpful? Give feedback.
-
Problem
During a PDB deposition, in most of cases the user needs to manually input details of data collection:
This is tedious, error-prone1, and the web form includes only the most important items. If the software that prepares mmCIF files had access to information about data collection it would include it in mmCIF file and this form would be pre-filled.
Such information is available in the synchrotron, but how to transfer it all the way down to the deposition?
Proposed solution
The MTZ format supports storing extra text at the end of the file2. Here, we call this extra text appendix.
Such text can be added even using Unix shell:
or
It's proposed here that synchrotron pipelines add metadata (info about detector, collection time, processing software, etc) to the MTZ appendix. This metadata will then be used when preparing PDB deposition3. Let us use the PDBx/mmCIF format – the same format that is used for deposition to the PDB. The metadata from synchrotron (any items4 that software developers in the synchrotron can provide) could be appended in a block named
collection
. Here is an example:The metadata should be added to unmerged MTZ files and (if the pipeline merges the data) also to merged MTZ files.
The appendix in data is likely to get lost during further processing5, but the software that prepares mmCIF files for deposition may still have access to the original files. In particular, that software may ask for scaled unmerged data6, so it's good if the appendix is also there.
If you plan to (or already did) implement it please leave a comment below. 👋
To check if an MTZ file has appendix you can print it with one of the following commands:
Footnotes:
Beta Was this translation helpful? Give feedback.
All reactions