-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Category PDBX_DIFFRN_MERGE_STAT_SHELL is isolated (and redundant?) #7
Comments
Using "widest resolution range" is bound to create problems (rounding differences, "nice number" for overall values etc). I can see the benefit of using a special On the other hand, the typical ordering in all tables presented to users is (from the low-resolution end) 1,2,3,...,N and then (N+1) is the overall value. How about adding an additional item into the |
This is designed to parallel reflns and reflns_shell statistics. Having different categories makes it easy to say - you provided a high resolution shell statistic - how about overall data - and how to find it - but personally I can go either way. |
Isn't an overall value identical to a per-shell value? The actual metric (and computation) is the same - using reflections within two given resolution limits. Keeping two different categories for identically defined and computed values seems a slightly unnecessary duplication. Given the inconsistent item naming and definitions within REFLNS and REFLNS_SHELL I'm not sure this is a good enough reason to necessarily stick with this. The handling of missing data at deposition (shell value(s) given, but no overall value) seems more closely related to actual deposition software and not necessarily impacting directly on the dictionary definitions. It should be easy for software to see if one of the shell values encompasses all other shells (and there is no overlap between those other ones - once we know if those limits are inclusive or not) and act in the same way as described above, I think. |
Per shell could be the entire resolution range. Historically, _reflns_shell was intended for the highest resolution shell. With data harvesting with pdb_extract, parsing log files became easier (and I suspect xia2 does the same). Therefore, ranges are now accepted. Ordering is a challenge. Do we go with low to high or high to low. If you are trying to determine the overall data statistics, would you want them mixed in here? |
The category
PDBX_DIFFRN_MERGE_STAT_SHELL
:PDBX_DIFFRN_MERGE_STAT
, and seems to have much the same purposeThe difference is that one category refers to a resolution shell from a data section, and the other to the whole data section. It seems to me that we don't need both -- resolution limits are needed in both cases anyway.
My initial suggestion:
ordinal_id
toPDBX_DIFFRN_MERGE_STAT
and make it part of the category keynon_negative_int
orpositive_int
rather thancode
_pdbx_diffrn_merge_stat.d_limit_high
and_pdbx_diffrn_merge_stat.d_limit_low
mandatoryPDBX_DIFFRN_MERGE_STAT_SHELL
.Row(s) in this category that refer to a whole data section could be identified from the widest resolution range. If there isn't a single resolution range for a data section that encloses all the shells for the same data section, that is no different from the situation where the
PDBX_DIFFRN_MERGE_STAT
category is absent now, and if any action is needed it should be handled in the same way.OTOH, if an explicit indicator for the row(s) in the category that refer to a whole data section is needed (rather than looking for the widest resolution range), this could be done in one of two ways:
ordinal_id
indicates the whole data section. A suitable value might be one of:.
with_item.mandatory_code no
(if this is allowed for a key item - I've been out of this game too long and I forget 😉 ).0
with a type ofnon_negative_int
1
with a type ofpositive_int
Comments anyone?
The text was updated successfully, but these errors were encountered: