-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Property] Many fields incorrectly described as percentage instead of proportion #224
Comments
Making changes to the headers would have a significant impact on all parties involved. In this regard, I want to refer to an issue that I have highlighted this in (as it's as relevant here): #215. Additionally, such changes would necessitate further operational rewrites on the Oasis LMF side, including adding additional code to divide by 100 wherever any of the fields are utilized. On a more positive note, making descriptions clearer is certainly a beneficial change. The rationale behind expressing these values as proportions makes all downstream processes smoother, as it eliminates the need to constantly divide by 100 during calculations. It would be advantageous to include clear descriptions for all relevant fields, stating something along the lines of: "All percentages are expressed as proportions to streamline downstream calculations. For example, 12.5% should be expressed as 0.125". This approach would add clarity and avoid the disruption caused by changing the headers. It is important to remember that the specifications should always be used in conjunction with descriptions and other available information, rather than relying solely on field names. The only instances I have found where the [0,1] range is omitted are as follows, and these are intentional:
|
@aiste-kalinauskaite I can see that in practice it's too late to correct the headers now but it's worth considering if more fields were to be added and worth improving the descriptions of the existing fields.
Not as far as I understand it. All those fields are already proportions. I don't see any that take percentage values (even though the names might suggest so). It would therefore just be a renaming exercise, but point taken that represents a breaking change.
Ideally yes, but it shouldn't be used as an excuse for accepting misleading naming. The names should give strong and accurate hints about the meaning of the values. I'm raising it now because I've seen smart people caught out by it recently.
A percentage is just one way of representing a proportion and it's not the one that's used here, so I would suggest only mentioning percentage as an example: "Proportion sprinklered. Values between 0 and 1 inclusive (0.5 = 50%); -999 means unknown." An additional explanation could be added: "Note that despite the name this is not a percentage value between 0 and 100 but a value between 0 and 1 representing a proportion." but maybe that's excessive.
OK, I see now that they link to the Other values sheet and are described as "Code that represents...". |
For OED v4 I'll go through the descriptions for the fields and make sure it is clear where values should be entered as proportions, but won't rename the Percent fields as it would be too disruptive. |
@carlfischerjba the comment below was made in relation to if your proposal were to go ahead:
As it is not, then it is not relevant. |
Description
Several columns are described as percentages which supposes values between 0 and 100 when they are in fact proportions between 0 and 1.
Reasons for change
It's essential for creators and users of OED to use the right values and for the spec to be unambiguous. 0.5% is not the same as a proportion of 0.5 which would be equivalent to 50%, however, 0.5 would be accepted in either case and therefore not flagged by validation tools.
Scope of change
Impact of change
Renaming fields to replace Percent with Proportion would be the clearest but would be a breaking change.
In some cases, the field name is correct but the description mentions a percentage. Any change to the description would be fully backwards compatible and would be a good first step.
Data type, default values, are blanks allowed, list valid values
The following fields have Percent in their name but should ideally have Proportion:
The following fields incorrectly mention "percentage" in their description instead of "proportion".
Additionally, some of the above do not contain valid values in the spec
[0, 1]
which means that errors will never be caught.The text was updated successfully, but these errors were encountered: