Inconsistent Preservation of OSHB's Unique Ids for Words #8
Comments
Yes, stripping these ids is the easiest thing to do, and we have another way to cross-reference, using our |
Stripped them by hand in the initial release. We need to add this step in the pipeline (it's just a |
On it.
Adding:
delete nodes ***@***.***
to prepare-oshb pipeline.
Patrick
On 4/2/22 11:24, Jonathan Robie wrote:
@pdurusau <https://github.com/pdurusau> @klosoter <https://github.com/klosoter> could one of you please add this to the |prepare-oshb| pipeline?
--
Patrick Durusau
***@***.***
Technical Advisory Board, OASIS (TAB)
Editor, OpenDocument Format TC (OASIS), Project Editor ISO/IEC 26300
Co-Editor, ISO/IEC 13250-1, 13250-5 (Topic Maps)
Another Word For It (blog): http://tm.durusau.net
Homepage: http://www.durusau.net
Twitter: patrickDurusau
|
To clarify, I'm adding it to prepare-oshb-for-trees.bxs as a separate step. |
Maybe all we need to do is change this line, in
<m>{ $w/@*, $w/text() }</m>
to this:
<m>{ $w/@* except $w/@id, $w/text() }</m>
Could you please see if that does the trick? It would save us a separate step in the pipeline. |
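For readers less familiar with XQuery's `except` operator, here is a rough Python analogue of the proposed change: build the `<m>` element from a `<w>` element, copying every attribute except the unprefixed `id`. This is only a sketch. The helper name `to_m` and the sample attribute values are hypothetical, and it assumes `<w>` is a leaf element whose only content is text.

```python
import xml.etree.ElementTree as ET


def to_m(w: ET.Element) -> ET.Element:
    """Return an <m> element with w's attributes (minus 'id') and w's text."""
    # Analogue of `$w/@* except $w/@id`: keep every attribute but 'id'.
    kept = {name: value for name, value in w.attrib.items() if name != "id"}
    m = ET.Element("m", kept)
    # Analogue of `$w/text()` for a leaf element with text content only.
    m.text = w.text
    return m


# Hypothetical input; real OSHB attribute values are not reproduced here.
w = ET.fromstring('<w id="hypothetical-id" lemma="x" morph="y">word</w>')
m = to_m(w)
print(ET.tostring(m, encoding="unicode"))
```

Dropping the attribute while the element is being built avoids a second pass over the document, which is the saving the comment above describes.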
Setting test for tomorrow. |
Because some words had to be broken up into constituent parts for analysis, a single OSHB unique id would have to be shared across a word's two or three constituent parts in order to carry over into the trees. For example:
"in beginning", "the heavens", "and [object marker]", and "the earth" all lost their OSHB unique ids because each was separated into two parts, while "created", "God", and "[object marker]" still show their OSHB ids in the trees. Perhaps we should strip all the OSHB ids to avoid confusion.
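The inconsistency described above can be checked mechanically. The following is a minimal sketch, assuming the tree words are `<m>` elements and the OSHB id lives in an unprefixed `id` attribute. The sample document, the `<sentence>` wrapper, and both helper names are hypothetical and stand in for the real tree files.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for a tree file: two parts of a split word
# (no id) followed by an unsplit word that kept its id.
SAMPLE = """<sentence>
  <m>part-1a</m>
  <m>part-1b</m>
  <m id="hypothetical-1">whole-word</m>
</sentence>"""


def id_coverage(root: ET.Element, tag: str = "m") -> tuple[int, int]:
    """Count (with_id, without_id) among elements named `tag`."""
    nodes = list(root.iter(tag))
    with_id = sum(1 for node in nodes if "id" in node.attrib)
    return with_id, len(nodes) - with_id


def strip_ids(root: ET.Element, tag: str = "m") -> int:
    """Remove the 'id' attribute from every `tag` element; return how many."""
    removed = 0
    for node in root.iter(tag):
        if node.attrib.pop("id", None) is not None:
            removed += 1
    return removed


root = ET.fromstring(SAMPLE)
before = id_coverage(root)
removed = strip_ids(root)
after = id_coverage(root)
print(before, removed, after)
```

A mixed count from `id_coverage` (some words with ids, some without) is the situation reported here. After `strip_ids`, no word carries an id, so the trees are at least consistent.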