The custom mapping already has the dash escapes (e.g., -apos-), but those that are looked up from Python's html.entities.codepoint2name do not get the dashes. The fix is essentially:
  elif codepoint in codepoint2name:
-     chars.append({codepoint2name[codepoint]})
+     chars.append(f"-{codepoint2name[codepoint]}-")
A second issue is that underscores are not escaped while spaces are replaced with underscores, so the two characters are conflated. The fix here is to add an underscore escape to the escape patterns: -lowbar-
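To show how the two fixes fit together, here is a minimal sketch. It is not the actual code in scripts/util.py: the names `CUSTOM_ESCAPES` and `escape_lemma` are hypothetical, the custom mapping is reduced to two entries, and passing all other characters through unchanged is an assumption.

```python
from html.entities import codepoint2name

# Hypothetical stand-in for the custom mapping in scripts/util.py.
# Entries here already carry their surrounding dashes.
CUSTOM_ESCAPES = {
    "'": "-apos-",
    "_": "-lowbar-",  # proposed addition so "_" is not confused with " "
}


def escape_lemma(lemma: str) -> str:
    """Escape a lemma (sketch only; assumes unknown characters pass through)."""
    chars = []
    for c in lemma:
        codepoint = ord(c)
        if c == " ":
            # Spaces become underscores, which is why "_" itself needs an escape.
            chars.append("_")
        elif c in CUSTOM_ESCAPES:
            chars.append(CUSTOM_ESCAPES[c])
        elif codepoint in codepoint2name:
            # Proposed fix: wrap names looked up from codepoint2name in dashes.
            chars.append(f"-{codepoint2name[codepoint]}-")
        else:
            chars.append(c)
    return "".join(chars)


print(escape_lemma("a b"))   # space becomes an underscore
print(escape_lemma("a_b"))   # underscore gets its own escape
print(escape_lemma("café"))  # name looked up from codepoint2name
```

With both changes, "a b" and "a_b" no longer map to the same string, and names coming from codepoint2name are delimited the same way as the custom ones.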
This came up in globalwordnet/english-wordnet#1107 (comment)
The buggy code is here:
omw-data/scripts/util.py
Lines 67 to 68 in 0a94425