Consistency Check: Morphology in Skeleton, OSHB #17

Open
jonathanrobie opened this issue Apr 2, 2022 · 1 comment

Comments

@jonathanrobie

We need to develop tooling that makes it possible for an expert like @rkjtan to systematically examine differences. For morphemes, we have used a consonant-only comparison, using | to indicate boundaries between morphemes. For instance:

Verse: 27002039

  1. ו|בתר|כ|תקומ|מלכו|אחרי|ארע|מנ|כ|ו|מלכו|תליתאה|אחרי|די|נחש|א|די|תשלט|ב|כל|ארע|א
  2. ו|בתר|כ|תקומ|מלכו|אחרי|ארעא|מנ|כ|ו|מלכו|תליתאה|אחרי|די|נחש|א|די|תשלט|ב|כל|ארע|א

Verse: 16005007

  1. ו|ימלכ|לב|י|על|י|ו|אריב|ה|את|ה|חרימ|ו|את|ה|סגנימ|ו|אמר|ה|ל|המ|משא|איש|ב|אחי|ו|אתמ|נשימ|ו|אתנ|עלי|המ|קהלה|גדולה
  2. ו|ימלכ|לב|י|על|י|ו|אריב|ה|את|ה|חרימ|ו|את|ה|סגנימ|ו|אמר|ה|ל|המ|משא|איש|ב|אחי|ו|אתמ|נשאימ|ו|אתנ|עלי|המ|קהלה|גדולה

Verse: 19021002

  1. יהוה|ב|עז|כ|ישמח|מלכ|ו|ב|ישועת|כ|מה|יגל|מאד
  2. יהוה|ב|עז|כ|ישמח|מלכ|ו|ב|ישועת|כ|מה|יגיל|מאד

This would be good to have as part of a general-purpose tool that also identifies other inconsistencies.
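A minimal sketch of how the consonant-only comparison could be automated, in Python. Everything here is an assumption for illustration: the function names, the `/` morpheme separator in the input, treating maqqef as a word boundary, and the folding of final letters to their regular forms (inferred from examples such as מנ above). The two inputs are hypothetical dicts mapping a verse ID to that verse's text.

```python
# Final-form letters folded to their regular forms (an assumption inferred
# from the examples, where final forms do not appear).
FINAL_TO_REGULAR = str.maketrans({
    "\u05da": "\u05db",  # final kaf -> kaf
    "\u05dd": "\u05de",  # final mem -> mem
    "\u05df": "\u05e0",  # final nun -> nun
    "\u05e3": "\u05e4",  # final pe -> pe
    "\u05e5": "\u05e6",  # final tsadi -> tsadi
})

MAQQEF = "\u05be"


def consonant_skeleton(text, boundary="/"):
    """Return the consonant-only morphemes of `text`, joined by '|'.

    `boundary` is the assumed morpheme separator inside a word.
    """
    morphemes = []
    # Treat maqqef as a word boundary so joined words stay separate.
    for word in text.replace(MAQQEF, " ").split():
        for morph in word.split(boundary):
            # Keep only Hebrew letters (U+05D0..U+05EA); this drops vowel
            # points, accents, and punctuation.
            letters = "".join(ch for ch in morph if "\u05d0" <= ch <= "\u05ea")
            if letters:
                morphemes.append(letters.translate(FINAL_TO_REGULAR))
    return "|".join(morphemes)


def mismatched_verses(source_a, source_b):
    """Yield (verse_id, skeleton_a, skeleton_b) where the skeletons differ."""
    for verse_id in sorted(source_a.keys() & source_b.keys()):
        a = consonant_skeleton(source_a[verse_id])
        b = consonant_skeleton(source_b[verse_id])
        if a != b:
            yield verse_id, a, b
```

Only verses present in both inputs are compared; verses missing from one side would need a separate report.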

@pdurusau

pdurusau commented Apr 2, 2022

I don't have the references at hand, but your examples remind me of the genome differencing algorithms that spot not only that the strings are different but also identify WHERE they differ. Looking at these, it is easy to see which line is longer, but WHERE it becomes longer is left as a task for the reader. I think highlighting the difference might help expert readers find where to concentrate. This is unlikely to be an XQuery task; it is more of an NLP one.
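As a rough sketch of the "WHERE" part, a generic sequence diff over the `|`-separated morphemes may already go a long way. The example below uses `difflib.SequenceMatcher` from the Python standard library rather than a genome-specific algorithm; the function names `morpheme_diff` and `highlight` and the `[ ]` marking are hypothetical choices for illustration.

```python
from difflib import SequenceMatcher


def morpheme_diff(line_a, line_b, sep="|"):
    """List each difference as (tag, index_a, morphemes_a, index_b, morphemes_b)."""
    a, b = line_a.split(sep), line_b.split(sep)
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    return [
        (tag, i1, a[i1:i2], j1, b[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


def highlight(line_a, line_b, sep="|"):
    """Rebuild both lines with the differing morphemes wrapped in [ ]."""
    a, b = line_a.split(sep), line_b.split(sep)
    out_a, out_b = [], []
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            out_a.extend(a[i1:i2])
            out_b.extend(b[j1:j2])
        else:
            # 'replace', 'delete' and 'insert' all get marked.
            out_a.extend(f"[{m}]" for m in a[i1:i2])
            out_b.extend(f"[{m}]" for m in b[j1:j2])
    return sep.join(out_a), sep.join(out_b)
```

Applied to the two lines of verse 19021002, `morpheme_diff` reports a single `replace` at morpheme index 11 (יגל versus יגיל), which is the kind of pointer an expert reader could jump to directly.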
