Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Context, highlight wrong for typo found as substring #52

Closed
snomos opened this issue Jan 26, 2022 · 4 comments
Closed

Context, highlight wrong for typo found as substring #52

snomos opened this issue Jan 26, 2022 · 4 comments
Assignees
Labels
bug Something isn't working

Comments

@snomos
Copy link
Member

snomos commented Jan 26, 2022

Input text:

Su váldofágadutkamuš lei sosiálapedagogihkas, fáddán ”Samisk barneoppdragelse og kjønnssosialisering”, Oslo 1986.

The Norwegian quote is of course not recognised, and reported as errors. BUT the third Norw. word, og, is also found as a substring in the Sámi word sosiálapedagogihkas, and both the context and the highlight hits this point, instead of the correct part of the input:

Skjermbilde 2022-01-26 kl  11 36 09

Inspecting the data being transferred, it seems the indices are correct:

Skjermbilde 2022-01-26 kl  11 39 50

Running the text through the command line grammar checker returns the same indices as in the network data:

echo 'Su váldofágadutkamuš lei sosiálapedagogihkas, fáddán ”Samisk barneoppdragelse og kjønnssosialisering”, Oslo 1986.' \
| divvun-checker -a se.zcheck | jq .
{
  "errs": [
    [
      "Samisk",
      54,
      60,
      "typo",
      "Ii leat sátnelisttus",
      [],
      "Čállinmeattáhus"
    ],
    [
      "barneoppdragelse",
      61,
      77,
      "typo",
      "Ii leat sátnelisttus",
      [],
      "Čállinmeattáhus"
    ],
    [
      "og",
      78,
      80,
      "typo",
      "Ii leat sátnelisttus",
      [
        "on",
        "ok",
        "ogi",
        "oge",
        "",
        "ja",
        "go",
        "kg",
        "ii",
        ""
      ],
      "Čállinmeattáhus"
    ],
    [
      "kjønnssosialisering",
      81,
      100,
      "typo",
      "Ii leat sátnelisttus",
      [],
      "Čállinmeattáhus"
    ]
  ],
  "text": "Su váldofágadutkamuš lei sosiálapedagogihkas, fáddán ”Samisk barneoppdragelse og kjønnssosialisering”, Oslo 1986."
}

That is, it seems to be an error in the extension. It also appears in both MS Word and GDocs.

Tested with Chrome 97.0.4692.99 (Offisiell delversjon) (x86_64).

@snomos snomos added the bug Something isn't working label Jan 26, 2022
@snomos
Copy link
Member Author

snomos commented Aug 22, 2023

Here is a new example from SMJ: giellalt/lang-smj#27

@snomos
Copy link
Member Author

snomos commented Nov 12, 2023

A likely duplicate: #18

@snomos
Copy link
Member Author

snomos commented Nov 29, 2023

Fixed in GDocs, only deployment missing in Word.

@snomos
Copy link
Member Author

snomos commented Dec 7, 2023

Now also fixed in Word.

@snomos snomos closed this as completed Dec 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants