Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

orthographic modes detected as if translation modes, but do not work #223

Open
jonorthwash opened this issue Nov 1, 2023 · 2 comments
Open

Comments

@jonorthwash
Copy link
Member

For example, orthographic conversion modes between zab_Simp and zab_Phon are detected as translation pair modes:

$ curl 'https://beta.apertium.org:/apy/listPairs'
... {"sourceLanguage": "zab_Phon", "targetLanguage": "zab_Simp"}, {"sourceLanguage": "zab_Simp", "targetLanguage": "zab_Phon"}, ...

However, conversion between them doesn't work via APy:

$ curl 'https://beta.apertium.org:/apy/listP/apy/translate?langpair=zab_Simp|zab_Phon&q=dizh'
{"status": "error", "code": 503, "message": "Service Unavailable", "explanation": "internal error"}

This does work in the command-line version of the mode:

$ echo "dizh" | apertium -f none -d . zab_Simp-zab_Phon
dìiʼzh

Some modes between different locales may legitimately be more than just orthography; i.e., they should be thought of as full translation pairs (e.g., between American and British English). So I'm wondering whether, instead of fixing this, it would make sense to name the orthography-only modes -ortho or something and have a separate detection process for them, and serve them separately (and eventually have a different interface for them in html-tools).

@unhammer
Copy link
Member

unhammer commented Nov 2, 2023

yeah some kind of naming convention would do it; apy just looks in the modes folder for pairs names xyz-xyz (where xyz can include underscores)

(Though ideally we'd have metadata associated with modes files to make the detection more robust.)

@jonorthwash
Copy link
Member Author

Agreed about metadata.

The big-picture question is whether it makes sense to treat orthographic conversion modes separately from translation modes. Conceptually they're pretty similar.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants