Investigate translating mixed Simplified/Traditional script #965

Open
Tracked by #425
eu9ene opened this issue Dec 18, 2024 · 0 comments
Labels
language-coverage Issues related to covering specific languages

eu9ene (Collaborator) commented Dec 18, 2024

The options are:

  1. Train a joint zh-en model with a larger vocabulary. This would also require making sure we have enough Traditional-script data for training, and that this data is not Cantonese.
  2. Transliterate Traditional -> Simplified on the fly on the inference side, before the text reaches the model.
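
A minimal sketch of what option 2 could look like, assuming a character-level mapping is enough for a first experiment. The table `T2S_SAMPLE` and the function `to_simplified` are hypothetical names, and the table holds only a handful of characters for illustration; a real setup would need a complete conversion table (for example, the data shipped with a project like OpenCC). Characters that are already Simplified, or not Chinese at all, pass through unchanged, which is what makes this usable on mixed-script input.

```python
# Hypothetical, deliberately tiny Traditional -> Simplified table.
# Assumption: a real deployment would load a complete table instead.
T2S_SAMPLE = {
    "書": "书",
    "語": "语",
    "漢": "汉",
    "學": "学",
    "體": "体",
    "國": "国",
}

# str.maketrans accepts a dict of single characters to replacement strings.
_T2S_TABLE = str.maketrans(T2S_SAMPLE)


def to_simplified(text: str) -> str:
    """Map each known Traditional character to its Simplified form.

    Anything not in the table (Simplified characters, Latin text,
    punctuation) is left untouched, so mixed-script input is safe.
    """
    return text.translate(_T2S_TABLE)


if __name__ == "__main__":
    # Mixed input: "漢" and "國" are Traditional, "语" is already Simplified.
    print(to_simplified("漢语 and 國语"))
```

A purely character-level pass like this does not handle regional vocabulary differences or Cantonese-specific characters, so it would likely need evaluation against real mixed-script pages before being relied on.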
eu9ene added the language-coverage label Dec 18, 2024