Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

small PR to download contributed models #68

Closed
wants to merge 33 commits into from

Conversation

pachadotdev
Copy link
Contributor

at some point we will provide a model for Romanian text, so here is a starting point for the already available contributed languages

@jeroen
Copy link
Member

jeroen commented Aug 4, 2024

tessdata_contrib only has training data for polytonic and Akkadian? Does anyone really use this?

@pachadotdev
Copy link
Contributor Author

Akkadian

yes, I briefly spoke to the librarian and at UofT Libraries before starting to use Tesseract, they are digitizing a lot of ancient and non-ancient things that go to a Postgres database

I organized this because I am creating my own data for Romanian and I shall upload it in September

@pachadotdev pachadotdev closed this Aug 7, 2024
@pachadotdev pachadotdev deleted the contributedmodels branch August 7, 2024 15:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants