-
Notifications
You must be signed in to change notification settings - Fork 379
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
deu: Remove unwanted dependency #17
Conversation
The data included a configuration which required frk.traineddata ("tessedit_load_sublangs frk"). Remove that. Signed-off-by: Stefan Weil <[email protected]>
@jbreiden, there is no such dependency for
|
How can I see a complete list of these dependencies? |
Here it is:
So aze depends on aze_cyrl and aze_cyrl depends on aze. In addition uzb_ depends on uzb_cyrl and vice versa. I extracted the |
I'd better get that fixed before the International Summit of the Book, which will be held in Baku this year. |
Okay, package dependencies updated. Will be in Debian unstable tomorrow. Thanks for pointing this out. |
tesseract-ocr/tessdata@3a94ddd#commitcomment-23584234
That does not match with Stefan's list. Same principle for the two 'chi'. |
That comment is currently wrong for |
Can't possibly be intentional. @theraysmith |
Probably just confirms the view that tessdata_fast is NOT the integer version of tessdata_best. Rather, it is result of a different training, maybe with a different network spec. |
See also my related question on the tesseract-dev forum. |
Jeff is right. The differences in dependencies are not intentional. How does fast relate to best: For languages that have no eval data, both best and fast are a guess, based on using a configuration that worked well for the most closely related language. |
Thanks for the clarification, Ray.
Can you share the list of languages where it is best? |
The data included a configuration which required frk.traineddata
("tessedit_load_sublangs frk"). Remove that.
Signed-off-by: Stefan Weil [email protected]