tesseract 4.0 compatibility #2581
Comments
This might improve performance of multiple recognitions and will be necessary when fixing #2581. Signed-off-by: Michal Čihař <[email protected]>
The big question is when the C locale is actually needed. We might load the tesserocr object once per process and then reuse it. That way we would fiddle with the locale just once instead of doing so in every OCR request.
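The load-once idea could be sketched roughly as follows. This is only an illustration, not the code that ended up in the project: `get_api` and `make_tesserocr_api` are hypothetical names, and the sketch assumes the C locale is required only while the object is created, which is exactly the open question here.

```python
import locale
from functools import lru_cache


@lru_cache(maxsize=None)
def get_api(factory):
    """Create the OCR object once per process, under the C locale.

    Hypothetical helper: later calls with the same factory return the
    cached object and do not touch the locale at all.
    """
    saved = locale.setlocale(locale.LC_ALL)  # query the current setting
    locale.setlocale(locale.LC_ALL, "C")
    try:
        return factory()
    finally:
        # Restore the previous locale even if the factory raises.
        locale.setlocale(locale.LC_ALL, saved)


def make_tesserocr_api():
    """Hypothetical factory; the import is lazy so it runs under the C locale."""
    import tesserocr

    return tesserocr.PyTessBaseAPI()
```

An OCR request would then call `get_api(make_tesserocr_api)` and reuse whatever it returns. If recognition itself also turns out to need the C locale, this approach alone would not be enough.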
We really should have skipped the test if OCR is not supported. Issue #2581 Signed-off-by: Michal Čihař <[email protected]>
Thank you for your report, the issue you have reported has just been fixed.
The context can raise an exception and in that case we would not properly reset the locale. Issue #2581 Signed-off-by: Michal Čihař <[email protected]>
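A minimal sketch of the pattern this commit message describes, using only the standard library. The name `c_locale` is hypothetical and this is not necessarily how the actual commit is written; it only shows why the reset belongs in a `finally` clause.

```python
import locale
from contextlib import contextmanager


@contextmanager
def c_locale():
    """Temporarily switch LC_ALL to the C locale and always restore it."""
    saved = locale.setlocale(locale.LC_ALL)  # query the current setting
    locale.setlocale(locale.LC_ALL, "C")
    try:
        yield
    finally:
        # Runs even when the body of the with block raises,
        # so the process is never left stuck in the C locale.
        locale.setlocale(locale.LC_ALL, saved)
```

Code that needs the C locale would run inside `with c_locale():`. Note that `setlocale` affects the whole process, so this sketch is not safe if other threads depend on the locale at the same time.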
This still happens with the current release and seems to be caused by the requirements check, see #2684 and WeblateOrg/docker#200.
Thank you for your report, the issue you have reported has just been fixed.
Describe the bug
The tesseract developers "fixed" the inability to properly parse files under different locales by asserting on the locale (see tesseract-ocr/tesseract@3292484). This leads to a Python crash when importing tesserocr with the wrong locale set, see sirfz/tesserocr#165.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
It should work no matter what locales are set.
Additional context
C.UTF-8 locales will work for tesseract as well, now that "Allow UTF-8 variant of C locale" (tesseract-ocr/tesseract#2272) got merged.