-
Notifications
You must be signed in to change notification settings - Fork 9.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Option --psm 10 digits are not taken account. #2159
Comments
The content of the See #751. |
Thanks, I updated the expected behaviour. |
I have looked into this via debugging (adding
True. From what I can tell, that
True again. As @amitdo and @Shreeshrii pointed out, whitelisting will be re-instated for LSTMs after #2294 is merged. But even with that, for this example, nothing can be decoded (empty result), because the beam is too narrow. It can be made wider, e.g. by changing |
Please try with my finetuned traineddata which is limited to digits and some punctuation (instead of digits config). digits_layer.traineddata from https://github.com/Shreeshrii/tessdata_shreetest |
Environment
Current Behavior:
$ tesseract 000.png stdout --psm 10 digits
Warning: Invalid resolution 0 dpi. Using 70 instead.
iL
000.png:
Expected Behavior:
Since --psm 10 is specified, the output 'iL' is invalid. A single character should be recognized.
As 'digits' are specified as a config file, 'iL' is invalid. A digit character(0123456789-.) should be recognized.
Suggested Fix:
The text was updated successfully, but these errors were encountered: