Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding box file for whole page results a very high Error rate #21

Closed
engahmed1190 opened this issue Aug 23, 2018 · 3 comments
Closed

Adding box file for whole page results a very high Error rate #21

engahmed1190 opened this issue Aug 23, 2018 · 3 comments
Labels
duplicate This issue or pull request already exists

Comments

@engahmed1190
Copy link

Hello .

I am facing an issue related to the Error rate I have added box file contains many lines break them by \t but the Error rate is very high and the lstm model is not converging.

Here is a page sample example.

image

train_sample.zip

@kba
Copy link
Collaborator

kba commented Aug 23, 2018

You need to train with line images. How did you generate the box file? See #7 for some hints on how to segment lines with other tools for training with tesseract.

@engahmed1190
Copy link
Author

@kba .
no way to train on full image .

@kba
Copy link
Collaborator

kba commented Aug 23, 2018

No, not at the moment, but we'll update #7 if/when we have any updates on this.

@kba kba closed this as completed Aug 23, 2018
@kba kba added the duplicate This issue or pull request already exists label Aug 23, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
duplicate This issue or pull request already exists
Projects
None yet
Development

No branches or pull requests

2 participants