Adding box file for whole page results a very high Error rate #21

engahmed1190 · 2018-08-23T11:26:03Z

Hello .

I am facing an issue related to the Error rate I have added box file contains many lines break them by \t but the Error rate is very high and the lstm model is not converging.

Here is a page sample example.

train_sample.zip

The text was updated successfully, but these errors were encountered:

kba · 2018-08-23T12:11:42Z

You need to train with line images. How did you generate the box file? See #7 for some hints on how to segment lines with other tools for training with tesseract.

engahmed1190 · 2018-08-23T13:16:35Z

@kba .
no way to train on full image .

kba · 2018-08-23T14:04:15Z

No, not at the moment, but we'll update #7 if/when we have any updates on this.

kba closed this as completed Aug 23, 2018

kba added the duplicate This issue or pull request already exists label Aug 23, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding box file for whole page results a very high Error rate #21

Adding box file for whole page results a very high Error rate #21

engahmed1190 commented Aug 23, 2018

kba commented Aug 23, 2018

engahmed1190 commented Aug 23, 2018

kba commented Aug 23, 2018

Adding box file for whole page results a very high Error rate #21

Adding box file for whole page results a very high Error rate #21

Comments

engahmed1190 commented Aug 23, 2018

kba commented Aug 23, 2018

engahmed1190 commented Aug 23, 2018

kba commented Aug 23, 2018