Segmentation fault when using integer models for LSTM training #1573

DihuiLai · 2018-05-12T15:30:00Z

I am running the tutorial on training lstm by fine tuning it following the link https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-impact

The training works OK when I follow the tutorial instruction and fine tune from .lstm extracted from tessdata/best/eng.traineddata. However the training failed when I try to extract .lstm from tessdata/eng.traineddata

Environment

Tesseract Version: tesseract 4.0.0-beta.1-232-g45a6
Platform: <ubuntu 16.04>

The code I am trying to execute:
training/lstmtraining --model_output ~/tesstutorial/impact_from_full/impact --continue_from ~/tesstutorial/impact_from_full/eng.lstm --traineddata tessdata/eng.traineddata --train_listfile ~/tesstutorial/engeval/eng.training_files.txt --max_iterations 400

The eng.lstm is extracted by "training/combine_tessdata -e tessdata/eng.traineddata ~/tesstutorial/impact_from_full/eng.lstm"

The code will work if I use the tessdata/best/eng.traineddata

The error that I got:
Loaded file /home/dlai/tesstutorial/impact_from_full/eng.lstm, unpacking...
Warning: LSTMTrainer deserialized an LSTMRecognizer!
Continuing from /home/dlai/tesstutorial/impact_from_full/eng.lstm
Loaded 72/72 pages (1-72) of document /home/dlai/tesstutorial/engeval/eng.FreeSans.exp0.lstmf
!int_mode_:Error:Assert failed:in file weightmatrix.cpp, line 244
Segmentation fault (core dumped)

Thanks very much

Dihui

Shreeshrii · 2018-05-12T15:36:30Z

However the training failed when I try to extract .lstm from tessdata/eng.traineddata

Both tessdata and tessdata_fast have integer models which cannot be used for lstmtraining..
Only the float models in tessdata_best can be used for it.

Of course, it should give an appropriate error message and not crash.

@stweil Is it possible to add an error msg for 4.0.0?

DihuiLai · 2018-05-12T15:47:19Z

Thanks for your response, Shreeshrii,

I did read some comments on the integerize in your documentations and should have guessed this.

Still, is there a way to integerize the fine tuned model from the tessdata_best ? The speed of the model on tessdata_best is too slow for our application.

Dihui

Shreeshrii · 2018-05-12T15:51:25Z

The best files can be converted to integer by the following command

Usage for compacting LSTM component to int:
  combine_tessdata -c traineddata_file

The tessdata repo has the integer version of best models plus the old legacy model also.

Shreeshrii · 2018-05-14T11:29:26Z

@DihuiLai Please change issue title to

Segmentation fault when using integer models for LSTM training

stweil · 2018-05-14T12:40:29Z

Segmentation fault when using integer models for LSTM trining

s/trining/training/

@stweil Is it possible to add an error msg for 4.0.0?

Yes, I think so. I added the issue to the planning list.

@zdenop, please add the "bug" label to this issue.

Shreeshrii · 2018-05-14T13:25:56Z

@stweil Thanks for fixing the typo :-) Good to know that it can be fixed for 4.0.0.

DihuiLai · 2018-05-14T14:54:17Z

Changed @Shreeshrii

DihuiLai · 2018-09-06T14:53:03Z

The problem is solved and I am closing the issue

amitdo · 2018-09-17T19:58:27Z

AFAIK this issue was not solved.

stweil · 2018-09-17T20:13:31Z

It was only clarified that it was caused by training based on an integer model which is not allowed.
So that's an error which can be easily avoided. Of course the error handling needs to be improved here. @zdenop or @DihuiLai, please reopen this issue.

Although this is a bug, I think it can be fixed after 4.0.0, as training won't be done by most users of Tesseract.

zdenop · 2018-09-30T14:13:30Z

@stweil : can you send PR, so we can fix this for 4.0 release?

Tesseract currently cannot continue LSTM training from an integer (fast) model. Report this to users who try it nevertheless instead of crashing with an assertion. Signed-off-by: Stefan Weil <[email protected]>

Abort LSTM training with integer model (fixes issue #1573)

Tesseract currently cannot continue LSTM training from an integer (fast) model. Report this to users who try it nevertheless instead of crashing with an assertion. Signed-off-by: Stefan Weil <[email protected]>

stweil · 2021-09-06T10:15:49Z

The issue is now fixed in commits ec87dd4 (master) and 15aee49 (4.1 branch).

DihuiLai changed the title ~~Segmentation fault when training from .lstm extracted from tessdata/eng.traineddata~~ Segmentation fault when using integer models for LSTM trining May 14, 2018

DihuiLai changed the title ~~Segmentation fault when using integer models for LSTM trining~~ Segmentation fault when using integer models for LSTM training May 14, 2018

zdenop added the bug label May 15, 2018

DihuiLai closed this as completed Sep 6, 2018

zdenop reopened this Sep 18, 2018

stweil added this to the 4.1.0 milestone Feb 19, 2019

stweil mentioned this issue Feb 19, 2019

Tesseract Segmentation fault during fine tuning with fast traineddata #2255

Closed

stweil modified the milestones: 4.1.0, 5.0.0 Aug 29, 2019

amitdo mentioned this issue Apr 24, 2020

Segmentation fault ("Speicherzugriffsfehler") caused by "fast" traineddata #2921

Closed

tesseract-ocr deleted a comment from msute Jul 23, 2020

amitdo mentioned this issue Jul 26, 2020

!int_mode_:Error:Assert failed:in file weightmatrix.cpp, line 244 #3069

Closed

amitdo added training unexpected termination labels Mar 24, 2021

egorpugin added a commit that referenced this issue Sep 6, 2021

Merge pull request #3549 from stweil/issue1573

35dee46

Abort LSTM training with integer model (fixes issue #1573)

stweil closed this as completed Sep 6, 2021

stefan6419846 mentioned this issue May 20, 2022

Segfault in lstmtraining when training the demo data tesseract-ocr/tesstrain#269

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Segmentation fault when using integer models for LSTM training #1573

Segmentation fault when using integer models for LSTM training #1573

DihuiLai commented May 12, 2018

Shreeshrii commented May 12, 2018

DihuiLai commented May 12, 2018

Shreeshrii commented May 12, 2018

Shreeshrii commented May 14, 2018 •

edited

Loading

stweil commented May 14, 2018

Shreeshrii commented May 14, 2018

DihuiLai commented May 14, 2018

DihuiLai commented Sep 6, 2018

amitdo commented Sep 17, 2018

stweil commented Sep 17, 2018

zdenop commented Sep 30, 2018

stweil commented Sep 6, 2021

Segmentation fault when using integer models for LSTM training #1573

Segmentation fault when using integer models for LSTM training #1573

Comments

DihuiLai commented May 12, 2018

Environment

Shreeshrii commented May 12, 2018

DihuiLai commented May 12, 2018

Shreeshrii commented May 12, 2018

Shreeshrii commented May 14, 2018 • edited Loading

stweil commented May 14, 2018

Shreeshrii commented May 14, 2018

DihuiLai commented May 14, 2018

DihuiLai commented Sep 6, 2018

amitdo commented Sep 17, 2018

stweil commented Sep 17, 2018

zdenop commented Sep 30, 2018

stweil commented Sep 6, 2021

Shreeshrii commented May 14, 2018 •

edited

Loading