forked from tesseract-ocr/tesseract
-
Notifications
You must be signed in to change notification settings - Fork 438
Project ideas
Stefan Weil edited this page Apr 10, 2023
·
4 revisions
As long as the networks are compatible with the features implemented in Tesseract, it should be possible to convert models made for Keras or Tensorflow to Tesseract and vice versa.
Maybe ONXX can be used as a common exchange format:
Related issues:
- Use Tesseract models for Kraken (requires conversion from Tesseract to Keras)
Tesseract uses Leptonica which can read many important image formats. Releant Leptonica API functions: pixRead
, more?
Missing formats:
Extending Leptonica to support additional image formats is not desired because each format costs much resources for implementation and maintenance. But maybe it is possible to use an external library for image handling. Then only support for that library must be implemented.
Possible libraries:
- ffmpeg2 – https://ffmpeg.org/, LGPL v2.1+, jxl
- graphicsmagick – http://www.graphicsmagick.org/, MIT license, jxl
- libimlib2 – https://docs.enlightenment.org/api/imlib2/html/index.html, license?, heic / jxl
- others?