-
Notifications
You must be signed in to change notification settings - Fork 9.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TFloat (FAST_FLOAT) work done & slightly different idea used to make code easily switchable between double & float #3490
Commits on Jan 15, 2021
-
Merge remote-tracking branch 'remotes/stweil/network-string'
# Conflicts: # src/training/combine_tessdata.cpp
Configuration menu - View commit details
-
Copy full SHA for cc2f5be - Browse repository at this point
Copy the full SHA cc2f5beView commit details -
Merge remote-tracking branch 'remotes/UB-Mannheim/windows'
# Conflicts: # src/ccutil/errcode.h # src/ccutil/serialis.cpp # src/ccutil/tprintf.h # src/viewer/scrollview.h
Configuration menu - View commit details
-
Copy full SHA for ebfb844 - Browse repository at this point
Copy the full SHA ebfb844View commit details -
Merge remote-tracking branch 'remotes/stweil/fuzzers'
# Conflicts: # Makefile.am # src/ccutil/helpers.h # src/ccutil/scanutils.h # src/ccutil/tprintf.h # unittest/Makefile.am
Configuration menu - View commit details
-
Copy full SHA for cacad1b - Browse repository at this point
Copy the full SHA cacad1bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 81b21e4 - Browse repository at this point
Copy the full SHA 81b21e4View commit details -
Merge remote-tracking branch 'remotes/UB-Mannheim/windows'
# Conflicts: # dll/i686-w64-mingw32/iconv.dll # dll/i686-w64-mingw32/icudt64.dll # dll/i686-w64-mingw32/icuin64.dll # dll/i686-w64-mingw32/icuuc64.dll # dll/i686-w64-mingw32/libarchive-13.dll # dll/i686-w64-mingw32/libbz2-1.dll # dll/i686-w64-mingw32/libcairo-2.dll # dll/i686-w64-mingw32/libcurl-4.dll # dll/i686-w64-mingw32/libeay32.dll # dll/i686-w64-mingw32/libexpat-1.dll # dll/i686-w64-mingw32/libffi-6.dll # dll/i686-w64-mingw32/libfontconfig-1.dll # dll/i686-w64-mingw32/libfreetype-6.dll # dll/i686-w64-mingw32/libgcc_s_sjlj-1.dll # dll/i686-w64-mingw32/libgif-7.dll # dll/i686-w64-mingw32/libglib-2.0-0.dll # dll/i686-w64-mingw32/libgobject-2.0-0.dll # dll/i686-w64-mingw32/libgomp-1.dll # dll/i686-w64-mingw32/libharfbuzz-0.dll # dll/i686-w64-mingw32/libintl-8.dll # dll/i686-w64-mingw32/libjbig-2.dll # dll/i686-w64-mingw32/libjpeg-8.dll # dll/i686-w64-mingw32/liblept-5.dll # dll/i686-w64-mingw32/liblz4-1.dll # dll/i686-w64-mingw32/liblzma-5.dll # dll/i686-w64-mingw32/liblzo2-2.dll # dll/i686-w64-mingw32/libnettle-6.dll # dll/i686-w64-mingw32/libnghttp2-14.dll # dll/i686-w64-mingw32/libopenjp2.dll # dll/i686-w64-mingw32/libpango-1.0-0.dll # dll/i686-w64-mingw32/libpangocairo-1.0-0.dll # dll/i686-w64-mingw32/libpangoft2-1.0-0.dll # dll/i686-w64-mingw32/libpangowin32-1.0-0.dll # dll/i686-w64-mingw32/libpcre-1.dll # dll/i686-w64-mingw32/libpixman-1-0.dll # dll/i686-w64-mingw32/libpng16-16.dll # dll/i686-w64-mingw32/libssh2-1.dll # dll/i686-w64-mingw32/libstdc++-6.dll # dll/i686-w64-mingw32/libtiff-5.dll # dll/i686-w64-mingw32/libwebp-7.dll # dll/i686-w64-mingw32/libwinpthread-1.dll # dll/i686-w64-mingw32/libxml2-2.dll # dll/i686-w64-mingw32/libzstd-1.dll # dll/i686-w64-mingw32/ssleay32.dll # dll/i686-w64-mingw32/zlib1.dll # dll/x86_64-w64-mingw32/iconv.dll # dll/x86_64-w64-mingw32/icudt64.dll # dll/x86_64-w64-mingw32/icuin64.dll # dll/x86_64-w64-mingw32/icuuc64.dll # dll/x86_64-w64-mingw32/libarchive-13.dll # dll/x86_64-w64-mingw32/libbz2-1.dll # dll/x86_64-w64-mingw32/libcairo-2.dll # dll/x86_64-w64-mingw32/libcurl-4.dll # dll/x86_64-w64-mingw32/libeay32.dll # dll/x86_64-w64-mingw32/libexpat-1.dll # dll/x86_64-w64-mingw32/libffi-6.dll # dll/x86_64-w64-mingw32/libfontconfig-1.dll # dll/x86_64-w64-mingw32/libfreetype-6.dll # dll/x86_64-w64-mingw32/libgcc_s_seh-1.dll # dll/x86_64-w64-mingw32/libgif-7.dll # dll/x86_64-w64-mingw32/libglib-2.0-0.dll # dll/x86_64-w64-mingw32/libgobject-2.0-0.dll # dll/x86_64-w64-mingw32/libgomp-1.dll # dll/x86_64-w64-mingw32/libharfbuzz-0.dll # dll/x86_64-w64-mingw32/libintl-8.dll # dll/x86_64-w64-mingw32/libjbig-2.dll # dll/x86_64-w64-mingw32/libjpeg-8.dll # dll/x86_64-w64-mingw32/liblept-5.dll # dll/x86_64-w64-mingw32/liblz4-1.dll # dll/x86_64-w64-mingw32/liblzma-5.dll # dll/x86_64-w64-mingw32/liblzo2-2.dll # dll/x86_64-w64-mingw32/libnettle-6.dll # dll/x86_64-w64-mingw32/libnghttp2-14.dll # dll/x86_64-w64-mingw32/libopenjp2.dll # dll/x86_64-w64-mingw32/libpango-1.0-0.dll # dll/x86_64-w64-mingw32/libpangocairo-1.0-0.dll # dll/x86_64-w64-mingw32/libpangoft2-1.0-0.dll # dll/x86_64-w64-mingw32/libpangowin32-1.0-0.dll # dll/x86_64-w64-mingw32/libpcre-1.dll # dll/x86_64-w64-mingw32/libpixman-1-0.dll # dll/x86_64-w64-mingw32/libpng16-16.dll # dll/x86_64-w64-mingw32/libssh2-1.dll # dll/x86_64-w64-mingw32/libstdc++-6.dll # dll/x86_64-w64-mingw32/libtiff-5.dll # dll/x86_64-w64-mingw32/libwebp-7.dll # dll/x86_64-w64-mingw32/libwinpthread-1.dll # dll/x86_64-w64-mingw32/libxml2-2.dll # dll/x86_64-w64-mingw32/libzstd-1.dll # dll/x86_64-w64-mingw32/ssleay32.dll # dll/x86_64-w64-mingw32/zlib1.dll # src/ccutil/errcode.h # src/ccutil/tprintf.h # src/viewer/scrollview.h
Configuration menu - View commit details
-
Copy full SHA for 0b418f5 - Browse repository at this point
Copy the full SHA 0b418f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for e30195a - Browse repository at this point
Copy the full SHA e30195aView commit details
Commits on Jan 27, 2021
-
Merge remote-tracking branch 'remotes/ulb-sachsen-anhalt/master'
# Conflicts: # configure.ac
Configuration menu - View commit details
-
Copy full SHA for c84f864 - Browse repository at this point
Copy the full SHA c84f864View commit details -
Merge remote-tracking branch 'remotes/tesseract-ocr/master'
# Conflicts: # Makefile.am # unittest/Makefile.am
Configuration menu - View commit details
-
Copy full SHA for fd58d5a - Browse repository at this point
Copy the full SHA fd58d5aView commit details
Commits on Jan 29, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 1d717cc - Browse repository at this point
Copy the full SHA 1d717ccView commit details -
updated Pix input format handling
# Conflicts: # src/api/pdfrenderer.cpp
Configuration menu - View commit details
-
Copy full SHA for 62dfe0b - Browse repository at this point
Copy the full SHA 62dfe0bView commit details
Commits on Jan 30, 2021
-
Configuration menu - View commit details
-
Copy full SHA for d92c8c7 - Browse repository at this point
Copy the full SHA d92c8c7View commit details
Commits on Feb 1, 2021
-
Configuration menu - View commit details
-
Copy full SHA for f3f83c5 - Browse repository at this point
Copy the full SHA f3f83c5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4ee6d50 - Browse repository at this point
Copy the full SHA 4ee6d50View commit details
Commits on Feb 7, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 9e725ec - Browse repository at this point
Copy the full SHA 9e725ecView commit details -
Configuration menu - View commit details
-
Copy full SHA for d632e6c - Browse repository at this point
Copy the full SHA d632e6cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 045d491 - Browse repository at this point
Copy the full SHA 045d491View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2edbe0d - Browse repository at this point
Copy the full SHA 2edbe0dView commit details
Commits on Feb 11, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 33e90db - Browse repository at this point
Copy the full SHA 33e90dbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 589c139 - Browse repository at this point
Copy the full SHA 589c139View commit details
Commits on Feb 13, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 34f2eb0 - Browse repository at this point
Copy the full SHA 34f2eb0View commit details
Commits on Feb 18, 2021
-
Configuration menu - View commit details
-
Copy full SHA for c570bbf - Browse repository at this point
Copy the full SHA c570bbfView commit details -
Configuration menu - View commit details
-
Copy full SHA for c319286 - Browse repository at this point
Copy the full SHA c319286View commit details -
Configuration menu - View commit details
-
Copy full SHA for d91dd4d - Browse repository at this point
Copy the full SHA d91dd4dView commit details
Commits on Feb 21, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 99be81c - Browse repository at this point
Copy the full SHA 99be81cView commit details
Commits on Feb 22, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 1e72f9d - Browse repository at this point
Copy the full SHA 1e72f9dView commit details
Commits on Feb 26, 2021
-
Implement unpack for lstmf files
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ed5e40e - Browse repository at this point
Copy the full SHA ed5e40eView commit details -
Support lstmf files with more than one line
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5876fc4 - Browse repository at this point
Copy the full SHA 5876fc4View commit details -
Add missing include statement for access
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0995870 - Browse repository at this point
Copy the full SHA 0995870View commit details -
Configuration menu - View commit details
-
Copy full SHA for f439170 - Browse repository at this point
Copy the full SHA f439170View commit details
Commits on Feb 27, 2021
-
Configuration menu - View commit details
-
Copy full SHA for eb62f07 - Browse repository at this point
Copy the full SHA eb62f07View commit details
Commits on Feb 28, 2021
-
Use Apple Accelerate framework for training and best models
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5469248 - Browse repository at this point
Copy the full SHA 5469248View commit details -
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f5cb128 - Browse repository at this point
Copy the full SHA f5cb128View commit details
Commits on Mar 5, 2021
-
Implement unpack for lstmf files
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9abc1bd - Browse repository at this point
Copy the full SHA 9abc1bdView commit details -
Support lstmf files with more than one line
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for abe760d - Browse repository at this point
Copy the full SHA abe760dView commit details -
Add missing include statement for access
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f689070 - Browse repository at this point
Copy the full SHA f689070View commit details -
Configuration menu - View commit details
-
Copy full SHA for 83a925b - Browse repository at this point
Copy the full SHA 83a925bView commit details -
Don't use threads for loading documents
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9f70fd1 - Browse repository at this point
Copy the full SHA 9f70fd1View commit details -
Use Apple Accelerate framework for training and best models
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a004e14 - Browse repository at this point
Copy the full SHA a004e14View commit details -
Remove unused code for serialization
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1414854 - Browse repository at this point
Copy the full SHA 1414854View commit details
Commits on Mar 8, 2021
-
Merge commit '1ab6b0fbc6ec757e9e7be44802448bcfd62df187'
# Conflicts: # .github/workflows/sw.yml # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for 2b9b9b9 - Browse repository at this point
Copy the full SHA 2b9b9b9View commit details -
- make sure everyone uses tprintf() instead of printf() / fprintf(), so any print output gets routed through the fz_error/warning/info calls and lands in a place where we can actually see/do something with it.
Configuration menu - View commit details
-
Copy full SHA for 99eedbe - Browse repository at this point
Copy the full SHA 99eedbeView commit details -
Merge remote-tracking branch 'remotes/UB-Mannheim/master'
# Conflicts: # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for 49a4d07 - Browse repository at this point
Copy the full SHA 49a4d07View commit details -
Merge commit '0cde3ede98ca9f63ea0ef94c294aee67243aaaa0'
# Conflicts: # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for 2ce0ab9 - Browse repository at this point
Copy the full SHA 2ce0ab9View commit details -
Configuration menu - View commit details
-
Copy full SHA for e82d245 - Browse repository at this point
Copy the full SHA e82d245View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8aa85e0 - Browse repository at this point
Copy the full SHA 8aa85e0View commit details -
Merge remote-tracking branch 'remotes/stweil/unpack'
# Conflicts: # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for 16c577c - Browse repository at this point
Copy the full SHA 16c577cView commit details -
Merge remote-tracking branch 'remotes/stweil/accelerate'
# Conflicts: # src/arch/simddetect.cpp
Configuration menu - View commit details
-
Copy full SHA for 245cf59 - Browse repository at this point
Copy the full SHA 245cf59View commit details -
added the tesseract training tools to the MuPDF build project.
exported each utility as a separate function (to be invoked via `mutool`)
Configuration menu - View commit details
-
Copy full SHA for 771c10a - Browse repository at this point
Copy the full SHA 771c10aView commit details -
Merge remote-tracking branch 'remotes/tesseract-ocr/master'
# Conflicts: # src/ccutil/serialis.cpp # src/ccutil/serialis.h # src/ccutil/unicharcompress.cpp
Configuration menu - View commit details
-
Copy full SHA for 57c9fe1 - Browse repository at this point
Copy the full SHA 57c9fe1View commit details
Commits on Mar 11, 2021
-
Merge remote-tracking branch 'remotes/tesseract-ocr/master'
# Conflicts: # src/ccmain/tessedit.cpp # src/ccmain/tesseractclass.h # src/ccutil/strngs.cpp # src/ccutil/strngs.h # src/lstm/lstmrecognizer.cpp
Configuration menu - View commit details
-
Copy full SHA for 88deb91 - Browse repository at this point
Copy the full SHA 88deb91View commit details -
fix errors after merge commit: missing changes that are needed too to…
… make this codebase compile.
Configuration menu - View commit details
-
Copy full SHA for 33b0a77 - Browse repository at this point
Copy the full SHA 33b0a77View commit details -
fix errors after merge commit: missing changes that are needed too to…
… make this codebase compile.
Configuration menu - View commit details
-
Copy full SHA for 3216647 - Browse repository at this point
Copy the full SHA 3216647View commit details -
Configuration menu - View commit details
-
Copy full SHA for 964a00e - Browse repository at this point
Copy the full SHA 964a00eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3921273 - Browse repository at this point
Copy the full SHA 3921273View commit details
Commits on Mar 12, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 712953c - Browse repository at this point
Copy the full SHA 712953cView commit details -
Configuration menu - View commit details
-
Copy full SHA for deee736 - Browse repository at this point
Copy the full SHA deee736View commit details
Commits on Mar 20, 2021
-
Merge remote-tracking branch 'remotes/tesseract-ocr/master'
# Conflicts: # include/tesseract/baseapi.h # include/tesseract/capi.h # include/tesseract/export.h # src/api/baseapi.cpp # src/api/capi.cpp # src/api/pdfrenderer.cpp # src/api/tesseractmain.cpp # src/arch/simddetect.cpp # src/ccmain/applybox.cpp # src/ccmain/output.cpp # src/ccmain/pageiterator.cpp # src/ccmain/paragraphs.cpp # src/ccmain/resultiterator.cpp # src/ccmain/tessedit.cpp # src/ccmain/tesseractclass.h # src/ccmain/thresholder.cpp # src/ccstruct/boxread.cpp # src/ccstruct/coutln.h # src/ccstruct/imagedata.cpp # src/ccstruct/mod128.cpp # src/ccstruct/ocrblock.cpp # src/ccstruct/points.h # src/ccstruct/polyaprx.cpp # src/ccstruct/rect.h # src/ccutil/errcode.h # src/ccutil/genericvector.h # src/ccutil/host.h # src/ccutil/params.cpp # src/ccutil/scanutils.h # src/ccutil/serialis.cpp # src/ccutil/strngs.cpp # src/ccutil/strngs.h # src/ccutil/tessdatamanager.cpp # src/ccutil/tessdatamanager.h # src/ccutil/tprintf.cpp # src/ccutil/tprintf.h # src/ccutil/unicharcompress.cpp # src/ccutil/unicharcompress.h # src/ccutil/unicharset.cpp # src/dict/dawg.cpp # src/dict/permdawg.cpp # src/dict/stopper.cpp # src/dict/trie.cpp # src/lstm/input.cpp # src/lstm/lstmrecognizer.cpp # src/lstm/recodebeam.cpp # src/lstm/series.cpp # src/lstm/tfnetwork.cpp # src/lstm/tfnetwork.h # src/opencl/oclkernels.h # src/opencl/openclwrapper.h # src/textord/blkocc.cpp # src/textord/drawtord.cpp # src/textord/fpchop.cpp # src/textord/makerow.cpp # src/textord/oldbasel.cpp # src/textord/pithsync.cpp # src/textord/pitsync1.cpp # src/textord/strokewidth.cpp # src/textord/topitch.cpp # src/textord/tordmain.cpp # src/textord/tospace.cpp # src/textord/wordseg.cpp # src/training/ambiguous_words.cpp # src/training/classifier_tester.cpp # src/training/cntraining.cpp # src/training/combine_lang_model.cpp # src/training/combine_tessdata.cpp # src/training/common/commandlineflags.cpp # src/training/common/commandlineflags.h # src/training/common/commontraining.cpp # src/training/common/commontraining.h # src/training/common/mastertrainer.h # src/training/dawg2wordlist.cpp # src/training/lstmeval.cpp # src/training/lstmtraining.cpp # src/training/merge_unicharsets.cpp # src/training/mftraining.cpp # src/training/pango/boxchar.cpp # src/training/pango/boxchar.h # src/training/pango/ligature_table.cpp # src/training/pango/pango_font_info.cpp # src/training/pango/pango_font_info.h # src/training/pango/stringrenderer.cpp # src/training/pango/stringrenderer.h # src/training/set_unicharset_properties.cpp # src/training/text2image.cpp # src/training/unicharset/icuerrorcode.cpp # src/training/unicharset/icuerrorcode.h # src/training/unicharset/normstrngs.cpp # src/training/unicharset/unicharset_training_utils.cpp # src/training/unicharset/validate_grapheme.cpp # src/training/unicharset/validate_myanmar.cpp # src/training/unicharset/validator.cpp # src/training/unicharset_extractor.cpp # src/training/wordlist2dawg.cpp # src/viewer/scrollview.h
Configuration menu - View commit details
-
Copy full SHA for ce5345b - Browse repository at this point
Copy the full SHA ce5345bView commit details -
code inspection: all printf() -> tprintf() + make sure all error mess…
…ages are prefixed with 'ERROR:' (and warning messages with 'WARNING:') for proper handling and dispatching in caller code (MuPDF-based tools which feed these messages to log file(s))
Configuration menu - View commit details
-
Copy full SHA for 1fe5cd5 - Browse repository at this point
Copy the full SHA 1fe5cd5View commit details -
Configuration menu - View commit details
-
Copy full SHA for c983073 - Browse repository at this point
Copy the full SHA c983073View commit details -
tweak: allow Leptonica to yak in debug builds when the severity envir…
…onment variable has not been set. This modifies the beehaviour as mentioned in commit SHA-1: 55d87f6
Configuration menu - View commit details
-
Copy full SHA for 33f7878 - Browse repository at this point
Copy the full SHA 33f7878View commit details -
CMake: SW_BUILD=OFF everywhere: we don't have SW. (See also same issu…
…e in Leptonica: I guess someone created or at least *edited* the CMakefiles for both)
Configuration menu - View commit details
-
Copy full SHA for d52dba8 - Browse repository at this point
Copy the full SHA d52dba8View commit details -
Configuration menu - View commit details
-
Copy full SHA for cc58bdd - Browse repository at this point
Copy the full SHA cc58bddView commit details
Commits on Mar 21, 2021
-
Configuration menu - View commit details
-
Copy full SHA for aec1500 - Browse repository at this point
Copy the full SHA aec1500View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3229481 - Browse repository at this point
Copy the full SHA 3229481View commit details -
Configuration menu - View commit details
-
Copy full SHA for 76d2c1f - Browse repository at this point
Copy the full SHA 76d2c1fView commit details -
Merge remote-tracking branch 'remotes/bhfo/master'
# Conflicts: # src/api/capi.cpp # src/training/common/commandlineflags.cpp
Configuration menu - View commit details
-
Copy full SHA for c382ad9 - Browse repository at this point
Copy the full SHA c382ad9View commit details -
Merge remote-tracking branch 'remotes/tesseract-ocr/master'
# Conflicts: # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for 059ec69 - Browse repository at this point
Copy the full SHA 059ec69View commit details
Commits on Mar 22, 2021
-
Fix SIMD architecture detection logic.
Originally, this code would have complained if it was ever compiled on a platform that didn't support it. We changed this so that every file could be built on every platform for simplicity of build files. Attempting to build (say) an SSE file on a platform that didn't support SSE will just compile away to nothing. Unfortunately, while making this change, I didn't remove the slightly strange state whereby it would be impossible to build without SSE optimisations on a platform that supported them. To fix this, I've removed the lines.
Configuration menu - View commit details
-
Copy full SHA for 1df5db9 - Browse repository at this point
Copy the full SHA 1df5db9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fac00f - Browse repository at this point
Copy the full SHA 0fac00fView commit details
Commits on Mar 29, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 8fa0c71 - Browse repository at this point
Copy the full SHA 8fa0c71View commit details -
Update training simplemake makefiles to correspond to master
Also fix a missing \ at line ending
Configuration menu - View commit details
-
Copy full SHA for f0bdc7b - Browse repository at this point
Copy the full SHA f0bdc7bView commit details -
Configuration menu - View commit details
-
Copy full SHA for b358a6a - Browse repository at this point
Copy the full SHA b358a6aView commit details -
Configuration menu - View commit details
-
Copy full SHA for d396db9 - Browse repository at this point
Copy the full SHA d396db9View commit details -
simplemake: Fix up ccutil file list and be more conservative in defau…
…lt optimisations
Configuration menu - View commit details
-
Copy full SHA for 852345e - Browse repository at this point
Copy the full SHA 852345eView commit details -
simplemake: Ensure version.h is generated first, update to c++20, and…
… ensure all .h files are correctly included in the build
Configuration menu - View commit details
-
Copy full SHA for c211742 - Browse repository at this point
Copy the full SHA c211742View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0e86e1f - Browse repository at this point
Copy the full SHA 0e86e1fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ae9001 - Browse repository at this point
Copy the full SHA 7ae9001View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7338bcf - Browse repository at this point
Copy the full SHA 7338bcfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 68ecee5 - Browse repository at this point
Copy the full SHA 68ecee5View commit details -
Merge remote-tracking branch 'remotes/Artifex/artifex'
# Conflicts: # src/arch/dotproductavx.cpp # src/arch/dotproductfma.cpp # src/arch/dotproductsse.cpp # src/arch/intsimdmatrixavx2.cpp # src/arch/intsimdmatrixsse.cpp
Configuration menu - View commit details
-
Copy full SHA for b42bbe8 - Browse repository at this point
Copy the full SHA b42bbe8View commit details -
Merge commit '205cd32184dfb3b9c4ad28681405babf76dbd7d0'
# Conflicts: # src/ccmain/paragraphs.cpp # src/dict/trie.cpp # src/training/unicharset/icuerrorcode.h # src/training/unicharset_extractor.cpp
Configuration menu - View commit details
-
Copy full SHA for b6f022a - Browse repository at this point
Copy the full SHA b6f022aView commit details -
Merge commit '7677b80408db08fcd97399b9f462c783dc018962'
# Conflicts: # abseil # src/api/baseapi.cpp # src/api/pdfrenderer.cpp # src/api/tesseractmain.cpp # src/ccmain/applybox.cpp # src/ccmain/pagesegmain.cpp # src/ccstruct/imagedata.cpp # src/ccstruct/mod128.cpp # src/ccstruct/pageres.cpp # src/lstm/lstmrecognizer.cpp # src/textord/colpartitiongrid.cpp # src/textord/makerow.cpp # src/textord/oldbasel.cpp # src/textord/strokewidth.cpp # src/textord/topitch.cpp # src/training/combine_tessdata.cpp # src/training/mftraining.cpp # src/training/text2image.cpp # src/training/unicharset/lang_model_helpers.cpp # src/training/unicharset/validate_grapheme.cpp # src/training/unicharset/validate_indic.cpp # src/training/unicharset/validate_javanese.cpp
Configuration menu - View commit details
-
Copy full SHA for e497820 - Browse repository at this point
Copy the full SHA e497820View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3421157 - Browse repository at this point
Copy the full SHA 3421157View commit details -
Configuration menu - View commit details
-
Copy full SHA for 81d965c - Browse repository at this point
Copy the full SHA 81d965cView commit details -
Merge remote-tracking branch 'remotes/bhfo/master'
# Conflicts: # src/training/unicharset/lang_model_helpers.cpp
Configuration menu - View commit details
-
Copy full SHA for b743c1c - Browse repository at this point
Copy the full SHA b743c1cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7aa3295 - Browse repository at this point
Copy the full SHA 7aa3295View commit details
Commits on Mar 30, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 16dc9d9 - Browse repository at this point
Copy the full SHA 16dc9d9View commit details
Commits on Apr 1, 2021
-
Merge remote-tracking branch 'remotes/Alan-love/master'
# Conflicts: # src/lstm/input.cpp # src/lstm/lstmrecognizer.cpp # src/viewer/scrollview.h
Configuration menu - View commit details
-
Copy full SHA for ebc6c2e - Browse repository at this point
Copy the full SHA ebc6c2eView commit details
Commits on Apr 3, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 734055d - Browse repository at this point
Copy the full SHA 734055dView commit details
Commits on Apr 7, 2021
-
Fix function GetFirstWords and modernize function GetPrefixes
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 054dba3 - Browse repository at this point
Copy the full SHA 054dba3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8404cf1 - Browse repository at this point
Copy the full SHA 8404cf1View commit details -
Update submodule abseil to tagged release 20210324.0
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for af5cb05 - Browse repository at this point
Copy the full SHA af5cb05View commit details
Commits on Apr 8, 2021
-
Julian Kent committed
Apr 8, 2021 Configuration menu - View commit details
-
Copy full SHA for dfe8410 - Browse repository at this point
Copy the full SHA dfe8410View commit details -
Configuration menu - View commit details
-
Copy full SHA for d731187 - Browse repository at this point
Copy the full SHA d731187View commit details
Commits on Apr 12, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 28756df - Browse repository at this point
Copy the full SHA 28756dfView commit details
Commits on Apr 14, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 856611d - Browse repository at this point
Copy the full SHA 856611dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4658aee - Browse repository at this point
Copy the full SHA 4658aeeView commit details -
Merge remote-tracking branch 'remotes/stweil/master'
# Conflicts: # abseil
Configuration menu - View commit details
-
Copy full SHA for da0e033 - Browse repository at this point
Copy the full SHA da0e033View commit details -
Merge commit 'f77b1c68814b15d0a2638b17aafb08ca96e26ccd'
# Conflicts: # include/tesseract/baseapi.h # include/tesseract/capi.h # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for 0735f9d - Browse repository at this point
Copy the full SHA 0735f9dView commit details -
Configuration menu - View commit details
-
Copy full SHA for e199ec7 - Browse repository at this point
Copy the full SHA e199ec7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9f826ec - Browse repository at this point
Copy the full SHA 9f826ecView commit details -
Configuration menu - View commit details
-
Copy full SHA for bffa449 - Browse repository at this point
Copy the full SHA bffa449View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4615014 - Browse repository at this point
Copy the full SHA 4615014View commit details
Commits on Apr 20, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 986f3a2 - Browse repository at this point
Copy the full SHA 986f3a2View commit details
Commits on Apr 22, 2021
-
Configuration menu - View commit details
-
Copy full SHA for a59cefa - Browse repository at this point
Copy the full SHA a59cefaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 28daea4 - Browse repository at this point
Copy the full SHA 28daea4View commit details
Commits on Apr 26, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 217f7cf - Browse repository at this point
Copy the full SHA 217f7cfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7440da5 - Browse repository at this point
Copy the full SHA 7440da5View commit details -
include small bash shell script to run CMake with the required path d…
…efines, etc. so we don't have to re-invent that wheel every time around.
Configuration menu - View commit details
-
Copy full SHA for 46c4517 - Browse repository at this point
Copy the full SHA 46c4517View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ea3419 - Browse repository at this point
Copy the full SHA 5ea3419View commit details -
lstmeval: Improve output by ensuring 'Truth:' text is encoded the sam…
…e as OCR output This ensures that transformations like unicode normalisation are done on the truth output as well as the OCR output, so that you can compare the two properly. Before this a perfect OCR could show different lines for Truth and OCR if the OCR output included characters that were normalised.
Configuration menu - View commit details
-
Copy full SHA for f49bc18 - Browse repository at this point
Copy the full SHA f49bc18View commit details -
lstmeval: Improve output by ensuring 'Truth:' text is encoded the sam…
…e as OCR output This ensures that transformations like unicode normalisation are done on the truth output as well as the OCR output, so that you can compare the two properly. Before this a perfect OCR could show different lines for Truth and OCR if the OCR output included characters that were normalised.
Configuration menu - View commit details
-
Copy full SHA for fb7542a - Browse repository at this point
Copy the full SHA fb7542aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 018bd6e - Browse repository at this point
Copy the full SHA 018bd6eView commit details
Commits on Apr 27, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 7aba1e4 - Browse repository at this point
Copy the full SHA 7aba1e4View commit details -
lstmeval: Improve output by ensuring 'Truth:' text is encoded the sam…
…e as OCR output This ensures that transformations like unicode normalisation are done on the truth output as well as the OCR output, so that you can compare the two properly. Before this a perfect OCR could show different lines for Truth and OCR if the OCR output included characters that were normalised.
Configuration menu - View commit details
-
Copy full SHA for 2acaac4 - Browse repository at this point
Copy the full SHA 2acaac4View commit details
Commits on Apr 28, 2021
-
Configuration menu - View commit details
-
Copy full SHA for dfcd8e0 - Browse repository at this point
Copy the full SHA dfcd8e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2703237 - Browse repository at this point
Copy the full SHA 2703237View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5bb97f9 - Browse repository at this point
Copy the full SHA 5bb97f9View commit details -
Revert "lstmeval: Improve output by ensuring 'Truth:' text is encoded…
… the same as OCR output" This reverts commit 2acaac4. # Conflicts: # src/training/unicharset/lstmtester.cpp
Configuration menu - View commit details
-
Copy full SHA for e7acb56 - Browse repository at this point
Copy the full SHA e7acb56View commit details
Commits on Apr 29, 2021
-
Merge remote-tracking branch 'remotes/tesseract-ocr/master'
# Conflicts: # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for cf9bc7f - Browse repository at this point
Copy the full SHA cf9bc7fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 06e6c72 - Browse repository at this point
Copy the full SHA 06e6c72View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2549a72 - Browse repository at this point
Copy the full SHA 2549a72View commit details
Commits on May 1, 2021
-
Configuration menu - View commit details
-
Copy full SHA for c5ce25e - Browse repository at this point
Copy the full SHA c5ce25eView commit details
Commits on May 5, 2021
-
Configuration menu - View commit details
-
Copy full SHA for b262c6c - Browse repository at this point
Copy the full SHA b262c6cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8ea7877 - Browse repository at this point
Copy the full SHA 8ea7877View commit details
Commits on May 6, 2021
-
Remove "v" prefix for version in banner
Instead of printing the version with an additional "v" (which leads to results like `Tesseract Open Source OCR Engine vv4.0.0-beta.4`), just print the version string in the banner text of Tesseract. Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e6c9f77 - Browse repository at this point
Copy the full SHA e6c9f77View commit details
Commits on May 10, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 1c19a9d - Browse repository at this point
Copy the full SHA 1c19a9dView commit details -
Configuration menu - View commit details
-
Copy full SHA for e91de69 - Browse repository at this point
Copy the full SHA e91de69View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3a164d - Browse repository at this point
Copy the full SHA e3a164dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c12e2d - Browse repository at this point
Copy the full SHA 5c12e2dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 447b31c - Browse repository at this point
Copy the full SHA 447b31cView commit details
Commits on May 11, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 9c82cc6 - Browse repository at this point
Copy the full SHA 9c82cc6View commit details -
lstmeval: Improve output by ensuring 'Truth:' text is encoded the sam…
…e way as OCR output This ensures that transformations like unicode normalisation are done on the truth output as well as the OCR output, so that you can compare the two properly. Before this a perfect OCR result could show different lines for Truth and OCR if the OCR output included characters that were normalised.
Configuration menu - View commit details
-
Copy full SHA for 6a2bf21 - Browse repository at this point
Copy the full SHA 6a2bf21View commit details
Commits on May 13, 2021
-
Configuration menu - View commit details
-
Copy full SHA for c0eb39a - Browse repository at this point
Copy the full SHA c0eb39aView commit details -
Configuration menu - View commit details
-
Copy full SHA for d17a87f - Browse repository at this point
Copy the full SHA d17a87fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 756c188 - Browse repository at this point
Copy the full SHA 756c188View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ed444c - Browse repository at this point
Copy the full SHA 3ed444cView commit details
Commits on May 15, 2021
-
Fix function GetFirstWords and modernize function GetPrefixes
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b04fbb4 - Browse repository at this point
Copy the full SHA b04fbb4View commit details -
Support Apple Accelerate framework for training and best models
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 334ac9f - Browse repository at this point
Copy the full SHA 334ac9fView commit details -
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for abef700 - Browse repository at this point
Copy the full SHA abef700View commit details
Commits on May 17, 2021
-
Configuration menu - View commit details
-
Copy full SHA for fdea3ae - Browse repository at this point
Copy the full SHA fdea3aeView commit details -
- Windows: print only the NAME of the application in on-line help; no…
…t the entire path
Configuration menu - View commit details
-
Copy full SHA for df498d9 - Browse repository at this point
Copy the full SHA df498d9View commit details
Commits on May 18, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 74705f4 - Browse repository at this point
Copy the full SHA 74705f4View commit details
Commits on May 19, 2021
-
Configuration menu - View commit details
-
Copy full SHA for bdf9db9 - Browse repository at this point
Copy the full SHA bdf9db9View commit details -
Support image width and height larger than 32767
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for eb8f13b - Browse repository at this point
Copy the full SHA eb8f13bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 28809d7 - Browse repository at this point
Copy the full SHA 28809d7View commit details
Commits on May 21, 2021
-
Merge commit '19db25e5e5d6af3d50b3eb0971b82500630a3531'
# Conflicts: # abseil
Configuration menu - View commit details
-
Copy full SHA for 949f868 - Browse repository at this point
Copy the full SHA 949f868View commit details -
Configuration menu - View commit details
-
Copy full SHA for dec75ee - Browse repository at this point
Copy the full SHA dec75eeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7622111 - Browse repository at this point
Copy the full SHA 7622111View commit details -
Configuration menu - View commit details
-
Copy full SHA for b952dd4 - Browse repository at this point
Copy the full SHA b952dd4View commit details -
Merge remote-tracking branch 'remotes/stweil/master'
# Conflicts: # src/arch/simddetect.cpp
Configuration menu - View commit details
-
Copy full SHA for 780a2ea - Browse repository at this point
Copy the full SHA 780a2eaView commit details -
Merge remote-tracking branch 'remotes/amitdo/threshold2'
# Conflicts: # include/tesseract/publictypes.h # src/ccmain/thresholder.cpp
Configuration menu - View commit details
-
Copy full SHA for dac82e3 - Browse repository at this point
Copy the full SHA dac82e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1859d77 - Browse repository at this point
Copy the full SHA 1859d77View commit details -
fix warnings about imprecise float constants' conversion: making sure…
… to write `0.1` as `0.1f`, etc. as these don't exactly map to an IEEE754 float32 value.
Configuration menu - View commit details
-
Copy full SHA for 7c44175 - Browse repository at this point
Copy the full SHA 7c44175View commit details
Commits on May 22, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 21f45bd - Browse repository at this point
Copy the full SHA 21f45bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for e4a89fc - Browse repository at this point
Copy the full SHA e4a89fcView commit details
Commits on May 23, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 409b537 - Browse repository at this point
Copy the full SHA 409b537View commit details
Commits on May 25, 2021
-
Support image width and height larger than 32767
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1e5b5af - Browse repository at this point
Copy the full SHA 1e5b5afView commit details -
Configuration menu - View commit details
-
Copy full SHA for c380b75 - Browse repository at this point
Copy the full SHA c380b75View commit details
Commits on May 27, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 4787a96 - Browse repository at this point
Copy the full SHA 4787a96View commit details
Commits on May 28, 2021
-
Fix serialization for new larger coordinates
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d78b443 - Browse repository at this point
Copy the full SHA d78b443View commit details
Commits on May 29, 2021
-
make sure all the tesseract tools are visible to the outside in monol…
…ithic build mode: the non-supported ones for a given build will simply report that they are NIL operations anyway.
Configuration menu - View commit details
-
Copy full SHA for 8546613 - Browse repository at this point
Copy the full SHA 8546613View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f6f954 - Browse repository at this point
Copy the full SHA 0f6f954View commit details -
fix very nasty obscure crashes inside system std::xhash code when exe…
…cuting Tesseract Init code, loading the 'tesseract_best' English language file(s): - https://stackoverflow.com/questions/17885060/passing-reference-to-stl-vector-over-dll-boundary Though we DO NOT cross a DLL boundary with that stuff (all the relevant code is included in one single "monolithic" DLL and none of that C++ stuff got outside!) we still got inexplicable crashes this way. The KEY to fixing this: you MUST MAKE SURE ALL RELEVANT MSVC PROJECT FILES HAVE THE **EXACT** **SAME** COMPILER SETTINGS: this has now been 'fixed' for Debug/Win32 build mode only as a PoC: we haven't updated all libraries yet, but this was plenty enough to make the basic bulktest run succeed again (instead of crash fatally) when executing mudraw commands, writing to *.ocr.html output files. Also note another consequence of our C++ compiler settings fiddling: - https://stackoverflow.com/questions/5004858/why-is-stdmin-failing-when-windows-h-is-included we applied the `std::max<int>(a, b)` tweak mentioned there instead of looking for the propr place to plonk a NOMINMAX for windows.h as this was faster and easier, also when we consider future compiler settings changing again as we work on our 'update' script for vcxproj files (TODO!)
Configuration menu - View commit details
-
Copy full SHA for 07a7567 - Browse repository at this point
Copy the full SHA 07a7567View commit details
Commits on May 30, 2021
-
Configuration menu - View commit details
-
Copy full SHA for beefde5 - Browse repository at this point
Copy the full SHA beefde5View commit details
Commits on Jun 10, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 80bfa8c - Browse repository at this point
Copy the full SHA 80bfa8cView commit details -
Merge remote-tracking branch 'remotes/nickjwhite/lstmevalbetteroutput…
…' into lstmevalshowconf # Conflicts: # src/training/unicharset/lstmtester.cpp
Configuration menu - View commit details
-
Copy full SHA for 7507fb1 - Browse repository at this point
Copy the full SHA 7507fb1View commit details -
Merge branch 'lstmevalshowconf'
# Conflicts: # src/training/unicharset/lstmtester.cpp
Configuration menu - View commit details
-
Copy full SHA for c64754a - Browse repository at this point
Copy the full SHA c64754aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1883512 - Browse repository at this point
Copy the full SHA 1883512View commit details -
Configuration menu - View commit details
-
Copy full SHA for bbf6c5b - Browse repository at this point
Copy the full SHA bbf6c5bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0995a90 - Browse repository at this point
Copy the full SHA 0995a90View commit details -
Merge remote-tracking branch 'remotes/Shreeshrii/unpack'
# Conflicts: # src/api/tesseractmain.cpp
Configuration menu - View commit details
-
Copy full SHA for ebd1837 - Browse repository at this point
Copy the full SHA ebd1837View commit details
Commits on Jun 12, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 7cc8ebd - Browse repository at this point
Copy the full SHA 7cc8ebdView commit details
Commits on Jun 16, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 0a4892a - Browse repository at this point
Copy the full SHA 0a4892aView commit details
Commits on Jun 18, 2021
-
CI: Replace g++-8 by g++-11 for MacOS
g++-8 is no longer installed, therefore CI fails for that compiler. Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a954e39 - Browse repository at this point
Copy the full SHA a954e39View commit details
Commits on Jun 19, 2021
-
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b11626d - Browse repository at this point
Copy the full SHA b11626dView commit details -
Fix function GetFirstWords and modernize function GetPrefixes
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c10667d - Browse repository at this point
Copy the full SHA c10667dView commit details -
Support Apple Accelerate framework for training and best models
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 664fc83 - Browse repository at this point
Copy the full SHA 664fc83View commit details
Commits on Jun 20, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 2a0017c - Browse repository at this point
Copy the full SHA 2a0017cView commit details -
Merge remote-tracking branch 'remotes/stweil/master'
# Conflicts: # src/arch/simddetect.cpp
Configuration menu - View commit details
-
Copy full SHA for 7f5ed28 - Browse repository at this point
Copy the full SHA 7f5ed28View commit details
Commits on Jun 21, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 80a4276 - Browse repository at this point
Copy the full SHA 80a4276View commit details
Commits on Jun 28, 2021
-
Support image width and height larger than 32767
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2ad9722 - Browse repository at this point
Copy the full SHA 2ad9722View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c47d71 - Browse repository at this point
Copy the full SHA 9c47d71View commit details -
Fix serialization for new larger coordinates
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5a0e815 - Browse repository at this point
Copy the full SHA 5a0e815View commit details
Commits on Jun 29, 2021
-
Fix vector resize with init for all elements (issue tesseract-ocr#3473)
Fixes: c8b8d26 Fixes: 9710bc0 Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b836f30 - Browse repository at this point
Copy the full SHA b836f30View commit details
Commits on Jul 3, 2021
-
Configuration menu - View commit details
-
Copy full SHA for ec6b822 - Browse repository at this point
Copy the full SHA ec6b822View commit details -
Fix function GetFirstWords and modernize function GetPrefixes
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0231ff7 - Browse repository at this point
Copy the full SHA 0231ff7View commit details -
Support Apple Accelerate framework for training and best models
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d5ab698 - Browse repository at this point
Copy the full SHA d5ab698View commit details -
Configuration menu - View commit details
-
Copy full SHA for f16e6f1 - Browse repository at this point
Copy the full SHA f16e6f1View commit details
Commits on Jul 4, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 7394908 - Browse repository at this point
Copy the full SHA 7394908View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7189fe1 - Browse repository at this point
Copy the full SHA 7189fe1View commit details -
Configuration menu - View commit details
-
Copy full SHA for dd9b988 - Browse repository at this point
Copy the full SHA dd9b988View commit details -
Configuration menu - View commit details
-
Copy full SHA for ddaadf9 - Browse repository at this point
Copy the full SHA ddaadf9View commit details -
Configuration menu - View commit details
-
Copy full SHA for be648f8 - Browse repository at this point
Copy the full SHA be648f8View commit details -
Merge remote-tracking branch 'remotes/stweil/master'
# Conflicts: # src/arch/simddetect.cpp
Configuration menu - View commit details
-
Copy full SHA for eb7071a - Browse repository at this point
Copy the full SHA eb7071aView commit details -
Merge remote-tracking branch 'remotes/stweil/tfloat'
# Conflicts: # src/arch/intsimdmatrixavx2.cpp # src/arch/intsimdmatrixsse.cpp
Configuration menu - View commit details
-
Copy full SHA for 36307a4 - Browse repository at this point
Copy the full SHA 36307a4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fdcdc5 - Browse repository at this point
Copy the full SHA 0fdcdc5View commit details
Commits on Jul 5, 2021
-
Add TFloat data type for neural network
Up to now Tesseract used double for training and recognition with "best" models. This commit replaces double by a new data type TFloat which is double by default, but float if FAST_FLOAT is defined. Ideally this should allow faster training. Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b77cd22 - Browse repository at this point
Copy the full SHA b77cd22View commit details
Commits on Jul 10, 2021
-
Merge remote-tracking branch 'remotes/stweil/tfloat' into TFloat
# Conflicts: # src/arch/dotproductavx.cpp # src/arch/intsimdmatrixavx2.cpp # src/arch/intsimdmatrixsse.cpp # src/arch/simddetect.cpp # src/ccutil/tfloat.h # src/lstm/weightmatrix.cpp # src/lstm/weightmatrix.h # unittest/intsimdmatrix_test.cc
Configuration menu - View commit details
-
Copy full SHA for ba769d5 - Browse repository at this point
Copy the full SHA ba769d5View commit details -
Merge remote-tracking branch 'remotes/StarUI/master'
# Conflicts: # src/ccstruct/imagedata.cpp
Configuration menu - View commit details
-
Copy full SHA for 9edb035 - Browse repository at this point
Copy the full SHA 9edb035View commit details -
Configuration menu - View commit details
-
Copy full SHA for 71c8c3a - Browse repository at this point
Copy the full SHA 71c8c3aView commit details -
fix shadowed variables (MSVC compiler warning about local vars shadow…
…ing other local vars of the same name)
Configuration menu - View commit details
-
Copy full SHA for e0a9b7c - Browse repository at this point
Copy the full SHA e0a9b7cView commit details -
Configuration menu - View commit details
-
Copy full SHA for d156480 - Browse repository at this point
Copy the full SHA d156480View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0a574a6 - Browse repository at this point
Copy the full SHA 0a574a6View commit details
Commits on Jul 11, 2021
-
Part 1: redesigned the TFloat approach using templates for the Serial…
…ization and Deserialization methods. Tested Deserialization with double (i.e. standard, non-optimized) layout: run-time type == storage type.
Configuration menu - View commit details
-
Copy full SHA for d397065 - Browse repository at this point
Copy the full SHA d397065View commit details -
consistent WIN32/WIN64 define check;
use a dedicated feature check for PRId32 where applicable (printf); support long paths on Windows (> 260 chars) by re-defining MAX_PATH to 4096.
Configuration menu - View commit details
-
Copy full SHA for 301aa3b - Browse repository at this point
Copy the full SHA 301aa3bView commit details -
reduce use of SSE,AVX,etc. build defines: let the software discover t…
…he available and enabled features at run-time. Doesn't cost anything and makes the code a little less cluttered with preprocessor checks.
Configuration menu - View commit details
-
Copy full SHA for 21d5cbb - Browse repository at this point
Copy the full SHA 21d5cbbView commit details -
CContinued work on SHA-1: d397065 --> Part 2: completed the redesign …
…of the TFloat approach using templates for the Serialization and Deserialization methods. Tested Deserialization with float (i.e. FAST_FLOAT) layout: run-time type (float) << storage type (double). Also tweaked the SEE/FMA/AVX/AVX2 code sections to use their optimized code while we use TFlaot=float instead of TFloat=double. (WARNING: edited, compiles okay, but has not been field tested yet!)
Configuration menu - View commit details
-
Copy full SHA for 31de23d - Browse repository at this point
Copy the full SHA 31de23dView commit details
Commits on Jul 12, 2021
-
building the monolithic unit tests. Tweaking the use of DISABLED_LEGA…
…CY_ENGINE and HAS_LIBICU define's to ensure both the regular monolithic build and the unit tests compile. Abseil is a mess and ditched for now; the LSTM tests have been tweaked to compile and link without obnoxious errors (abseil::StrCat was ditched as its the same as C++ std::string concatenation)
Configuration menu - View commit details
-
Copy full SHA for 38d777d - Browse repository at this point
Copy the full SHA 38d777dView commit details -
fix bugs in tesseract FAST_FLOAT DotProductSEE implementation
added quick & hacky benchmark code to tesseract-unittests to check the (relative) performance of the various DotProduct implementations.
Configuration menu - View commit details
-
Copy full SHA for 9c7d4ed - Browse repository at this point
Copy the full SHA 9c7d4edView commit details
Commits on Jul 13, 2021
-
Add TFloat data type for neural network
Up to now Tesseract used double for training and recognition with "best" models. This commit replaces double by a new data type TFloat which is double by default, but float if FAST_FLOAT is defined. Ideally this should allow faster training. Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 59af8dd - Browse repository at this point
Copy the full SHA 59af8ddView commit details -
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c64ab2e - Browse repository at this point
Copy the full SHA c64ab2eView commit details -
Optimize DotProductStdInnerProduct for float
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 78871a9 - Browse repository at this point
Copy the full SHA 78871a9View commit details -
Avoid double / float conversion
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1b9e462 - Browse repository at this point
Copy the full SHA 1b9e462View commit details -
Implement TFloat for IntSimdMatrix
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 93e9022 - Browse repository at this point
Copy the full SHA 93e9022View commit details -
Test more implementations of DotProduct
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 00e4283 - Browse repository at this point
Copy the full SHA 00e4283View commit details -
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e2529dd - Browse repository at this point
Copy the full SHA e2529ddView commit details -
Support Apple Accelerate framework for training and best models
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 01ae69e - Browse repository at this point
Copy the full SHA 01ae69eView commit details -
Configuration menu - View commit details
-
Copy full SHA for abe2d3b - Browse repository at this point
Copy the full SHA abe2d3bView commit details -
Merge remote-tracking branch 'remotes/stweil/tfloat' into TFloat
# Conflicts: # src/arch/dotproduct.h # src/arch/dotproductavx.cpp # src/arch/dotproductfma.cpp # src/arch/dotproductsse.cpp # src/arch/intsimdmatrix.h # src/arch/intsimdmatrixavx2.cpp # src/arch/intsimdmatrixneon.cpp # src/arch/intsimdmatrixsse.cpp # src/arch/simddetect.cpp # src/ccutil/tfloat.h # src/lstm/weightmatrix.cpp # unittest/intsimdmatrix_test.cc
Configuration menu - View commit details
-
Copy full SHA for 91d1f34 - Browse repository at this point
Copy the full SHA 91d1f34View commit details -
Configuration menu - View commit details
-
Copy full SHA for efc7601 - Browse repository at this point
Copy the full SHA efc7601View commit details -
bugfix of FMA port to FAST_FLOAT: 8 float FPs fit in a single 256bit …
…vector (8x32) (contrasting 4 double FPs: 4*64)
Configuration menu - View commit details
-
Copy full SHA for 30bf263 - Browse repository at this point
Copy the full SHA 30bf263View commit details -
Fix TFloat builds for Apple M1
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a09531a - Browse repository at this point
Copy the full SHA a09531aView commit details -
Configuration menu - View commit details
-
Copy full SHA for a220999 - Browse repository at this point
Copy the full SHA a220999View commit details -
Fix DotProductNative for TFloat
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1a59b6f - Browse repository at this point
Copy the full SHA 1a59b6fView commit details -
Merge branch 'tfloat-patch-4' into TFloat
# Conflicts: # src/arch/simddetect.cpp
Configuration menu - View commit details
-
Copy full SHA for b233ed4 - Browse repository at this point
Copy the full SHA b233ed4View commit details -
correct code to use the template-based serialization/deserialization …
…approach suggested in tesseract-ocr#3490
Configuration menu - View commit details
-
Copy full SHA for c114c1b - Browse repository at this point
Copy the full SHA c114c1bView commit details -
same as patch-4 (tesseract-ocr#3494) but now with reduced code duplic…
…ation: for TFloat to work, we don't need to duplicate the integer work functions as it's only the ExtractResults16[8,16] functions that need different implementations for float vs. double. These are therefor common to both implementations: ``` static void PartialMatrixDotVector64(const int8_t *wi, const TFloat *scales, const int8_t *u, int num_in, TFloat *v) { static void PartialMatrixDotVector32(const int8_t *wi, const TFloat *scales, const int8_t *u, int num_in, TFloat *v) { static void PartialMatrixDotVector16(const int8_t *wi, const TFloat *scales, const int8_t *u, int num_in, TFloat *v) { static inline void PartialMatrixDotVector8(const int8_t *wi, const TFloat *scales, const int8_t *u, int num_in, TFloat *v) { static void matrixDotVector(int dim1, int dim2, const int8_t *wi, const TFloat *scales, const int8_t *u, TFloat *v) { ```
Configuration menu - View commit details
-
Copy full SHA for d2eb7bd - Browse repository at this point
Copy the full SHA d2eb7bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for adb1c5a - Browse repository at this point
Copy the full SHA adb1c5aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2068d61 - Browse repository at this point
Copy the full SHA 2068d61View commit details -
Configuration menu - View commit details
-
Copy full SHA for e9de4a2 - Browse repository at this point
Copy the full SHA e9de4a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6d01734 - Browse repository at this point
Copy the full SHA 6d01734View commit details -
Merge pull request tesseract-ocr#1 from GerHobbelt/tfloat-patch-2
bugfix of FMA port to FAST_FLOAT: 8 float FPs fit in a single 256bit
Configuration menu - View commit details
-
Copy full SHA for b3adfdd - Browse repository at this point
Copy the full SHA b3adfddView commit details -
- added tfloat float+double DotProduct benchmark for the various inca…
…ntations: `unittest/tfloat_benchmark.cc` - working towards float+double co-existence as desired in stweil#2 (comment) using function templates for DRY as per query in stweil#2 (comment) - fix typo mistake in OpenMP code. (Probably me earlier this morning, too hurried.)
Configuration menu - View commit details
-
Copy full SHA for 6b59323 - Browse repository at this point
Copy the full SHA 6b59323View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9cc4a33 - Browse repository at this point
Copy the full SHA 9cc4a33View commit details -
Configuration menu - View commit details
-
Copy full SHA for 29e2379 - Browse repository at this point
Copy the full SHA 29e2379View commit details -
bugfixing the AVX2 Extract8+16 codes, where there's lines like `__m25…
…6d scale01234567 = _mm256_loadu_ps(scales)`, i.e. loading float vectors into double vector types. Extract from tesseract-ocr#3490.
Configuration menu - View commit details
-
Copy full SHA for 81b69b0 - Browse repository at this point
Copy the full SHA 81b69b0View commit details -
Configuration menu - View commit details
-
Copy full SHA for d23ec1d - Browse repository at this point
Copy the full SHA d23ec1dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d16bab - Browse repository at this point
Copy the full SHA 5d16babView commit details -
bugfixing the AVX2 Extract8+16 codes, where there's lines like `__m25…
…6d scale01234567 = _mm256_loadu_ps(scales)`, i.e. loading float vectors into double vector types. Extract from tesseract-ocr#3490.
Configuration menu - View commit details
-
Copy full SHA for 4e3c112 - Browse repository at this point
Copy the full SHA 4e3c112View commit details -
HMMM. This is where the float/double co-existence stuff starts to bec…
…ome NOT NICE: code repetition at another level. TODO: Better idea? --> Maybe namespaces and double kernel projects or compile via #define+#include-all-source-files hack collective source code pages? (Latter approach may become a problem when debugging, or will the compiler suite cope well? Will know only once done & tested.) At least this is about the point where the function template solution stops to be useful. The run-time switching desire between float and double is doable, but not by using #ifdef/#else throughout, nor templating all the way up the TFloat usage calltree.
Configuration menu - View commit details
-
Copy full SHA for 8d40552 - Browse repository at this point
Copy the full SHA 8d40552View commit details -
Reverting so we have a useful and still 'kinda clean' codebase.
Revert previous commit: "HMMM. This is where the float/double co-existence stuff starts to become NOT NICE: code repetition at another level." This reverts commit 8d40552.
Configuration menu - View commit details
-
Copy full SHA for 603831b - Browse repository at this point
Copy the full SHA 603831bView commit details -
Merge branch 'tfloat-AVX-SSE-etc' into TFloat
# Conflicts: # src/arch/dotproductsse.cpp
Configuration menu - View commit details
-
Copy full SHA for 160949a - Browse repository at this point
Copy the full SHA 160949aView commit details -
Merge branch 'tfloat-patch-4' into TFloat
# Conflicts: # src/arch/intsimdmatrixavx2.cpp
Configuration menu - View commit details
-
Copy full SHA for 02d94bc - Browse repository at this point
Copy the full SHA 02d94bcView commit details -
Improve build code for native dotproduct
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 15f7549 - Browse repository at this point
Copy the full SHA 15f7549View commit details -
Enhance unittest/dotproduct_test
Signed-off-by: Stefan Weil <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3eae6d7 - Browse repository at this point
Copy the full SHA 3eae6d7View commit details -
Merge remote-tracking branch 'remotes/stweil/tfloat' into TFloat
# Conflicts: # src/arch/dotproduct.cpp # src/arch/dotproductsse.cpp # src/arch/intsimdmatrixavx2.cpp
Configuration menu - View commit details
-
Copy full SHA for f32b9de - Browse repository at this point
Copy the full SHA f32b9deView commit details -
Looks like
defined(_OPENMP)
is what's known in the MSVC(2019) world……: added that one as another enabling condition since benchmarks have shown MSVC2019's `/openmp:experimental` to deliver. :-) (See tesseract-ocr#3486 benchmark reports on @stweil's DotProductNative() implementation)
Configuration menu - View commit details
-
Copy full SHA for a5d45b9 - Browse repository at this point
Copy the full SHA a5d45b9View commit details -
Looks like
defined(_OPENMP)
is what's known in the MSVC(2019) world……: added that one as another enabling condition since benchmarks have shown MSVC2019's `/openmp:experimental` to deliver. :-) (See tesseract-ocr#3486 benchmark reports on @stweil's DotProductNative() implementation)
Configuration menu - View commit details
-
Copy full SHA for d025c78 - Browse repository at this point
Copy the full SHA d025c78View commit details -
Configuration menu - View commit details
-
Copy full SHA for 44a8f41 - Browse repository at this point
Copy the full SHA 44a8f41View commit details