Skip to content

Releases: VikParuchuri/surya

Bug fixes

16 May 22:00
c89e015
Compare
Choose a tag to compare
  • Fix memory leak with layout and text detection models and large batch sizes
  • Improve ordering model generation slightly

Save memory when pruning MoE

09 May 17:24
11faf42
Compare
Choose a tag to compare
  • Prune MoE experts before loading model
  • Unpin torch version from 2.2.2

Fix issue with torch and dependencies

08 May 20:02
2ba7e19
Compare
Choose a tag to compare
Merge pull request #96 from VikParuchuri/dev

Fix publishing issue

Add OCR batch size option

08 May 19:30
2133e3f
Compare
Choose a tag to compare
Merge pull request #95 from VikParuchuri/dev

Add batch size to OCR

Set batch sizes programatically

08 May 19:02
40a1043
Compare
Choose a tag to compare
  • Programmatic batch sizes for all models
  • Update package versions
  • Bugfixes

Release ordering model

22 Apr 17:16
e8c98ac
Compare
Choose a tag to compare
  • Release reading order model
  • Add Google Cloud OCR benchmarks to README

Release layout model

26 Mar 18:48
3cdc3b6
Compare
Choose a tag to compare
  • Add layout model
  • Multiprocessing for detection postprocessing (can be much faster)
  • Fix minor bugs

Fix dependency import

06 Mar 20:03
ce8e95b
Compare
Choose a tag to compare
Merge pull request #52 from VikParuchuri/dev

Fix import issue

Improve line detector

06 Mar 19:36
f68379a
Compare
Choose a tag to compare
  • Improved line detector with better recall and higher resolution
  • Confidence scores for detection and recognition
  • Beta layout detection model

Basic sorting, repetition detection, UTF-8 output

16 Feb 22:27
534c237
Compare
Choose a tag to compare
  • Output JSON as UTF-8 so text can be scanned easily
  • Improve output schema to include more information and be easier to parse
  • Sort text lines in basic order, which will work in many cases
  • Detect repetitions and truncate