Merge pull request #18 from stweil/master
These models don't work with old versions of Tesseract.
zdenop authored Oct 23, 2018
2 parents 7274cfa + a7cb5a8 commit b893ed3
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion README.md
@@ -2,11 +2,13 @@

This repository contains fast integer versions of trained models for the [Tesseract Open Source OCR Engine](https://github.com/tesseract-ocr/tesseract).

These models only work with the LSTM OCR engine of Tesseract 4.

- These models are a speed/accuracy compromise: the network configuration that offered the best "value for money" in speed versus accuracy.
- For some languages, this is still the best model, but for most it is not.
- The "best value for money" network configuration was then integerized for further speed.
- Most users will want to use these traineddata files for OCR, and they will be shipped as part of Linux distributions, e.g. Ubuntu 18.04.
- Fine tuning/incremental training will **NOT** be possible from these `fast` models, as they are 8-bit integer.
- When using the models in this repository, only the new LSTM-based OCR engine is supported. The legacy `tesseract` engine is not supported with these files, so Tesseract's `--oem` modes `0` and `2` won't work with them.
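
The engine-mode restriction above can be sketched as a small Python helper that builds a `tesseract` command line and rejects the modes that need the legacy engine. This is an illustrative sketch, not part of this repository: the helper name `build_tesseract_command`, the file names, and the `tessdata_dir` path are hypothetical, and treating `--oem 3` (the default mode) as usable assumes it falls back to the LSTM engine when only LSTM data is available.

```python
# Hypothetical helper: builds (but does not run) a tesseract command line
# for use with the `fast` models, which only support the LSTM engine.

# OEM modes assumed to work with these models:
#   1 = LSTM engine only
#   3 = default mode (assumed to select LSTM when only LSTM data is present)
# Modes 0 (legacy only) and 2 (legacy + LSTM) need the legacy engine.
LSTM_COMPATIBLE_OEM = {1, 3}


def build_tesseract_command(image, output_base, lang="eng", oem=1, tessdata_dir=None):
    """Return the argument list for a tesseract run restricted to LSTM-capable modes."""
    if oem not in LSTM_COMPATIBLE_OEM:
        raise ValueError(
            f"--oem {oem} requires the legacy engine, which these models do not support"
        )
    cmd = ["tesseract", str(image), str(output_base), "--oem", str(oem), "-l", lang]
    if tessdata_dir is not None:
        # Point tesseract at the directory that holds the traineddata files.
        cmd += ["--tessdata-dir", str(tessdata_dir)]
    return cmd


if __name__ == "__main__":
    # Example paths are placeholders; pass the list to subprocess.run() to execute it.
    print(build_tesseract_command("page.png", "out", lang="deu",
                                  tessdata_dir="/opt/tessdata_fast"))
```

Rejecting modes `0` and `2` before Tesseract is invoked gives the caller an explicit error rather than relying on a failure at OCR time.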

## Two types of models