Skip to content

Releases: tesseract-ocr/tesseract

5.0.0-rc1

29 Oct 20:43
5.0.0-rc1

Choose a tag to compare

This is the first release candidate of Tesseract 5.0.0.

  • Enable fast float32 LSTM by default
  • Switch to NFC normalisation everywhere
  • Remove banner message
  • Disable music staff detection and removal
  • Add new command line option --loglevel
  • Bug fixes

See also list of all changes.

5.0.0-beta-20210916

16 Sep 04:48
5.0.0-beta-20210916

Choose a tag to compare

5.0.0-beta-20210916 Pre-release
Pre-release

This is a new pre-release of Tesseract 5.0.0.

  • Bug fixes
  • Extend URI support for Tesseract with libcurl
  • Rename processed TIFF output file and add page number if needed

See also list of all changes.

5.0.0-beta-20210815

15 Aug 15:12
5.0.0-beta-20210815

Choose a tag to compare

5.0.0-beta-20210815 Pre-release
Pre-release

This is a new pre-release of Tesseract 5.0.0.

  • Bug fixes
  • Modernize more code
  • More options for binarization
  • Improved support for ARM NEON
  • No longer depends on Abseil for unit tests
  • Support float for model training and text recognition (faster, requires less RAM)

See also list of all changes.

5.0.0-alpha-20210401

01 Apr 20:23
5.0.0-alpha-20210401

Choose a tag to compare

5.0.0-alpha-20210401 Pre-release
Pre-release

This is a new pre-release of Tesseract 5.0.0.

  • Replaced all remaining STRING by std::string
  • Replaced lots of GenericVector by std::vector
  • Replaced all malloc / free by C++ code
  • Modernized and formatted code

See also list of all changes.

5.0.0-alpha-20201231

31 Dec 18:52
5.0.0-alpha-20201231

Choose a tag to compare

5.0.0-alpha-20201231 Pre-release
Pre-release

This is a new pre-release of Tesseract 5.0.0.

It has massive changes in the public API which is a great step towards a final 5.0.0.
All unit tests pass, but because of those changes more practical experience is needed.

  • the public API no longer uses proprietary data types GenericVector, STRING
  • pdf.ttf is no longer needed because it is now integrated into the code

See also list of all changes.

5.0.0-alpha-20201224

24 Dec 08:37
5.0.0-alpha-20201224

Choose a tag to compare

5.0.0-alpha-20201224 Pre-release
Pre-release

This is a new pre-release of Tesseract 5.0.0.

It is considered to be production ready for end users, but nevertheless not stable because more incompatible API changes are planned.

  • improved performance (also on ARM / ARM64)
  • improved unit tests
  • many fixes
  • faster flat build with automake
  • support for latest macOS (including new M1 processor)

See also list of all changes.

4.1.1 Release

26 Dec 15:43

Choose a tag to compare

  • Implemented sw build (cppan is deprecated)
  • Improved cmake build
  • Code cleanup and optimization
  • A lot of bug fixes...

4.1.0 Release

07 Jul 13:43

Choose a tag to compare

  • Added new renderers Alto, LSTMBox, WordStrBox.
  • Added character boxes in hOCR output.
  • Added python training scripts (experimental) as alternative shell scripts.
  • Better support AVX / AVX2 / SSE.
  • Disable OpenMP support by default (see e.g. #1171, #1081).
  • Fix for bounding box problem.
  • Implemented support for whitelist/blacklist in LSTM engine.
  • Improved cmake configuration.
  • Code modernization and improvements.
  • A lot of bug fixes...

Detailed changelog is on wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

4.0.0 Release

29 Oct 09:00

Choose a tag to compare

Detailed Release notes, Changelog and documentation can be found in project wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

3.05.02 Release

19 Jun 09:16

Choose a tag to compare

Bug fix release