NEWS
tesseract 5.2.1 (2023-11-20)
- Fix shell script for cross compilation
tesseract 5.2.0 (2023-09-26)
- Windows: update to tesseract 5.3.2
tesseract 5.1.0 (2022-05-29)
- Win: update to tesseract 5.1.0.
- Win: apply patch for freezes when running under UTF-8 in R-4.2.
See: https://github.com/tesseract-ocr/tesseract/issues/3830
tesseract 5.0.0 (2022-01-10)
- Win/Mac: update to libtesseract 5.0.1
- Remove locale workaround on libtesseract 4.1+ (should only be needed for 4.0)
- Remove cruft that was needed to support Solaris
tesseract 4.2.0 (2021-12-03)
- Prepare for API changes in upcoming Tesseract 5 release
- Change the default language="eng" in tesseract()
tesseract 4.1.2 (2021-09-18)
- Fix for upstream master/main renames in language repos
tesseract 4.1.1 (2021-05-10)
- Win/Mac: update to libtesseract 4.1.1
tesseract 4.1 (2019-07-25)
- Fix memory leak in ocr_data()
- Windows / MacOS: update to libtesseract 4.1.0. This re-enables
the whitelist/blacklist options that were missing in Tesseract 4.0
tesseract 4.0 (2018-11-02)
- Windows, MacOS: Upgrade to upstream Tesseract 4.0! Completely new OCR engine.
- Tesseract 4 has a new training data format. On Windows / MacOS you need to
re-download your language data with tesseract_download(). The package uses
separate directories for storing Tesseract 3 vs 4 data so they shouldn't get
mixed up (hopefully).
- Drop hard-dependency on tibble (only load if available)
tesseract 2.3 (2018-08-10)
- Fix problem with setlocale() not properly restoring locale.
- Switch examples from dontrun{} to donttest{}, and '--run-donttest' on travis/appveyor
tesseract 2.2 (2018-07-10)
- Fixes for breaking changes in Tesseract 4.0.0 beta.3
- Set LC_ALL = C when initiating tesseract
- Include <tesseract/*> to support Tesseract 4
tesseract 2.1 (2018-05-03)
- Fixes for 4.0.0-beta.1: they switched to semver + other data branch
- Set LC_CTYPE to "C" when loading training data (required for some asian languages)
- Add back OSD training data on Windows
tesseract 2.0 (2018-03-02)
- Set tesseract parameters at init so that all parameters types now actually work!
- New function tesseract_params() lists all supported parameters and their default
- Added 'config' argument to tesseract() which specifies a file with parameter values
- Internally validate paremeter names before init to revent tesseract crashes
- Rewrite the ocr_data() function in C++ to make it much faster
- Tesseract 4 now gets data from the tessdata_fast repo as recommended upstream
- Use default resolution of 300dpi when image does not contain resolution info
tesseract 1.9 (2018-02-15)
- Tesseract 4 now dowloads training data from the "tessdata_fast" repo
- Add ocr_data() function that parses the hOCR output
tesseract 1.8 (2018-01-26)
- Add support for HOCR output (#20)
- Remove 'script' and 'orientation' attributes in output (doesn't work in Tesseract 4)
tesseract 1.7
- Add support upcoming Tesseract 4 (compiler fix + separate tessdata dir)
- Configure script now explicitly tests for CXX11 (required by Tesseract 4)
tesseract 1.6 (2017-08-14)
- Windows: update libtesseract to 3.05.01
- tesseract_download now uses 3.04 tree (instead of 4.00) as suggested in readme
- For static packags on Win/Mac, languages stored in: rappdirs::user_data_dir('tesseract')
- Use 'png' instead of 'tiff' to read magick images
- Compile with $(C_VISIBILITY) to hide internal symbols (requires Rcpp 0.12.12)
- Use Rcpp symbol registration
tesseract 1.4 (2017-03-21)
- Run engine finalizer on R exit (requires Rcpp 0.12.10)
- Move autobrew script to separate repository
- Add symbol registration
tesseract 1.3 (2016-12-07)
- tesseract() gains an 'options' parameter for setting engine variables
- New tessseract_download() function for installing training data on Win/Mac
- Initiate default tesseract engine onAttach() to fail for missing training data
- Add support for ocr() on magick images
tesseract 1.2 (2016-11-13)
- Try to fix build for CRAN OS-X, again.
tesseract 1.1 (2016-11-10)
- Try to fix build for CRAN OS-X build server
- Show 'loaded' and 'available' languages in print.tesseract()
tesseract 1.0 (2016-11-04)