Changes in version 5.2.5 (2026-01-26)

  o Wrap examples in donttest for cran policies

Changes in version 5.2.4 (2025-12-29)

  o Do not use CXX11 anymore in configure script (fixes R-4.6)

Changes in version 5.2.1 (2023-11-20)

  o Fix shell script for cross compilation

Changes in version 5.2.0 (2023-09-26)

  o Windows: update to tesseract 5.3.2

Changes in version 5.1.0 (2022-05-29)

  o Win: update to tesseract 5.1.0.
  o Win: apply patch for freezes when running under UTF-8 in R-4.2.
    See: https://github.com/tesseract-ocr/tesseract/issues/3830

Changes in version 5.0.0 (2022-01-10)

  o Win/Mac: update to libtesseract 5.0.1
  o Remove locale workaround on libtesseract 4.1+ (should only be needed for 4.0)
  o Remove cruft that was needed to support Solaris

Changes in version 4.2.0 (2021-12-03)

  o Prepare for API changes in upcoming Tesseract 5 release
  o Change the default language="eng" in tesseract()

Changes in version 4.1.2 (2021-09-18)

  o Fix for upstream master/main renames in language repos

Changes in version 4.1.1 (2021-05-10)

  o Win/Mac: update to libtesseract 4.1.1

Changes in version 4.1 (2019-07-25)

  o Fix memory leak in ocr_data()
  o Windows / MacOS: update to libtesseract 4.1.0. This re-enables the
    whitelist/blacklist options that were missing in Tesseract 4.0

Changes in version 4.0 (2018-11-02)

  o Windows, MacOS: Upgrade to upstream Tesseract 4.0! Completely new OCR engine.
  o Tesseract 4 has a new training data format. On Windows / MacOS you need to
    re-download your language data with tesseract_download(). The package uses
    separate directories for storing Tesseract 3 vs 4 data so they shouldn't get
    mixed up (hopefully).
  o Drop hard-dependency on tibble (only load if available)

Changes in version 2.3 (2018-08-10)

  o Fix problem with setlocale() not properly restoring locale.
  o Switch examples from dontrun{} to donttest{}, and '--run-donttest' on travis/appveyor

Changes in version 2.2 (2018-07-10)

  o Fixes for breaking changes in Tesseract 4.0.0 beta.3
  o Set LC_ALL = C when initiating tesseract
  o Include to support Tesseract 4

Changes in version 2.1 (2018-05-03)

  o Fixes for 4.0.0-beta.1: they switched to semver + other data branch
  o Set LC_CTYPE to "C" when loading training data (required for some Asian languages)
  o Add back OSD training data on Windows

Changes in version 2.0 (2018-03-02)

  o Set tesseract parameters at init so that all parameter types now actually work!
  o New function tesseract_params() lists all supported parameters and their defaults
  o Added 'config' argument to tesseract() which specifies a file with parameter values
  o Internally validate parameter names before init to prevent tesseract crashes
  o Rewrite the ocr_data() function in C++ to make it much faster
  o Tesseract 4 now gets data from the tessdata_fast repo as recommended upstream
  o Use default resolution of 300dpi when image does not contain resolution info

Changes in version 1.9 (2018-02-15)

  o Tesseract 4 now downloads training data from the "tessdata_fast" repo
  o Add ocr_data() function that parses the hOCR output

Changes in version 1.8 (2018-01-26)

  o Add support for hOCR output (#20)
  o Remove 'script' and 'orientation' attributes in output (doesn't work in Tesseract 4)

Changes in version 1.7

  o Add support for upcoming Tesseract 4 (compiler fix + separate tessdata dir)
  o Configure script now explicitly tests for CXX11 (required by Tesseract 4)

Changes in version 1.6 (2017-08-14)

  o Windows: update libtesseract to 3.05.01
  o tesseract_download() now uses 3.04 tree (instead of 4.00) as suggested in readme
  o For static packages on Win/Mac, languages are stored in:
    rappdirs::user_data_dir('tesseract')
  o Use 'png' instead of 'tiff' to read magick images
  o Compile with $(C_VISIBILITY) to hide internal symbols (requires Rcpp 0.12.12)
  o Use Rcpp symbol registration

Changes in version 1.4 (2017-03-21)

  o Run engine finalizer on R exit (requires Rcpp 0.12.10)
  o Move autobrew script to separate repository
  o Add symbol registration

Changes in version 1.3 (2016-12-07)

  o tesseract() gains an 'options' parameter for setting engine variables
  o New tesseract_download() function for installing training data on Win/Mac
  o Initiate default tesseract engine onAttach() to fail for missing training data
  o Add support for ocr() on magick images

Changes in version 1.2 (2016-11-13)

  o Try to fix build for CRAN OS-X, again.

Changes in version 1.1 (2016-11-10)

  o Try to fix build for CRAN OS-X build server
  o Show 'loaded' and 'available' languages in print.tesseract()

Changes in version 1.0 (2016-11-04)

  o Initial CRAN release