Package: tesseract Type: Package Title: Open Source OCR Engine Version: 5.2.5 Authors@R: person("Jeroen", "Ooms", role = c("aut", "cre"), email = "jeroenooms@gmail.com", comment = c(ORCID = "0000-0002-4035-0289")) Description: Bindings to 'Tesseract': a powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is highly configurable in order to tune the detection algorithms and obtain the best possible results. License: Apache License 2.0 URL: https://docs.ropensci.org/tesseract/ https://ropensci.r-universe.dev/tesseract BugReports: https://github.com/ropensci/tesseract/issues SystemRequirements: Tesseract >= 3.03 (libtesseract-dev / tesseract-devel) and Leptonica (libleptonica-dev / leptonica-devel). On Debian you need to install the English training data separately (tesseract-ocr-eng) Imports: Rcpp (>= 0.12.12), pdftools (>= 1.5), curl, rappdirs, digest LinkingTo: Rcpp RoxygenNote: 7.3.3 Roxygen: list(markdown = TRUE) Suggests: magick (>= 1.7), spelling, knitr, tibble, rmarkdown Encoding: UTF-8 VignetteBuilder: knitr Language: en-US Config/pak/sysreqs: libleptonica-dev libjpeg-dev libssl-dev libtesseract-dev tesseract-ocr-eng zlib1g-dev Repository: https://ropensci.r-universe.dev Date/Publication: 2026-01-26 13:10:43 UTC RemoteUrl: https://github.com/ropensci/tesseract RemoteRef: master RemoteSha: eb79775ec4fd5efb1949647087f87e2216282670 NeedsCompilation: yes Packaged: 2026-06-23 20:42:34 UTC; root Author: Jeroen Ooms [aut, cre] (ORCID: ) Maintainer: Jeroen Ooms