Package: epubr 0.6.5
Matthew Leonawicz
epubr: Read EPUB File Metadata and Text
Provides functions supporting the reading and parsing of internal e-book content from EPUB files. The 'epubr' package provides functions supporting the reading and parsing of internal e-book content from EPUB files. E-book metadata and text content are parsed separately and joined together in a tidy, nested tibble data frame. E-book formatting is not completely standardized across all literature. It can be challenging to curate parsed e-book content across an arbitrary collection of e-books perfectly and in completely general form, to yield a singular, consistently formatted output. Many EPUB files do not even contain all the same pieces of information in their respective metadata. EPUB file parsing functionality in this package is intended for relatively general application to arbitrary EPUB e-books. However, poorly formatted e-books or e-books with highly uncommon formatting may not work with this package. There may even be cases where an EPUB file has DRM or some other property that makes it impossible to read with 'epubr'. Text is read 'as is' for the most part. The only nominal changes are minor substitutions, for example curly quotes changed to straight quotes. Substantive changes are expected to be performed subsequently by the user as part of their text analysis. Additional text cleaning can be performed at the user's discretion, such as with functions from packages like 'tm' or 'qdap'.
Authors:
epubr_0.6.5.tar.gz
epubr_0.6.5.zip(r-4.5)epubr_0.6.5.zip(r-4.4)epubr_0.6.5.zip(r-4.3)
epubr_0.6.5.tgz(r-4.4-any)epubr_0.6.5.tgz(r-4.3-any)
epubr_0.6.5.tar.gz(r-4.5-noble)epubr_0.6.5.tar.gz(r-4.4-noble)
epubr_0.6.5.tgz(r-4.4-emscripten)epubr_0.6.5.tgz(r-4.3-emscripten)
epubr.pdf |epubr.html✨
epubr/json (API)
NEWS
# Install 'epubr' in R: |
install.packages('epubr', repos = c('https://ropensci.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/ropensci/epubr/issues
epubepub-filesepub-formatpeer-reviewed
Last updated 2 months agofrom:0d4097e9a3 (on master). Checks:OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 10 2024 |
R-4.5-win | OK | Nov 10 2024 |
R-4.5-linux | OK | Nov 10 2024 |
R-4.4-win | OK | Nov 10 2024 |
R-4.4-mac | OK | Nov 10 2024 |
R-4.3-win | OK | Nov 10 2024 |
R-4.3-mac | OK | Nov 10 2024 |
Exports:count_wordsepubepub_catepub_headepub_metaepub_recombineepub_reorderepub_siftepub_unzip
Dependencies:clicpp11dplyrfansigenericsgluelifecyclemagrittrpillarpkgconfigpurrrR6Rcpprlangstringistringrtibbletidyrtidyselectutf8vctrswithrxml2xslt
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Word count | count_words |
Extract and read EPUB e-books | epub epub_meta epub_unzip |
Pretty printing of EPUB text | epub_cat |
Preview the first n characters | epub_head |
Recombine text sections | epub_recombine |
Reorder sections | epub_reorder |
Sift EPUB sections | epub_sift |
epubr: Read EPUB File Metadata and Text | epubr-package epubr |