tif_is_corpus_character
returns TRUE or FALSE for whether the input
is a valid character vector corpus object.
tif_is_tokens_list
returns TRUE or FALSE for whether the input
is a valid list-based tokens object.
tif_as_corpus_character
takes a valid tif corpus object and returns
a character vector corpus object.
tif_as_corpus_df
takes a valid tif corpus object and returns
a data frame corpus object.
tif_as_tokens_character
takes a valid tif tokens object and returns
a list-based tokens object.
tif_as_tokens_df
takes a valid tif tokens object and returns
a list-based tokens object.
tif_is_corpus_df
,
tif_is_dtm
and tif_is_tokens_df
. This is more in line with base-R
functions and separates the "df" version of the corpus and tokens from
the alternative new forms.tif_corpus_validate
returns TRUE or FALSE for whether the input
is a valid corpus object.
tif_dtm_validate
returns TRUE or FALSE for whether the input is
a valid document corpus object.
tif_tokens_validate
returns TRUE or FALSE for whether the input is
a valid tokens object.
do not yet have a test suite for the package
encoding checkin is not yet working