textrecipes: Extra 'Recipes' for Text Processing

Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allows for tokenization, filtering, counting (tf and tfidf) and feature hashing.

Version: 0.5.1
Depends: R (≥ 3.4), recipes (≥ 0.1.15)
Imports: lifecycle, dplyr, generics (≥ 0.1.0), magrittr, Matrix, purrr, Rcpp, rlang, SnowballC, tibble, tidyr, tokenizers, vctrs, glue
LinkingTo: Rcpp
Suggests: covr, janitor, knitr, modeldata, rmarkdown, sentencepiece, spacyr, stopwords, stringi, testthat (≥ 3.0.0), text2vec, textfeatures (≥ 0.3.3), tokenizers.bpe, udpipe, wordpiece
Published: 2022-03-29
Author: Emil Hvitfeldt ORCID iD [aut, cre]
Maintainer: Emil Hvitfeldt <emilhhvitfeldt at gmail.com>
BugReports: https://github.com/tidymodels/textrecipes/issues
License: MIT + file LICENSE
URL: https://github.com/tidymodels/textrecipes, https://textrecipes.tidymodels.org, https://textrecipes.tidymodels.org/
NeedsCompilation: yes
SystemRequirements: GNU make, C++11
Materials: README NEWS
CRAN checks: textrecipes results

Documentation:

Reference manual: textrecipes.pdf
Vignettes: Working with n-grams
Cookbook - Using more complex recipes involving text
Under the hood - tokenlist

Downloads:

Package source: textrecipes_0.5.1.tar.gz
Windows binaries: r-devel: textrecipes_0.5.1.zip, r-release: textrecipes_0.5.1.zip, r-oldrel: textrecipes_0.5.1.zip
macOS binaries: r-release (arm64): textrecipes_0.5.0.tgz, r-oldrel (arm64): textrecipes_0.5.1.tgz, r-release (x86_64): textrecipes_0.5.1.tgz, r-oldrel (x86_64): textrecipes_0.5.1.tgz
Old sources: textrecipes archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=textrecipes to link to this page.