tm.plugin.webmining: Retrieve Structured, Textual Data from Various Web Sources

Facilitate text retrieval from feed formats like XML (RSS, ATOM) and JSON. Also direct retrieval from HTML is supported. As most (news) feeds only incorporate small fractions of the original text tm.plugin.webmining even retrieves and extracts the text of the original text source.

Version: 1.3
Depends: R (≥ 3.1.0)
Imports: NLP (≥ 0.1-2), tm (≥ 0.6), boilerpipeR, RCurl, XML, RJSONIO
Suggests: testthat
Published: 2015-05-11
Author: Mario Annau [aut, cre]
Maintainer: Mario Annau <mario.annau at gmail.com>
BugReports: https://github.com/mannau/tm.plugin.webmining/issues
License: GPL-3
URL: https://github.com/mannau/tm.plugin.webmining
NeedsCompilation: no
Materials: NEWS
In views: NaturalLanguageProcessing, WebTechnologies
CRAN checks: tm.plugin.webmining results

Documentation:

Reference manual: tm.plugin.webmining.pdf
Vignettes: Introduction to the tm.plugin.webmining Package

Downloads:

Package source: tm.plugin.webmining_1.3.tar.gz
Windows binaries: r-devel: tm.plugin.webmining_1.3.zip, r-release: tm.plugin.webmining_1.3.zip, r-oldrel: tm.plugin.webmining_1.3.zip
macOS binaries: r-release (arm64): tm.plugin.webmining_1.3.tgz, r-oldrel (arm64): tm.plugin.webmining_1.3.tgz, r-release (x86_64): tm.plugin.webmining_1.3.tgz, r-oldrel (x86_64): tm.plugin.webmining_1.3.tgz
Old sources: tm.plugin.webmining archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tm.plugin.webmining to link to this page.