Provides a 'tm' Source to create corpora from articles exported from the 'LexisNexis' content provider as HTML files. It reads both the text content and the metadata (including source, date, title, author and pages). Note that the file format is highly unstable: there is no guarantee that this package will work for your corpus, and you may have to adjust the code to fit your particular format.
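As a rough illustration of the workflow described above, the following minimal sketch builds a corpus from a single HTML export and looks at the metadata of its first article. The file name `lexisnexis_export.html` is a hypothetical placeholder, not a file shipped with the package, and the sketch assumes the source constructor is `LexisNexisSource()` with its default encoding. Since the export format varies, the call may fail or return incomplete metadata on your own files.

```r
library(tm)
library(tm.plugin.lexisnexis)

# Hypothetical path to an HTML file exported from LexisNexis;
# replace it with the location of your own export.
path <- "lexisnexis_export.html"

if (file.exists(path)) {
  # Wrap the file in a tm Source, then build a corpus from it
  corpus <- Corpus(LexisNexisSource(path))

  # Summary of the corpus (number of documents)
  print(corpus)

  # Metadata extracted for the first article
  # (source, date, title, author, pages, where available)
  print(meta(corpus[[1]]))
} else {
  message("No export found at '", path, "'; nothing to read.")
}
```

Checking `meta()` on a few documents right after import is a cheap way to notice when a given export layout is not parsed as expected.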
| Version: | 1.4.2 |
| Imports: | utils, NLP, tm (≥ 0.6), xml2, ISOcodes |
| Published: | 2025-03-28 |
| DOI: | 10.32614/CRAN.package.tm.plugin.lexisnexis |
| Author: | Milan Bouchet-Valat [aut, cre], Tom Nicholls [ctb] |
| Maintainer: | Milan Bouchet-Valat <nalimilan at club.fr> |
| BugReports: | https://github.com/nalimilan/R.TeMiS/issues |
| License: | GPL-2 or GPL-3 [expanded from: GPL (≥ 2)] |
| URL: | https://github.com/nalimilan/R.TeMiS |
| NeedsCompilation: | no |
| Materials: | NEWS |
| In views: | NaturalLanguageProcessing |
| CRAN checks: | tm.plugin.lexisnexis results |
| Reference manual: | tm.plugin.lexisnexis.html, tm.plugin.lexisnexis.pdf |
| Package source: | tm.plugin.lexisnexis_1.4.2.tar.gz |
| Windows binaries: | r-devel: tm.plugin.lexisnexis_1.4.2.zip, r-release: tm.plugin.lexisnexis_1.4.2.zip, r-oldrel: tm.plugin.lexisnexis_1.4.2.zip |
| macOS binaries: | r-release (arm64): tm.plugin.lexisnexis_1.4.2.tgz, r-oldrel (arm64): tm.plugin.lexisnexis_1.4.2.tgz, r-release (x86_64): tm.plugin.lexisnexis_1.4.2.tgz, r-oldrel (x86_64): tm.plugin.lexisnexis_1.4.2.tgz |
| Old sources: | tm.plugin.lexisnexis archive |
| Reverse imports: | R.temis |
| Reverse suggests: | RcmdrPlugin.temis |
Please use the canonical form https://CRAN.R-project.org/package=tm.plugin.lexisnexis to link to this page.