Package: tm.plugin.lexisnexis 1.4.1

tm.plugin.lexisnexis: Import Articles from 'LexisNexis' Using the 'tm' Text Mining Framework

Provides a 'tm' Source to create corpora from articles exported from the 'LexisNexis' content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages). Note that the file format is highly unstable: there is no warranty that this package will work for your corpus, and you may have to adjust the code to adapt it to your particular format.

Authors:Milan Bouchet-Valat [aut, cre], Tom Nicholls [ctb]

tm.plugin.lexisnexis_1.4.1.tar.gz
tm.plugin.lexisnexis_1.4.1.zip(r-4.5)tm.plugin.lexisnexis_1.4.1.zip(r-4.4)tm.plugin.lexisnexis_1.4.1.zip(r-4.3)
tm.plugin.lexisnexis_1.4.1.tgz(r-4.4-any)tm.plugin.lexisnexis_1.4.1.tgz(r-4.3-any)
tm.plugin.lexisnexis_1.4.1.tar.gz(r-4.5-noble)tm.plugin.lexisnexis_1.4.1.tar.gz(r-4.4-noble)
tm.plugin.lexisnexis_1.4.1.tgz(r-4.4-emscripten)tm.plugin.lexisnexis_1.4.1.tgz(r-4.3-emscripten)
tm.plugin.lexisnexis.pdf |tm.plugin.lexisnexis.html
tm.plugin.lexisnexis/json (API)
NEWS

# Install 'tm.plugin.lexisnexis' in R:
install.packages('tm.plugin.lexisnexis', repos = c('https://nalimilan.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/nalimilan/r.temis/issues

On CRAN:

text-mining

4.59 score 26 stars 1 packages 9 scripts 254 downloads 2 exports 9 dependencies

Last updated 9 months agofrom:393acd08f4. Checks:OK: 5 NOTE: 2. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 01 2024
R-4.5-winNOTENov 01 2024
R-4.5-linuxNOTENov 01 2024
R-4.4-winOKNov 01 2024
R-4.4-macOKNov 01 2024
R-4.3-winOKNov 01 2024
R-4.3-macOKNov 01 2024

Exports:LexisNexisSourcereadLexisNexisHTML

Dependencies:BHcliISOcodesNLPRcpprlangslamtmxml2

Readme and manuals

Help Manual

Help pageTopics
A plug-in for the tm text mining framework to import articles from LexisNexistm.plugin.lexisnexis-package tm.plugin.lexisnexis
LexisNexis Sourceeoi.LexisNexisSource getElem.LexisNexisSource LexisNexisSource
Read in a LexisNexis article in the HTML formatreadLexisNexisHTML