Changes in version 1.8.1 (2025-03-28) o Fix R CMD check NOTE about links in docs. Changes in version 1.8 (2019-10-19) o Stop truncating article IDs to avoid duplicates (thanks to Tom Nicholls). o Handle non-numeric 'page' metadata entries (thanks to Tom Nicholls). Changes in version 1.7 (2017-11-20) o Port from XML to xml2 package to support tm 0.8. Changes in version 1.6 (2017-02-08) o Avoid importing each article twice with new Factiva HTML format. o Add screencast showing how to export correct HTML files in ?FactivaSource. Changes in version 1.5 (2014-07-05) o Fix encoding issues on non-UTF-8 systems, adding back the 'encoding' argument to work around a bug in package XML. Changes in version 1.4 (2014-06-11) o Adapt to tm 0.6. o Remove the 'encoding' argument to FactivaSource() as it is not supported by tm 0.6 (normally not needed). o Change all tags to lowercase (for consistency with tm). o Ensure meta-data variables which are supposed to contain only one value always do so. Changes in version 1.3 (2014-01-11) o Extract Company, Industry, Information Provider Code (IPC) and Information Provider Description (IPD) meta-data (based on a patch by Grigorij Ljubownikow). o Remove inconsistent line breaks in HTML format. o Update to support tm 0.5-10 and clean the code a bit. Changes in version 1.2 (2013-01-29) o Extract Subject and Coverage meta-data. o Add Reuters21578 example. o Fix handling of articles with no header or body. o Split lead paragraphs into separate lines. o Fix package help page to mention HTML. Changes in version 1.1 (2012-07-01) o Add support for HTML files since Factiva no longer allows exporting to XML. o Work around encoding issues on Windows (for HTML only). o Preserve paragraphs information so that e.g. makeChunks() from tm can be used to split documents into smaller pieces. Changes in version 1.0 (2012-05-14) o Initial release with support for XML files.