koRpus: An R Package for Text Analysis

A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats). #' Note: For full functionality a local installation of TreeTagger is recommended. Be encouraged to send feedback to the author(s)!

Version: 0.04-40
Depends: R (≥ 2.10.0), methods
Suggests: testthat, tm, Snowball
Enhances: rkward
Published: 2013-04-08
Author: m.eik michalke, with contributions from Earl Brown, Alberto Mirisola, Alexandre Brulet, and Laura Hauser
Maintainer: m.eik michalke <meik.michalke at hhu.de>
License: GPL (≥ 3)
URL: http://reaktanz.de/?c=hacking&s=koRpus
NeedsCompilation: no
Citation: koRpus citation info
In views: NaturalLanguageProcessing
CRAN checks: koRpus results

Downloads:

Package source: koRpus_0.04-40.tar.gz
MacOS X binary: koRpus_0.04-40.tgz
Windows binary: koRpus_0.04-40.zip
Reference manual: koRpus.pdf
Vignettes: Using the koRpus Package for Text Analysis
News/ChangeLog:NEWS ChangeLog
Old sources: koRpus archive

Reverse dependencies:

Reverse suggests: qdap