Home | Research | Tools and Resources | CV | Tutorials | Outdoor Shenanigans
Desktop Tools
A variety of desktop tools can be found at www.linguisticanalysistools.org, a site I maintain in collaboration with Scott Crossley.
These include:
- The Tool for the Automatic Analysis if Lexical Diversity (TAALED)
- The Tool for the Automatic Analysis of Lexical Sophistication (TAALES)
- The Tool for the Automatic Analysis of Syntactic Sophistication and Complexity (TAASSC)
- among many others
Python Packages
I have developed some text analysis Python Packages (with more in development):
- lxgrtgr is an open source replication of the Biber Tagger (Biber, 1988) that provides detailed (semi)automatic annotation of lexicogrammatical features (including complexity features). It also provides normed counts of these features in texts. This is the result of a collaboration with Doug Biber, Hakyung Sung, Randi Reppen, and Jesse Egbert.
- pylats is a text pre-processing toolkit that will serve as the standard text-preprocessing protocol for upcoming releases of TAALED, TAALES, TAASSC, and TAACO (among other tools).
- taaled is a toolkit for calculating a wide number of lexical diversity indices, and will be the backbone of upcoming releases ofTAALED
- corpus-toolkit is a simple but scalable corpus analysis toolkit primarily designed for introductory corpus linguistics classes.