Orange Textable is an open-source add-on bringing advanced text-analytical functionalities to the Orange Canvas visual programming environment (itself open-source). It has been designed and implemented on behalf of the department of language and information sciences (SLI) at the University of Lausanne and it essentially enables users to build data tables on the basis of text data, by means of a flexible and intuitive interface. Orange Textable offers the following features:
- text data import from keyboard, files, or URLs
- support for various encodings, including Unicode
- standard preprocessing and custom recoding (based on regular expressions)
- segmentation and annotation of various text units (letters, words, etc.)
- ability to extract and exploit XML-encoded annotations
- automatic, random, or arbitrary selection of unit subsets
- unit context examination using concordance and collocation tables
- calculation of frequency and complexity measures
- recoded text data and table export
This documentation is divided into 3 main sections (see detailed contents below):