Preprocess
Basic text preprocessing.
Signals
Inputs:
Segmentation covering the text that should be preprocessed
Outputs:
Segmentation covering the modified text
Description
The **Preprocess** widget performs simple text preprocessing and creates a modified copy of the input content. The user can change the case (to lower or upper) and replace accented characters with their non-accented equivalents.
- Change case and convert every character to lower or upper case.
- Replace accented characters with their non-accented equivalents.
- Copy all the annotations of the input segmentation to the output segmentation.
- Information about the number of segments in the output segmentation or the reasons why no segmentation is emitted.
- By clicking Send, changes are communicated to the output of the widget. Alternatively, tick Send automatically and changes will be communicated to the output at every modification.
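The two transformations listed above can be illustrated outside the widget with a short Python sketch. This is not the widget's actual implementation: the function names `remove_accents` and `preprocess` and their parameters are hypothetical, and the accent removal shown here relies on Unicode decomposition from the standard `unicodedata` module, which is only one possible approach.

```python
import unicodedata


def remove_accents(text):
    """Replace accented characters with their non-accented base characters."""
    # NFD splits each accented character into a base character
    # followed by one or more combining marks (e.g. "é" -> "e" + U+0301).
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def preprocess(text, case=None, strip_accents=False):
    """Return a modified copy of text (hypothetical helper, not the widget's API).

    case: "lower", "upper", or None to leave case unchanged.
    strip_accents: if True, replace accented characters.
    """
    if case == "lower":
        text = text.lower()
    elif case == "upper":
        text = text.upper()
    if strip_accents:
        text = remove_accents(text)
    return text


print(preprocess("Café Noël", case="lower", strip_accents=True))  # prints "cafe noel"
```

As in the widget, the input string is left untouched and a modified copy is returned.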
Example
We used the Text Field widget to input a short text. We then preprocessed it with the Preprocess widget and viewed the result in the Display widget.