
File Selection
At first, you have to read the file, which you want to pre-process. This is executed by the "Read files" button (it is possible to read more files at a time). There are chosen files displayed in the list. Some of them can be removed by choosing them and clicking the button "Remove chosen". The button "Remove all" removes the whole list.Next
After clicking on "Next" button, the following operations are performed:
- Storing the documents into memory
Text documents in the chosen format are read form the selected files. In the operating memory there is a database of these documents created. - Data cleaning
Various symbols, punctuation marks and other characters, which don't have any function for data representation, are removed from text data.