Panel s nápovědou

Settings for storage

The application supports several possibilities of output configuration. The first one is an option, if an information about class will be stored and the second one are constraints of output dataset.
  1. Classes
    If this is chosen, the category of each document will be stored in the output. This information will be stored between document ID and the rest of document representation (the attribute is called "topic").
  2. Constraints of output dataset
    Setting of a minimum value enables pruning of features, which occur only in a small number of documents. Tha maximum value is used to prune features, which occur too frequently in documents. Below these fields for constraints there is shown an interval, inside of which these two values should be.