Corpora Processing Software |
Authors: | Doležal Jan, Dytrych Jaroslav, Karásek Miroslav, Kouřil Jan, Otrusina Lubomír, Smrž Pavel |
Type: | software |
Created: | 2015 |
Licence: | required - no fee | Keywords: | corpora, processing, indexing
|
Description: |
Set of programs for processing large text corpora. The programs transform data from the HTML format to a vertical text, its annotation at different levels and indexing in MG4J and Elastic.
|
Location: |
http://knot.fit.vutbr.cz/corpproc/ |
Research groups: |
---|
|
Licence terms: |
---|
Distributed under The Apache License Version 2.0 http://www.apache.org/licenses/LICENSE-2.0.txt |
|