Corpora Processing Software

Authors:Doležal Jan, Dytrych Jaroslav, Karásek Miroslav, Kouřil Jan, Otrusina Lubomír, Smrž Pavel
Type:software
Created:2015
Licence:required - no fee
Keywords:corpora, processing, indexing
Description:
A set of programs for processing large text corpora. The programs convert data from HTML to vertical text, annotate it at several levels, and index it in MG4J and Elastic.
Location:
http://knot.fit.vutbr.cz/corpproc/
Research groups:
Licence terms:
Distributed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0.txt
