Publication Details

Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation

ŠEBEK Michal and ZENDULKA Jaroslav. Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation. In: Proceedings of the Twelfth International Conference on Informatics 2013. Košice: The University of Technology Košice, 2013, pp. 289-292. ISBN 978-80-8143-127-2.
Czech title
Generátor syntetických datových sad pro vyhodnocení dolování hierarchických sekvenčních vzorů
Type
conference paper
Language
english
Authors
Keywords

Sequence pattern mining, synthetic dataset generators, taxonomy

Abstract

Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluation of algorithm properties in practice but it is difficult to change the dataset to have some particular statistics, e.g. number of input items. In contrast, generated synthetic dataset simply allows changing any of statistic property of the dataset with keeping all other statistic properties. In the paper, we present a procedure for generation of sequence databases with taxonomies for an evaluation of hierarchical sequential pattern mining algorithms.

Annotation

Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluation of algorithm properties in practice but it is difficult to change the dataset to have some particular statistics, e.g. number of input items. In contrast, generated synthetic dataset simply allows changing any of statistic property of the dataset with keeping all other statistic properties. In the paper, we present a procedure for generation of sequence databases with taxonomies for an evaluation of hierarchical sequential pattern mining algorithms.

Published
2013
Pages
289-292
Proceedings
Proceedings of the Twelfth International Conference on Informatics 2013
Conference
Informatics 2013 - 12th International Scientific Conference on Informatics, Spišská Nová Ves, SK
ISBN
978-80-8143-127-2
Publisher
The University of Technology Košice
Place
Košice, SK
BibTeX
@INPROCEEDINGS{FITPUB10435,
   author = "Michal \v{S}ebek and Jaroslav Zendulka",
   title = "Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation",
   pages = "289--292",
   booktitle = "Proceedings of the Twelfth International Conference on Informatics 2013",
   year = 2013,
   location = "Ko\v{s}ice, SK",
   publisher = "The University of Technology Ko\v{s}ice",
   ISBN = "978-80-8143-127-2",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/10435"
}
Back to top