Thesis Details

Sémantická podobnost textů

Bachelor's Thesis
Student: Hajdin Martin
Academic Year: 2015/2016
Supervisor: Smrž Pavel, doc. RNDr., Ph.D.
English title
Semantic Similarity of Texts
Language
Czech
Abstract

This thesis addresses the determination of semantic similarity between texts, focusing on the categorization of web documents, in this case bookmarks. It gives a theoretical overview of the methods considered for the system implementation and describes the design and implementation of the methods used in the system. It also evaluates these methods, testing the chosen ones against specified criteria.

Keywords

semantic similarity, vector space model, natural language processing, Python, Gensim, Scikit-learn, TFIDF, LDA, NMF, SVD

Department
Degree Programme
Information Technology
Files
Status
not defended
Date
13 June 2016
Reviewer
Committee
Zbořil František, doc. Ing., Ph.D. (DITS FIT BUT), chair
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT), member
Křivka Zbyněk, Ing., Ph.D. (DIFS FIT BUT), member
Rozman Jaroslav, Ing., Ph.D. (DITS FIT BUT), member
Strnadel Josef, Ing., Ph.D. (DCSY FIT BUT), member
Citation
HAJDIN, Martin. Sémantická podobnost textů. Brno, 2016. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2016-06-13. Supervised by Smrž Pavel. Available from: https://www.fit.vut.cz/study/thesis/18690/
BibTeX
@bachelorsthesis{FITBT18690,
    author = "Martin Hajdin",
    type = "Bachelor's thesis",
    title = "S\'{e}mantick\'{a} podobnost text\r{u}",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2016,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/18690/"
}