Conference paper

BARTÍK Vladimír. Text-Based Web Page Classification with Use of Visual Information. In: 2010 International Conference on Advances in Social Network Analysis and Mining. Odense: IEEE Computer Society, 2010, pp. 416-420. ISBN 978-0-7695-4138-9.
Publication language:english
Original title:Text-Based Web Page Classification with Use of Visual Information
Title (cs):Klasifikace webových stránek založená na textu s využitím vizuální informace
Pages:416-420
Proceedings:2010 International Conference on Advances in Social Network Analysis and Mining
Conference:International Symposium on Open Source Intelligence & Web Mining 2010
Place:Odense, DK
Year:2010
ISBN:978-0-7695-4138-9
Publisher:IEEE Computer Society
Keywords
web page classification, term weights, text classification, TF-IDF weight, visual information, visual  blocks
Annotation
As the number of pages on the web is permanently increasing, there is a need to classify pages into categories to facilitate indexing or searching them. In the method proposed here, we use both textual and visual information to find a suitable representation of web page content. In this paper, several term weights, based on TF or TF-IDF weighting are proposed. Modification is based on visual areas, in which the text appears and their visual properties. Some results of experiments are included in the final part of the paper.
BibTeX:
@INPROCEEDINGS{
   author = {Vladim{\'{i}}r Bart{\'{i}}k},
   title = {Text-Based Web Page Classification with Use of Visual
	Information},
   pages = {416--420},
   booktitle = {2010 International Conference on Advances in Social Network
	Analysis and Mining},
   year = {2010},
   location = {Odense, DK},
   publisher = {IEEE Computer Society},
   ISBN = {978-0-7695-4138-9},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php.en.iso-8859-2?id=9274}
}

Your IPv4 address: 54.224.197.251
Switch to IPv6 connection

DNSSEC [dnssec]