Conference paper

BARTÍK Vladimír. Measuring Web Page Similarity Based on Textual and Visual Properties. In: The 11th International Conference on Artificial Intelligence and Soft Computing. Zakopane: Springer Verlag, 2012, pp. 13-21. ISBN 978-3-642-29349-8. ISSN 0302-9743.
Publication language: English
Original title: Measuring Web Page Similarity Based on Textual and Visual Properties
Title (cs): Měření podobnosti webových stránek na základě textových i vizuálních vlastností
Pages: 13-21
Proceedings: The 11th International Conference on Artificial Intelligence and Soft Computing
Conference: The 11th International Conference on Artificial Intelligence and Soft Computing
Series: Lecture Notes in Artificial Intelligence, Vol. 7268
Place: Zakopane, PL
Year: 2012
ISBN: 978-3-642-29349-8
Journal: Lecture Notes in Computer Science, No. 7268, DE
ISSN: 0302-9743
Publisher: Springer Verlag
Files: iconicaisc.pdf (199 KB, last modified 2012-09-07 09:45:20)
Keywords
Web page similarity, clustering, vector space model, vector distance, term weighting, visual blocks.
Annotation
Measuring web page similarity is an important task in web mining and information retrieval. This paper introduces a method for measuring web page similarity that considers both the textual and the visual properties of pages. The textual properties of a page are described by a modified weighted vector space model. General visual properties are captured by segmenting the page into visual blocks, whose properties are stored in a vector of visual properties. Both vectors are then used to compute the overall similarity of two web pages. The paper describes the method in detail and presents the results of several experiments.
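The annotation does not give the paper's term-weighting scheme, visual features, or combination formula, so the following is only a minimal sketch of the general idea under stated assumptions: plain term frequencies stand in for the modified term weighting, the visual feature names are hypothetical, cosine similarity is used as the vector measure, and the two scores are mixed with an assumed weight `alpha`.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * b.get(k, 0.0) for k, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


def text_vector(text):
    """Raw term-frequency vector.

    Assumption: a stand-in for the paper's modified term weighting,
    which the annotation does not specify.
    """
    return dict(Counter(text.lower().split()))


def page_similarity(text_a, text_b, visual_a, visual_b, alpha=0.5):
    """Weighted mix of textual and visual similarity.

    Assumption: the linear combination and `alpha` are illustrative,
    not the formula from the paper.
    """
    textual = cosine_similarity(text_vector(text_a), text_vector(text_b))
    visual = cosine_similarity(visual_a, visual_b)
    return alpha * textual + (1.0 - alpha) * visual


# Hypothetical visual-block features for two pages.
page1_visual = {"block_count": 4.0, "avg_font_size": 12.0}
page2_visual = {"block_count": 5.0, "avg_font_size": 11.0}
score = page_similarity("web mining and retrieval",
                        "web page retrieval",
                        page1_visual, page2_visual)
print(round(score, 3))
```

With `alpha=1.0` the score depends on text alone, and with `alpha=0.0` on the visual vector alone, which makes it easy to see how much each kind of property contributes for a given pair of pages.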
BibTeX:
@INPROCEEDINGS{
   author = {Vladim{\'{i}}r Bart{\'{i}}k},
   title = {Measuring Web Page Similarity Based on Textual and Visual
	Properties},
   pages = {13--21},
   booktitle = {The 11th International Conference on Artificial Intelligence
	and Soft Computing},
   series = {Lecture Notes in Artificial Intelligence, Vol. 7268},
   journal = {Lecture Notes in Computer Science},
   number = {7268},
   year = {2012},
   location = {Zakopane, PL},
   publisher = {Springer Verlag},
   ISBN = {978-3-642-29349-8},
   ISSN = {0302-9743},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php.en.iso-8859-2?id=9850}
}
