Detail publikace

Content-based Copy Detection

BERAN Vítězslav, HRADIŠ Michal, OTRUSINA Lubomír a ŘEZNÍČEK Ivo. Content-based Copy Detection. In: 2011 TREC Video Retrieval Evaluation Notebook Papers. Gaithersburg, MD: National Institute of Standards and Technology, 2011, s. 1-10.
Typ
článek ve sborníku konference
Jazyk
angličtina
Autoři
URL
Abstrakt

This paper describes our approach to semantic indexing and content-based copy detection which was used for TRECVID 2010 evaluation.

Semantic indexing
1. The runs differ in the types of features used. All runs use several bag-of-word representations fed to separate linear SVMs and the SVMs were fused by logistic regression. Visual and audio features were used as well as metadata. We added contextual features extracted from the video from which a shot originated.

  • F_A_brno.run1 (run1) - Only visual information. Dense sampling and Harris-Laplace detector with SIFT and RGB-SIFT descriptors
  • F_A_brno.run1 (run2) - The same as in run1 with added features from audio and metadata.
  • F_A_brno.run3 (run3) - The same as in run2 with added contextual features extracted from the whole video.

2. Audio and metadata significantly improves results. Even grater improvement was achieved by using the contextual features.

 

Content-based Copy Detection
1. One run submitted in two versions (the difference is only in relevance threshold setting)

  • brnoccd: SIFT and SURF combination, bag-of-words (visual codebook: 100k size, 4 nearest neighbors used in soft-assignment), inverted file index, geometry (homography) based image similarity metric

2. What if any significant differences (in terms of what measures) did you find among the runs?

  • only one setting used - no differences

3. Based on the results, can you estimate the relative contribution of each component of your system/approach to its effectiveness?

  • slow search in reference dataset due to pure indexing effectiveness

4. Overall, what did you learn about runs/approaches and the research question(s) that motivated them?

  • change the way of describing the video content - frame based (or key-frame based) approach is not sufficient
Rok
2011
Strany
1-10
Sborník
2011 TREC Video Retrieval Evaluation Notebook Papers
Konference
2011 TRECVID Workshop, Gaithersburg, US
Vydavatel
National Institute of Standards and Technology
Místo
Gaithersburg, MD, US
EID Scopus
BibTeX
@INPROCEEDINGS{FITPUB9841,
   author = "V\'{i}t\v{e}zslav Beran and Michal Hradi\v{s} and Lubom\'{i}r Otrusina and Ivo \v{R}ezn\'{i}\v{c}ek",
   title = "Content-based Copy Detection",
   pages = "1--10",
   booktitle = "2011 TREC Video Retrieval Evaluation Notebook Papers",
   year = 2011,
   location = "Gaithersburg, MD, US",
   publisher = "National Institute of Standards and Technology",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9841"
}
Soubory
Nahoru