Publication Details

MASK+: Data-driven regions selection for acoustic fingerprinting

ONDEL Yang Lucas Antoine Francois, ANGUERA Xavier and LUQUE Jordi. MASK+: Data-Driven Regions Selection for Acoustic Fingerprinting. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 335-339. ISBN 978-1-4673-6997-8.

Czech title

MASK+: Regiony určené pomocí dat pro tvorbu akustických otisků

Type

conference paper

Language

English

Authors

Ondel Yang Lucas Antoine Francois, Mgr., Ph.D. (DCGM FIT BUT)
Anguera Xavier (Telefónica)
Luque Jordi (Telefónica)

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2015/ondel_icassp2015_0000335.pdf

Keywords

Audio fingerprinting, content recognition

Abstract

In this paper we propose an improvement to MASK, a recently proposed acoustic fingerprint that has been shown to be effective at compactly representing an acoustic signal using binary descriptors.

Annotation

Acoustic fingerprinting is the process of deterministically obtaining a compact representation of an audio segment, which can be used to compare multiple audio files or to search efficiently for a file within a large database. Recently, we proposed a novel fingerprint named MASK (Masked Audio Spectral Keypoints) that encodes the relationship between pairs of spectral regions around a single spectral energy peak into a binary representation. In the original proposal, the location and size of the region pairs were determined manually to optimally encode how energy flows around the spectral peak. Such manual selection has always been considered a weakness of the process, as it might not be adapted to the actual data being represented. In this paper we address this problem by proposing an unsupervised, data-driven method based on mutual information theory to automatically define an optimal MASK fingerprint structure. Audio retrieval experiments optimizing for data distorted with additive Gaussian white noise show that the proposed method is much more robust than the original MASK and a well-known acoustic fingerprint.
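To make the two ideas in the annotation concrete, the following is a minimal sketch and not the authors' implementation. It assumes that one fingerprint bit is obtained by comparing the mean energy of two rectangular regions in a spectrogram patch centred on a peak, and that candidate region pairs could be scored by the empirical mutual information between the bits they produce on clean and on distorted audio. The record does not say which variables the mutual information is computed between, so that pairing, the region layout `(row_start, row_end, col_start, col_end)`, and all function names are illustrative assumptions.

```python
import math
from collections import Counter


def region_mean(patch, region):
    """Mean energy of a rectangular region of a spectrogram patch.

    `patch` is a list of rows; `region` is a hypothetical
    (row_start, row_end, col_start, col_end) box with exclusive ends.
    """
    r0, r1, c0, c1 = region
    values = [patch[r][c] for r in range(r0, r1) for c in range(c0, c1)]
    return sum(values) / len(values)


def region_pair_bit(patch, region_a, region_b):
    """Return 1 if region A holds more mean energy than region B, else 0."""
    return int(region_mean(patch, region_a) > region_mean(patch, region_b))


def mutual_information(xs, ys):
    """Empirical mutual information, in bits, between two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # p_xy / (p_x * p_y), written with raw counts to limit rounding error
        mi += p_xy * math.log2(p_xy * n * n / (px[x] * py[y]))
    return mi
```

Under these assumptions, a region pair whose bits survive added noise yields a high score when `mutual_information` is applied to the bit sequences from clean and noisy versions of the same audio, while a pair whose bits flip at random yields a score near zero. A selection procedure could then rank candidate pairs by that score.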

Published

2015

Pages

335-339

Proceedings

Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing

Conference

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), Brisbane, AU

ISBN

978-1-4673-6997-8

Publisher

IEEE Signal Processing Society

Place

South Brisbane, Queensland, AU

DOI

10.1109/ICASSP.2015.7177986

UT WoS

000427402900067

EID Scopus

2-s2.0-84946023338

BibTeX

@INPROCEEDINGS{FITPUB10958,
   author = "Francois Antoine Lucas Yang Ondel and Xavier Anguera and Jordi Luque",
   title = "MASK+: Data-driven regions selection for acoustic fingerprinting",
   pages = "335--339",
   booktitle = "Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing",
   year = 2015,
   location = "South Brisbane, Queensland, AU",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4673-6997-8",
   doi = "10.1109/ICASSP.2015.7177986",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/10958"
}