[BGR11] Large scale disk-based metric indexing structure for approximate information retrieval by content

Peer-reviewed international conference: 1st Workshop on New Trends in Similarity Search (NTSS’11), in conjunction with the EDBT 2011 Conference, March 2011, pp. 1-6, Uppsala, Sweden

Keywords: indexing, multidimensional, similarity search, curse of dimensionality, database

Abstract: In order to achieve large scalability, indexing structures are usually distributed to incorporate more of expensive main memory during the query processing. In this paper, an indexing structure, that does not suffer from a performance degradation by its transition from main memory storage to hard drive, is proposed. The high efficiency of the index is achieved using a very effective pruning based on precomputed distances and so called locality phenomenon which substantially diminishes the number of retrieved candidates. The trade-offs for the large scalability are, firstly, the approximation and, secondly, longer query times, yet both are still bearable enough for recent multimedia content-based search systems, proved by an evaluation using visual and audio data and both metric and semi-metric distance functions. The tuning of the index's parameters based on the analysis of the particular's data intrinsic dimensionality is also discussed.

Team: vertigo
Collaboration: LAMSADE

BibTeX

@inproceedings {
BGR11,
title="{Large scale disk-based metric indexing structure for approximate information retrieval by content}",
author="S. Barton and V. Gouet-Brunet and M. Rukoz",
booktitle="{1st Workshop on New Trends in Similarity Search (NTSS’11), in Conjunction with the EDBT 2011 Conference}",
year=2011,
month="March",
pages="1-6",
address="Uppsala, Sweden",
}