BIROn - Birkbeck Institutional Research Online

    Nearest neighbor retrieval using distance-based hashing

    Athitsos, V. and Potamias, M. and Papapetrou, Panagiotis and Kollios, G. (2008) Nearest neighbor retrieval using distance-based hashing. In: UNSPECIFIED (ed.) International Conference on Data Engineering. Washington, USA: IEEE Computer Society, pp. 327-336. ISBN 9781424418367.

    Full text not available from this repository.

    Abstract

    A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficient approximate nearest neighbor retrieval. Hashing methods, such as Locality Sensitive Hashing (LSH), have been successfully applied for similarity indexing in vector spaces and string spaces under the Hamming distance. The key novelty of the hashing technique proposed here is that it can be applied to spaces with arbitrary distance measures, including non-metric distance measures. First, we describe a domain-independent method for constructing a family of binary hash functions. Then, we use these functions to construct multiple multibit hash tables. We show that the LSH formalism is not applicable for analyzing the behavior of these tables as index structures. We present a novel formulation, that uses statistical observations from sample data to analyze retrieval accuracy and efficiency for the proposed indexing method. Experiments on several real-world data sets demonstrate that our method produces good trade-offs between accuracy and efficiency, and significantly outperforms VP-trees, which are a well-known method for distance-based indexing.

    Metadata

    Item Type: Book Section
    School: Birkbeck Schools and Departments > School of Business, Economics & Informatics > Computer Science and Information Systems
    Depositing User: Sarah Hall
    Date Deposited: 01 Aug 2013 10:28
    Last Modified: 17 Feb 2014 14:37
    URI: http://eprints.bbk.ac.uk/id/eprint/7885

    Statistics

    Downloads
    Activity Overview
    0Downloads
    125Hits

    Additional statistics are available via IRStats2.

    Archive Staff Only (login required)

    Edit/View Item Edit/View Item