Reducing the Distance Calculations when Searching an M-Tree

Guhlemann S, Petersohn U, Meyer-Wegener K (2017)


Publication Language: English

Publication Type: Journal article, Original article

Publication year: 2017

Journal: Datenbank-Spektrum

Book Volume: 17

Pages Range: 155-167

Journal Issue: 2

URI: http://rdcu.be/tz7S

DOI: 10.1007/s13222-017-0258-5

Abstract

Recent years have brought rising interest in efficiently searching for similar entities in a broad range of domains. Such search can facilitate working with unstructured data such as genome sequences, text corpora, complex production information, or multimedia content, where queries always contain some amount of noise. In such domains the only common structure is a distance function obeying the axioms of a metric. Because usually no other structural information is available, many distances have to be computed in the course of a search. In contrast to classical database indexes, where optimization focuses on reducing the number of disk accesses (or, for in-memory databases, the number of tree traversal operations), a major cost driver in such multimedia domains is the number of distance calculations, each of which can be computationally intensive.

There exists a range of index structures supporting similarity search in metric spaces. A very promising one is the M-Tree, along with a number of compatible extensions (e.g. Slim-Tree, Bulk Loaded M-Tree, multi-way insertion M-Tree, M2-Tree). The M-Tree family uses common algorithms for k-nearest-neighbor and range search. These algorithms leave room for optimization in terms of the number of necessary distance calculations. In this paper we present new algorithms for these tasks that considerably improve the retrieval performance of all M-Tree-compatible data structures.
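
The following is a minimal sketch of why the metric axioms allow distance calculations to be skipped at all. It is not the algorithm presented in the paper and not an M-Tree; it is a generic single-pivot filter for range search, built on the triangle-inequality bound d(q, x) >= |d(q, p) - d(p, x)|. All names (`CountingMetric`, `build_pivot_table`, `range_search_pivot`) and the one-dimensional example metric are assumptions made for illustration.

```python
from typing import Callable, List, Tuple


class CountingMetric:
    """Wraps a distance function and counts how often it is evaluated."""

    def __init__(self, fn: Callable[[float, float], float]) -> None:
        self.fn = fn
        self.calls = 0

    def __call__(self, a: float, b: float) -> float:
        self.calls += 1
        return self.fn(a, b)


def abs_diff(a: float, b: float) -> float:
    """Illustrative metric on real numbers (assumed, not from the paper)."""
    return abs(a - b)


def range_search_linear(data: List[float], query: float, radius: float,
                        dist: Callable[[float, float], float]) -> List[float]:
    """Baseline: one distance calculation per stored object."""
    return [x for x in data if dist(query, x) <= radius]


def build_pivot_table(data: List[float], pivot: float,
                      dist: Callable[[float, float], float]
                      ) -> List[Tuple[float, float]]:
    """Precompute d(pivot, x) for every object once, at index build time."""
    return [(x, dist(pivot, x)) for x in data]


def range_search_pivot(table: List[Tuple[float, float]], pivot: float,
                       query: float, radius: float,
                       dist: Callable[[float, float], float]) -> List[float]:
    """Range search that skips objects excluded by the triangle inequality."""
    d_qp = dist(query, pivot)  # the only unconditional distance calculation
    result = []
    for x, d_px in table:
        # Lower bound: d(query, x) >= |d(query, pivot) - d(pivot, x)|.
        # If the bound already exceeds the radius, x cannot qualify.
        if abs(d_qp - d_px) > radius:
            continue
        if dist(query, x) <= radius:
            result.append(x)
    return result
```

In this sketch the distances to the pivot are paid for once when the table is built, so a query only evaluates the metric for objects the bound cannot exclude. Tree-based metric indexes apply the same kind of bound per node rather than per object.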

How to cite

APA:

Guhlemann, S., Petersohn, U., & Meyer-Wegener, K. (2017). Reducing the Distance Calculations when Searching an M-Tree. Datenbank-Spektrum, 17(2), 155-167. https://dx.doi.org/10.1007/s13222-017-0258-5

MLA:

Guhlemann, Steffen, Uwe Petersohn, and Klaus Meyer-Wegener. "Reducing the Distance Calculations when Searching an M-Tree." Datenbank-Spektrum 17.2 (2017): 155-167.