B-tries for disk-based string management

14Citations
Citations of this article
40Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A wide range of applications require that large quantities of data be maintained in sort order on disk. The B-tree, and its variants, are an efficient general-purpose disk-based data structure that is almost universally used for this task. The B-trie has the potential to be a competitive alternative for the storage of data where strings are used as keys, but has not previously been thoroughly described or tested. We propose new algorithms for the insertion, deletion, and equality search of variable-length strings in a disk-resident B-trie, as well as novel splitting strategies which are a critical element of a practical implementation. We experimentally compare the B-trie against variants of B-tree on several large sets of strings with a range of characteristics. Our results demonstrate that, although the B-trie uses more memory, it is faster, more scalable, and requires less disk space. © 2008 Springer-Verlag.

Cite

CITATION STYLE

APA

Askitis, N., & Zobel, J. (2009). B-tries for disk-based string management. VLDB Journal, 18(1), 157–179. https://doi.org/10.1007/s00778-008-0094-1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free