Comparison between performance of various database systems for implementing a language corpus

2Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data storage and information retrieval are some of the most important aspects when it comes to the development of a language corpus. Currently most corpora use either relational databases or indexed file systems. When selecting a data storage system, most important facts to consider are the speeds of data insertion and information retrieval. Other than the aforementioned two approaches, currently there are various database systems which have different strengths that can be more useful. This paper compares the performance of data storage and retrieval mechanisms which use relational databases, graph databases, column store databases and indexed file systems for various steps such as inserting data into corpus and retrieving information from it, and tries to suggest an optimal storage architecture for a language corpus.

Cite

CITATION STYLE

APA

Upeksha, D., Wijayarathna, C., Siriwardena, M., Lasandun, L., De Silva, C. W. N. H. N. D., & Dias, G. (2015). Comparison between performance of various database systems for implementing a language corpus. Communications in Computer and Information Science, 521, 82–91. https://doi.org/10.1007/978-3-319-18422-7_7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free