Comparison between performance of various database systems for implementing a language corpus

Dimuthu Upeksha; Chamila Wijayarathna; Maduranga Siriwardena; Lahiru Lasandun; Chinthana Wimalasuriya; N.H.N.D. De Silva; Gihan Dias

Journal Article

Communications in Computer and Information Science (2015) 521 82-91

DOI: 10.1007/978-3-319-18422-7_7

Abstract

Data storage and information retrieval are among the most important aspects of developing a language corpus. Most corpora currently use either relational databases or indexed file systems. When selecting a data storage system, the most important factors to consider are the speeds of data insertion and information retrieval. Beyond these two approaches, other database systems offer different strengths that may be more useful. This paper compares the performance of data storage and retrieval mechanisms based on relational databases, graph databases, column store databases, and indexed file systems for steps such as inserting data into the corpus and retrieving information from it, and aims to suggest an optimal storage architecture for a language corpus.

Cite

Upeksha, D., Wijayarathna, C., Siriwardena, M., Lasandun, L., Wimalasuriya, C., De Silva, N. H. N. D., & Dias, G. (2015). Comparison between performance of various database systems for implementing a language corpus. Communications in Computer and Information Science, 521, 82–91. https://doi.org/10.1007/978-3-319-18422-7_7