Working with large tabular datasets often requires extensive preprocessing. Because this preprocessing happens only once, loading and indexing the data in a database or triple store may be overkill. In this paper, we present an approach for preprocessing large tabular data in Datalog without indexing the data. The Datalog query is translated to Unix Bash and can be executed in a shell. Our experiments show that, for the use case of data preprocessing, our approach is competitive with state-of-the-art systems in terms of scalability and speed, while requiring only a Bash shell on a Unix system.
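To make the idea of answering a Datalog query with shell commands concrete, the following is a minimal hand-written sketch, not the output of the system described above and not necessarily the translation scheme it uses. It assumes a hypothetical tab-separated file `parent.tsv` holding facts `parent(X, Y)` and evaluates the illustrative rule `grandparent(X, Z) :- parent(X, Y), parent(Y, Z).` with the standard `sort` and `join` utilities. The file names, the sample facts, and the rule itself are assumptions made for illustration.

```shell
#!/bin/sh
# Illustrative only: a hand-written pipeline for one Datalog rule,
# not the translation produced by the system in the abstract.

export LC_ALL=C          # byte-wise ordering, so sort and join agree on order
tab=$(printf '\t')       # literal tab used as the field separator
workdir=$(mktemp -d)     # scratch directory for the hypothetical input

# Hypothetical facts parent(X, Y), one per line: X <TAB> Y
printf 'alice\tbob\nbob\tcarol\ncarol\tdave\n' > "$workdir/parent.tsv"

# Rule: grandparent(X, Z) :- parent(X, Y), parent(Y, Z).
# join needs both inputs sorted on the join key Y:
#   first copy sorted on column 2 (Y as child),
#   second copy sorted on column 1 (Y as parent).
sort -t "$tab" -k2,2 "$workdir/parent.tsv" > "$workdir/by_child.tsv"
sort -t "$tab" -k1,1 "$workdir/parent.tsv" > "$workdir/by_parent.tsv"

# Join on Y, keep X (file 1, column 1) and Z (file 2, column 2),
# then drop duplicate derived facts.
result=$(join -t "$tab" -1 2 -2 1 -o 1.1,2.2 \
    "$workdir/by_child.tsv" "$workdir/by_parent.tsv" | sort -u)

printf '%s\n' "$result"
rm -rf "$workdir"
```

On the sample facts, this prints the two derived pairs `alice carol` and `bob dave`, separated by tabs. The data is only sorted and streamed through the pipeline, so no persistent index is built, which matches the one-off preprocessing scenario the abstract describes.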
Rebele, T., Tanon, T. P., & Suchanek, F. (2018). Bash datalog: Answering datalog queries with unix shell commands. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11136 LNCS, pp. 566–582). Springer Verlag. https://doi.org/10.1007/978-3-030-00671-6_33