Over the last decade, the data lake concept has emerged as an alternative to data warehouses for data storage and analysis. Data lakes adopt a schema-on-read approach to provide a flexible and extendable decision support system. In absence of a fixed schema, data querying and exploration depend on a metadata system. However, existing works on metadata management in data lakes mainly focus on structured and semi-structured data, with little research on unstructured data. Thence, we propose in this thesis a methodological approach to enable textual data analyses from data lakes through an efficient metadata system.
CITATION STYLE
Sawadogo, P. N. (2019). Textual Data Analysis from Data Lakes. In Communications in Computer and Information Science (Vol. 1064, pp. 558–563). Springer Verlag. https://doi.org/10.1007/978-3-030-30278-8_54
Mendeley helps you to discover research relevant for your work.