Efficient iot data management for geological disasters based on big data-turbocharged data lake architecture

5Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

Multi-source Internet of Things (IoT) data, archived in institutions’ repositories, are becoming more and more widely open-sourced to make them publicly accessed by scientists, developers, and decision makers via web services to promote researches on geohazards prevention. In this paper, we design and implement a big data-turbocharged system for effective IoT data management following the data lake architecture. We first propose a multi-threading parallel data ingestion method to ingest IoT data from institutions’ data repositories in parallel. Next, we design storage strategies for both ingested IoT data and processed IoT data to store them in a scalable, reliable storage environment. We also build a distributed cache layer to enable fast access to IoT data. Then, we provide users with a unified, SQL-based interactive environment to enable IoT data exploration by leveraging the processing ability of Apache Spark. In addition, we design a standard-based metadata model to describe ingested IoT data and thus support IoT dataset discovery. Finally, we implement a prototype system and conduct experiments on real IoT data repositories to evaluate the efficiency of the proposed system.

References Powered by Scopus

Apache spark: A unified engine for big data processing

1946Citations
N/AReaders
Get full text

Spark SQL: Relational data processing in spark

1093Citations
N/AReaders
Get full text

Big data in natural disaster management: A review

279Citations
N/AReaders
Get full text

Cited by Powered by Scopus

TinyML Algorithms for Big Data Management in Large-Scale IoT Systems

16Citations
N/AReaders
Get full text

An assessment on the off-road trafficability using a quantitative rule method with geographical and geological data

8Citations
N/AReaders
Get full text

Research on the Construction and Data Ingestion Methodology of the Railway Data Lake

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Huang, X., Fan, J., Deng, Z., Yan, J., Li, J., & Wang, L. (2021). Efficient iot data management for geological disasters based on big data-turbocharged data lake architecture. ISPRS International Journal of Geo-Information, 10(11). https://doi.org/10.3390/ijgi10110743

Readers' Seniority

Tooltip

Lecturer / Post doc 2

50%

PhD / Post grad / Masters / Doc 2

50%

Readers' Discipline

Tooltip

Computer Science 2

40%

Social Sciences 1

20%

Engineering 1

20%

Economics, Econometrics and Finance 1

20%

Save time finding and organizing research with Mendeley

Sign up for free