Industrial data mining projects in general and big data mining projects in particular suffer from long project execution. The resulting high costs render many interesting use cases otherwise economically unattractive. This contribution shows on the example of anomaly detection for process plants, how the major obstacles - namely the inefficient development tools for Big Data Frameworks like Apache Hadoop and Spark and the lack of reuse of software artifacts across different projects can be overcome. This is achieved by selecting an application case that shares considerable commonalities across different projects and providing a supported project workflow implemented in a scalable and extensible big data architecture.
CITATION STYLE
Borrison, R., Klöpper, B., Chioua, M., Dix, M., & Sprick, B. (2018). Reusable Big Data System for Industrial Data Mining - A Case Study on Anomaly Detection in Chemical Plants. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11314 LNCS, pp. 611–622). Springer Verlag. https://doi.org/10.1007/978-3-030-03493-1_64
Mendeley helps you to discover research relevant for your work.