Large-Scale Learning from Data Streams with Apache SAMOA

8Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Apache SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. Big data is defined as datasets whose size is beyond the ability of typical software tools to capture, store, manage, and analyze, due to the time and memory complexity. Apache SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines such as Apache Flink, Apache Storm, and Apache Samza. Apache SAMOA is written in Java and is available at https://samoa.incubator.apache.org under the Apache Software License version 2.0.

Cite

CITATION STYLE

APA

Kourtellis, N., De Francisci Morales, G., & Bifet, A. (2019). Large-Scale Learning from Data Streams with Apache SAMOA. In Studies in Big Data (Vol. 41, pp. 177–207). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-319-89803-2_8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free