Abstract
Real-time predictive data analytics is a very important tool for effective decision support within intelligent systems. When making decisions using data, it is critical to use the most appropriate data. When creating predictive analytics, the selection of data sources is important as the quality of the sources influences the accuracy of the predictive model. Within a smart environment, a dataspace is valuable for data scientists as it provides a one-stop shop of all the data required for creating their analytical models: enterprise data, Internet of Things (IoT), sensor data, and open data. However, the increase in the number of data sources presents a challenge in selecting the most appropriate data source to use. The co-existence approach of dataspaces results in them containing much more data sources than within traditional data management approaches. This means that the need to perform source selection is an ongoing activity; as the dataspace is incrementally improved, sources will need to be re-examined to determine their suitability for tasks. We propose an autonomic source selection service for predictive analytics for intelligent systems within a smart environment. This service has been evaluated in real-world settings using a Real-time Linked Dataspace for energy predictions using IoT sensor data and open weather data.
Author supplied keywords
Cite
CITATION STYLE
Arabekar, N., Derguech, W., Burke, E., & Curry, E. (2019). Autonomic Source Selection for Real-time Predictive Analytics Using the Internet of Things and Open Data. In Real-time Linked Dataspaces: Enabling Data Ecosystems for Intelligent Systems (pp. 237–253). Springer International Publishing. https://doi.org/10.1007/978-3-030-29665-0_15
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.