The objective of this paper is to present a methodology for discovering comprehensible, valid, potentially innovative, and useful patterns, i.e., new knowledge, in multidimensional spatial data. Techniques from statistics, machine learning, and data mining are applied in consecutive logical steps to allow the visualization of results and the application of validation procedures at each stage. However, the approach does not end with a data cluster; rather, if such a valid cluster has been achieved, then the question is posed: “What do the clusters mean?”. Symbolic machine learning methods are employed to produce an explanation of the clusters in terms of rules employing an understandable subset of the high-dimensional data variables. This combined with canonical representatives of a cluster and consideration of the spatial distribution of the clusters lead to hypothesis on emergent data structures, that is, potential new knowledge. The approach is demonstrated on an exemplary data set of German urban districts featuring seven dimensions of land use.
CITATION STYLE
Behnisch, M., & Ultsch, A. (2015). Knowledge discovery in spatial planning data: A concept for cluster understanding. In Computational Approaches for Urban Environments (pp. 49–75). Springer International Publishing. https://doi.org/10.1007/978-3-319-11469-9_3
Mendeley helps you to discover research relevant for your work.