The increasing volume of data generated and the shortage of professionals trained to extract value from it, raises a question of how to automate data analysis processes. This work investigates how to increase the automation in the data interpretation process by proposing a relevance classification heuristic model, which can be used to express which views over the data are potentially meaningful and relevant. The relevance classification model uses the combination of semantic types derived from the data attributes and visual human interpretation cues as input features. The evaluation shows the impact of these features in improving the prediction of data relevance, where the best classification model achieves a F1 score of 0.906.
CITATION STYLE
Kamioka, E. H., Freitas, A., Caroli, F., & Handschuh, S. (2016). Determining data relevance using semantic types and graphical interpretation cues. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9897 LNCS, pp. 332–342). Springer Verlag. https://doi.org/10.1007/978-3-319-46349-0_29
Mendeley helps you to discover research relevant for your work.