This chapter introduces the standard formulation for the data input to data mining algorithms that will be assumed throughout this book. It goes on to distinguish between different types of variable and to consider issues relating to the preparation of data prior to use, particularly the presence of missing data values and noise. The UCI Repository of datasets is introduced.
Bramer, M. (2020). Data for Data Mining (pp. 9–19). https://doi.org/10.1007/978-1-4471-7493-6_2