Writing styles can be described by stylometric features. They are quantitative in nature, often continuous, which either limits techniques used in mining to those capable of calculations on this form, or adds discretisation to initial pre-processing of data. The paper describes research on unsupervised and supervised discretisation applied in the stylometric domain for the task of authorship attribution. The recognition of authorship is executed as classification performed by chosen inducers capable of operating on both continuous and categorical attributes.
CITATION STYLE
Stańczyk, U. (2020). On Unsupervised and Supervised Discretisation in Mining Stylometric Features. In Advances in Intelligent Systems and Computing (Vol. 1061, pp. 156–166). Springer. https://doi.org/10.1007/978-3-030-31964-9_15
Mendeley helps you to discover research relevant for your work.