This paper introduces the pysubgroup package for subgroup discovery in Python. Subgroup discovery is a well-established data mining task that aims at identifying describable subsets in the data that show an interesting distribution with respect to a certain target concept. The presented package provides an easy-to-use, compact and extensible implementation of state-of-the-art mining algorithms, interestingness measures, and visualizations. Since it builds directly on the established pandas data analysis library—a de-facto standard for data science in Python—it seamlessly integrates into preprocessing and exploratory data analysis steps. Code related to this paper is available at: http://florian.lemmerich.net/pysubgroup.
CITATION STYLE
Lemmerich, F., & Becker, M. (2019). pysubgroup: Easy-to-use subgroup discovery in python. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11053 LNAI, pp. 658–662). Springer Verlag. https://doi.org/10.1007/978-3-030-10997-4_46
Mendeley helps you to discover research relevant for your work.