PyLZJD: An Easy to Use Tool for Machine Learning

  • Raff E
  • Aurelio J
  • Nicholas C

Abstract

As Machine Learning (ML) becomes more widely known and popular, more new users from other backgrounds want to apply ML techniques to their own domains. A difficult prerequisite that often confounds new users is the feature creation and engineering process. This is especially true when users attempt to apply ML to domains that have not historically received attention from the ML community (e.g., outside of text, images, and audio). The Lempel-Ziv Jaccard Distance (LZJD) is a compression-based technique that can be used for many machine learning tasks. Because it is built on compression, users do not need to specify any feature extraction, making it easy to apply to new domains. We introduce PyLZJD, a library that implements LZJD in a manner meant to be easy to use and apply for novice practitioners. We discuss the intuition and high-level mechanics behind LZJD, followed by examples of how to use it on problems of disparate data types.
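To make the "compression-based, no feature extraction" idea concrete, here is a minimal pure-Python sketch of the general approach: scan each input with an LZ78-style pass to collect a set of substrings, then compare two inputs by the Jaccard distance of their sets. This is an illustration under stated assumptions, not the PyLZJD API. The function names `lz_set` and `lzjd_distance` are hypothetical, and the sketch omits the hashing and min-hash approximation that a practical implementation would likely use for speed.

```python
def lz_set(data: bytes) -> set:
    """Collect the distinct substrings found by an LZ78-style scan.

    Hypothetical helper for illustration; not part of PyLZJD.
    """
    seen = set()
    start = 0
    for end in range(1, len(data) + 1):
        chunk = data[start:end]
        if chunk not in seen:
            # First time this substring appears: record it and
            # begin a new substring right after it.
            seen.add(chunk)
            start = end
    return seen


def lzjd_distance(a: bytes, b: bytes) -> float:
    """Jaccard distance between the LZ sets of two byte strings.

    Returns 0.0 for identical sets and 1.0 for disjoint sets.
    Assumption: two empty inputs are treated as distance 0.0.
    """
    set_a, set_b = lz_set(a), lz_set(b)
    union = set_a | set_b
    if not union:
        return 0.0
    return 1.0 - len(set_a & set_b) / len(union)


if __name__ == "__main__":
    doc1 = b"the quick brown fox jumps over the lazy dog " * 20
    doc2 = b"the quick brown fox leaps over the lazy cat " * 20
    doc3 = bytes(range(256)) * 4

    # Similar inputs should score closer to 0 than unrelated ones.
    print("doc1 vs doc2:", lzjd_distance(doc1, doc2))
    print("doc1 vs doc3:", lzjd_distance(doc1, doc3))
```

Because the inputs are raw bytes, the same two functions apply unchanged to text, binaries, or any other serialized data, which is the property the abstract highlights for new domains.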

Cite

Raff, E., Aurelio, J., & Nicholas, C. (2019). PyLZJD: An Easy to Use Tool for Machine Learning. In Proceedings of the 18th Python in Science Conference (pp. 101–106). SciPy. https://doi.org/10.25080/majora-7ddc1dd1-00e
