pyjanitor: A Cleaner API for Cleaning Data

  • J. E
  • Barry Z
  • Zuckerman S
  • et al.
N/ACitations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

The pandas library has become the de facto library for data wrangling in the Python programming language. However, inconsistencies in the pandas application programming interface (API), while idiomatic due to historical use, prevent use of expressive, fluent programming idioms that enable self-documenting pandas code. Here, we introduce pyjanitor, an open source Python package that extends the pandas API with such idioms. We describe its design and implementation of the package, provide usage examples from a variety of domains, and discuss the ways that the pyjanitor project has enabled the inclusion of first-time contributors to open source projects.

Cite

CITATION STYLE

APA

J., E., Barry, Z., Zuckerman, S., & Sailer, Z. (2019). pyjanitor: A Cleaner API for Cleaning Data. In Proceedings of the 18th Python in Science Conference (pp. 50–53). SciPy. https://doi.org/10.25080/majora-7ddc1dd1-007

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free