We present Vizier, a multi-modal data exploration and debugging tool. The system supports a wide range of operations by seamlessly integrating Python, SQL, and automated data curation and debugging methods. Using Spark as an execution backend, Vizier handles large datasets in multiple formats. Ease of use is attained by integrating a notebook with a spreadsheet-style interface and with visualizations that guide and support the user in the loop. In addition, native support for provenance and versioning enables collaboration and uncertainty management. In this demonstration, we will illustrate the diverse features of the system using several realistic data science tasks based on real data.
Brachmann, M., Bautista, C., Castelo, S., Feng, S., Freire, J., Glavic, B., … Yang, Y. (2019). Data debugging and exploration with Vizier. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 1877–1880). Association for Computing Machinery. https://doi.org/10.1145/3299869.3320246