Transforming ML Predictive Pipelines into SQL with MASQ

7Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Inference of Machine Learning (ML) models, i.e. the process of obtaining predictions from trained models, is often an overlooked problem. Model inference is however one of the main contributors of both technical debt in ML applications and infrastructure complexity. MASQ is a framework able to run inference of ML models directly on DBMSs. MASQ not only averts expensive data movements for those predictive scenarios where data resides on a database, but it also naturally exploits all the "Enterprise-grade"features such as governance, security and auditability which make DBMSs the cornerstone of many businesses. MASQ compiles trained models and ML pipelines implemented in scikit-learn directly into standard SQL: no UDFs nor vendor-specific syntax are used, and therefore queries can be readily executed on any DBMS. In this demo, we will showcase MASQ's capabilities through a GUI allowing attendees to: (1) train ML pipelines composed of data featurizers and ML models; (2) compile the trained pipelines into SQL, and deploy them on different DBMSs (MySQL and SQLServer in the demo); and (3) compare the related performance under different configurations (e.g., the original pipeline on the ML framework against the SQL implementations).

Cite

CITATION STYLE

APA

Del Buono, F., Paganelli, M., Sottovia, P., Interlandi, M., & Guerra, F. (2021). Transforming ML Predictive Pipelines into SQL with MASQ. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2696–2700). Association for Computing Machinery. https://doi.org/10.1145/3448016.3452771

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free