Abstract
Inference of Machine Learning (ML) models, i.e. the process of obtaining predictions from trained models, is often an overlooked problem. Model inference is however one of the main contributors of both technical debt in ML applications and infrastructure complexity. MASQ is a framework able to run inference of ML models directly on DBMSs. MASQ not only averts expensive data movements for those predictive scenarios where data resides on a database, but it also naturally exploits all the "Enterprise-grade"features such as governance, security and auditability which make DBMSs the cornerstone of many businesses. MASQ compiles trained models and ML pipelines implemented in scikit-learn directly into standard SQL: no UDFs nor vendor-specific syntax are used, and therefore queries can be readily executed on any DBMS. In this demo, we will showcase MASQ's capabilities through a GUI allowing attendees to: (1) train ML pipelines composed of data featurizers and ML models; (2) compile the trained pipelines into SQL, and deploy them on different DBMSs (MySQL and SQLServer in the demo); and (3) compare the related performance under different configurations (e.g., the original pipeline on the ML framework against the SQL implementations).
Author supplied keywords
Cite
CITATION STYLE
Del Buono, F., Paganelli, M., Sottovia, P., Interlandi, M., & Guerra, F. (2021). Transforming ML Predictive Pipelines into SQL with MASQ. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2696–2700). Association for Computing Machinery. https://doi.org/10.1145/3448016.3452771
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.