A Data Virtual Machine (DVM) is a novel graph-based conceptual model, similar to the entity-relationship model, representing existing data (persistent, transient, derived) of an organization. A DVM can be built quickly, agilely, offering schematic flexibility to data engineers. Data scientists can visually define complex dataframe queries in an intuitive and simple manner, which are evaluated within an algebraic framework. A DVM can be easily materialized in any logical data model and can be "reoriented'' around any node, offering a "single view of any entity''. In this paper we demonstrate DataMingler, a tool implementing DVMs. We argue that DVMs can have a significant practical impact in analytics environments.
CITATION STYLE
Chatziantoniou, D., & Kantere, V. (2021). DataMingler: A Novel Approach to Data Virtualization. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2681–2685). Association for Computing Machinery. https://doi.org/10.1145/3448016.3452752
Mendeley helps you to discover research relevant for your work.