Decision-making bias in instance matching model selection

Mayank Kejriwal; Daniel P. Miranker

Conference ProceedingsOPEN ACCESS

Decision-making bias in instance matching model selection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9366 392-407

DOI: 10.1007/978-3-319-25007-6_23

1Citations

19Readers

Abstract

Instance matching has emerged as an important problem in the Semantic Web, with machine learning methods proving especially effective. To enhance performance, task-specific knowledge is typically used to introduce bias in the model selection problem. Such biases tend to be exploited by practitioners in a piecemeal fashion. This paper introduces a framework where the model selection design process is represented as a factor graph. Nodes in this bipartite graphical model represent opportunities for explicitly introducing bias. The graph is first used to unify and visualize common biases in the design of existing instance matchers. As a direct application, we then use the graph to hypothesize about potential unexploited biases. The hypotheses are evaluated by training 1032 neural networks on three instance matching tasks on Microsoft Azure’s cloud-based platform. An analysis over 25 GB of experimental data indicates that the proposed biases can improve efficiency by over 65% over a baseline configuration, with effectiveness improving by a smaller margin. The findings lead to a promising set of four recommendations that can be integrated into existing supervised instance matchers.

Author supplied keywords

Cite

CITATION STYLE

APA

Kejriwal, M., & Miranker, D. P. (2015). Decision-making bias in instance matching model selection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9366, pp. 392–407). Springer Verlag. https://doi.org/10.1007/978-3-319-25007-6_23

Decision-making bias in instance matching model selection

Abstract

Author supplied keywords

Cite

Register to see more suggestions