Join operation is usually hard to achieve high quality with machine alone. We adopt crowdsourcing to improve the quality of join. Depending on the number of generated pairs, the overall cost can be expensive for hiring workers to do the verification. We propose a hybrid approach to generate pairs by leveraging attributes, which combines category, sorting and clustering techniques, called CSCER. We also propose an adaptive attribute-selection strategy to efficiently generate pairs based on attributes. Experiments on a real crowdsourcing platform using real datasets indicate that our approaches save the overall cost compared to existing methods and achieve high quality of join results. © 2014 Springer International Publishing Switzerland.
CITATION STYLE
Feng, J., Feng, J., & Hu, H. (2014). Leveraging attributes and crowdsourcing for join. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8485 LNCS, pp. 448–452). Springer Verlag. https://doi.org/10.1007/978-3-319-08010-9_47
Mendeley helps you to discover research relevant for your work.