Making an accurate classifier needs accurate labeling, and accurate labeling needs accurate domain knowledge, experience and criteria, that is, experts to label. In reality, having such experts label all data that we need is often impossible because it requires of the high cost, and sometimes we have to make use of 'cheaper' data labeled by non-experts. In such case, experts' and non-experts' data are not discriminated in learning, even if mislabeled data in non-experts' data may make the resultant classifier poor. In this paper, we propose a classification method utilizing reliably labeled data. We utilize the previous knowledge of how reliable persons have given the labels, and set the degrees of label confidence on non-experts' data based on neighboring reliable experts data. The degrees of confidence are reflected in learning as data with higher confidence make a greater contribution to the classifier. With these assumptions, the results of experiments with publicly available data suggest that our method can make a more precise classifier than the conventional method that adopts all data equally. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Nakata, K., Sakurai, S., & Orihara, R. (2008). Classification method utilizing reliably labeled data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5177 LNAI, pp. 114–122). Springer Verlag. https://doi.org/10.1007/978-3-540-85563-7_20
Mendeley helps you to discover research relevant for your work.