In recent years, dimension of datasets has increased rapidly in many applications which bring great difficulty to data mining and pattern recognition. Also, all the measured variables of these high-dimensional datasets are not relevant for understanding the underlying phenomena of interest. In this paper, firstly, similarities among the attributes are measured by computing similarity factors based on relative indiscernibility relation, a concept of rough set theory. Based on the similarity factors, attribute similarity set AS = {(A B) / A, B are attributes and B similar to A with similarity factor k} is formed which helps to construct a directed weighted graph with weights as the inverse of similarity factor k. Then a minimal spanning tree of the graph is generated, from which iteratively most important vertex is selected in reduct set. The iteration completes when the edge set is empty. Thus the selected attributes, from which edges emanate, are the most relevant attributes and are known as reduct. The proposed method has been applied on some benchmark datasets and the classification accuracy is calculated by various classifiers to demonstrate the effectiveness of the method. © 2012 Springer-Verlag GmbH Berlin Heidelberg.
CITATION STYLE
Das, A. K., Sengupta, S., & Chakrabarty, S. (2012). Reduct generation by formation of directed minimal spanning tree using rough set theory. In Advances in Intelligent and Soft Computing (Vol. 132 AISC, pp. 127–135). Springer Verlag. https://doi.org/10.1007/978-3-642-27443-5_15
Mendeley helps you to discover research relevant for your work.