We have constructed a linear discriminator for handprinted character recognition that uses a (binary) vector of 1, 500 features based on an equidistributed collection of products of pixel pairs. This classifier is competitive with other techniques, but faster to train and to run for classification. However, the 1, 500-member feature set clearly contains many redundant (overlapping or useless) members, and a significantly smaller set would be very desirable (e.g., for faster training, a faster and smaller application program, and a smaller system suitable for hardware implementation). A system using the small set of features should also be better at generalization, since fewer features are less likely to allow a system to “memorize noise in the training data.” We tried several genetic algorithm approaches to search for effective small subsets of features, and we have successfully found a 300-element set of features and built a classifier whose performance is as good on our testing set as the system using the full feature set.
CITATION STYLE
Gaborski, R. S., Anderson, P. G., Asbury, C. T., & Tilley, D. G. (1993). Genetic Algorithm Selection of Features for Hand-printed Character Identification. In Artificial Neural Nets and Genetic Algorithms (pp. 417–422). Springer Vienna. https://doi.org/10.1007/978-3-7091-7533-0_60
Mendeley helps you to discover research relevant for your work.