An optimal approach to mining boolean functions from noisy data

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data Mining of binary sequences has been an area of profound research and is often used as a proof of concept in various aspects of computational learning theory. The inference task in this paper, a specialized version of the segmentation problem, is the estimation of a predefined Boolean function on the real interval [0,1] from a noisy random sample. The framework for this problem was introduced by Kearns et al. (1997) in an earlier empirical evaluation of model selection methods. This paper presents an optimal approach to mining for Boolean functions from noisy data samples based on the Minimum Message Length (MML) principle. The MML method is shown to be optimal in comparison to well-known model selection methods based on Guaranteed Risk Minimization, Minimum Description Length (MDL) Principle and Cross Validation after a thorough empirical evaluation with varying levels of noisy data. © Springer-Verlag 2003.

Cite

CITATION STYLE

APA

Viswanathan, M., & Wallace, C. (2004). An optimal approach to mining boolean functions from noisy data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2690, 717–724. https://doi.org/10.1007/978-3-540-45080-1_96

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free