Abstract
Traditional wisdom has it that the better a theory compresses the learning data concerning some phenomenon under investigation, the better we leaxn, generalize, and the better the theory predicts unknown data. This belief is vindicated in practice but apparently has not been rigorously proved in a general setting. Making these ideas rigorous involves the length of the shortest effective description of an individual object: its Kolmogorov complexity. In a previous paper we have shown that optimal compression is almost always a best strategy in hypotheses identification (an ideal form of the minimum description length (MDL) principle). Whereas the single best hypothesis does not necessarily give the best prediction, we demonstrate that nonetheless compression is almost always the best strategy in prediction methods in the style of R. Solomonoff.
Cite
CITATION STYLE
VitÁnyi, P., & Li, M. (1997). On prediction by data compression. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1224, pp. 14–30). Springer Verlag. https://doi.org/10.1007/3-540-62858-4_69
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.