Scalability, search, and sampling: From smart algorithms to active discovery

Stefan Wrobel

Conference ProceedingsOPEN ACCESS

Scalability, search, and sampling: From smart algorithms to active discovery

Wrobel S

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2001) 2167 615

DOI: 10.1007/3-540-44795-4_55

0Citations

6Readers

Abstract

The focus on scalability to very large datasets has been a distinguishing feature of the KDD endeavour right from the start of the area. In the present stage of its development, the field has begun to seriously approach the issue, and a number of different techniques for scaling up KDD algorithms have emerged. Traditionally, such techniques are concentrating on the search aspects of the problem, employing algorithmic techniques to avoid searching parts of the space or to speed up processing by exploiting properties of the underlying host systems. Such techniques guarantee perfect correctness of solutions, but can never reach sublinear complexity. In contrast, researchers have recently begun to take a fresh and principled look at stochastic sampling techniques which give only an approximate quality guarantee, but can make runtimes almost independent of the size of the database at hand. In the talk, we give an overview of both of these classes of approaches, focusing on individual examples from our own work for more detailed illustrations of how such techniques work. We briefly outline how active learning elements may enhance KDD approaches in the future.

Cite

CITATION STYLE

APA

Wrobel, S. (2001). Scalability, search, and sampling: From smart algorithms to active discovery. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2167, p. 615). Springer Verlag. https://doi.org/10.1007/3-540-44795-4_55

Scalability, search, and sampling: From smart algorithms to active discovery

Abstract

Cite

Register to see more suggestions