Mining of Massive Datasets

by Anand Rajaraman, Jeffrey D Ullman
Lecture Notes for Stanford CS345A Web Mining


At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web. Further, the book takes an algorithmic point of view: data mining is about applying algorithms to data, rather than using data to train a machine-learning engine of some sort.
