Page Quality: In Search of an Unbiased Web Ranking

  • Cho J
  • Roy S
  • Adams R
  • 48

    Readers

    Mendeley users who have this article in their library.
  • 68

    Citations

    Citations of this article.

Abstract

"Our snapshots were quite complete mirrors of the 154 web sites. We downloaded pages from each site until we could not reach any more pages from the site or we downloaded the maximum of 200,000 pages. Out of 154 web sites, only four web sites had more than 200,000 pages. The number of pages that we downloaded in each snapshot ranged between 4.6 million pages and 5 million pages. Since we were interested in comparing our estimated page quality with the future PageRank, we first identified the set of pages downloaded in all snapshots. Out of 5 million pages, 2.7 million pages were common in all four snapshots. We then computed the PageRank values from the subgraph of the web obtained from these 2.7 million pages for each snapshot."

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

Authors

  • Junghoo Cho

  • Sourashis Roy

  • Robert E. Adams

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free