Balanced allocations
Y. Azar, Andrei Z. Broder, et al.
STOC 1994
We present a framework for approximating random-walk based probability distributions over Web pages using graph aggregation. The basic idea is to partition the graph into classes of quasi-equivalent vertices, to project the page-based random walk to be approximated onto those classes, and to compute the stationary probability distribution of the resulting class-based random walk. From this distribution we can quickly reconstruct a distribution on pages. In particular, our framework can approximate the well-known PageRank distribution by setting the classes according to the set of pages on each Web host. We experimented on a Web-graph containing over 1.4 billion pages and over 6.6 billion links from a crawl of the Web conducted by AltaVista in September 2003. We were able to produce a ranking that has Spearman rank-order correlation of 0.95 with respect to PageRank. The clock time required by a simplistic implementation of our method was less than half the time required by a highly optimized implementation of PageRank, implying that larger speedup factors are probably possible. © Springer Science + Business Media, Inc. 2006.
Y. Azar, Andrei Z. Broder, et al.
STOC 1994
Andrei Z. Broder, D. Dolev
FOCS 1983
Andrei Z. Broder, A.C. Ciccolo
IBM Systems Journal
R. Lempel, S. Moran
Information Retrieval