Efficient sampling from random web graph and its application
Date
2009-01-08T16:20:01Z
Authors
Zhuang, Yan
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
This thesis presents space-efficient algorithms to sample from random web graphs
generated by two important stochastic graph models based on concept of copying:
the linear copy model and the hostgraph model. The goal is to avoid constructing
the entire random graph, and instead use an amount of space nearer to the desired
(smaller) sample size. The efficiency of our algorithms is achieved by refraining from
making unnecessary random decisions when constructing the sample. The construc-
tion of a sample subgraph from a random graph with n nodes and k outgoing links
on each node based on the linear copying model uses an expected O(klnn) words for
each node in the sample subgraph. The construction of a sample subgraph from a
random graph with n nodes based on the hostgraph model uses, for any small sample
size, an expected n+o(n) words.
Description
Keywords
web graph, sampling, graph model, algorithm