SIGIR'98 papers: Improved Algorithms for Topic Distillation in a Hyperlinked Environment
Improved Algorithms for Topic Distillation in a Hyperlinked Environment
Krishna Bharat
Digital Equipment Corporation,
Systems Research Center,
130 Lytton Avenue,
Palo Alto, CA 94301, USA.
Monika R. Henzinger
Digital Equipment Corporation,
Systems Research Center,
130 Lytton Avenue,
Palo Alto, CA 94301, USA.
Abstract
This paper addresses the problem of topic distillation
on the World Wide Web, namely,
given a typical user query to find quality documents related to the
query topic.
Connectivity analysis has been shown to be useful in
identifying high quality pages within a topic specific
graph of hyperlinked documents.
The essence of our approach is to augment a previous
connectivity analysis based algorithm
with content analysis. We identify three problems with the
existing approach and devise algorithms to tackle them.
The results of
a user evaluation are reported that show an improvement
of precision at 10 documents by at least 45% over pure connectivity analysis.
SIGIR'98
24-28 August 1998
Melbourne, Australia.
sigir98@cs.mu.oz.au.