A Similarity Measure for Indefinite Rankings
William Webber
Department of Computer Science and Software Engineering,
The University of Melbourne,
Victoria 3010, Australia.
Alistair Moffat
Department of Computer Science and Software Engineering,
The University of Melbourne,
Victoria 3010, Australia.
Justin Zobel
Department of Computer Science and Software Engineering,
The University of Melbourne,
Victoria 3010, Australia.
Status
ACM Trans. Information Systems, 28(4):20.1-20.38, November 2010.
Abstract
Ranked lists are encountered in research and daily life, and it is
often of interest to compare these lists, even when they are
incomplete or have only some members in common.
An example is document rankings returned for the same query by
different search engines.
A metric designed to measure the similarity between incomplete
rankings should handle non-conjointness, weight high ranks more
heavily than low, and be monotonic with increasing depth of
evaluation.
Surprisingly, no such metrics exist.
In this article, we propose a new measure having these qualities,
rank-biased overlap (RBO), based on a simple probabilistic user
model.
The RBO measure provides monotonicity by calculating, at a given
depth of evaluation, a base score that is non-decreasing with
additional evaluation, and a maximum score that is non-increasing.
An extrapolated score can be calculated between these bounds if a
point estimate is required.
RBO has a parameter which determines the strength of the weighting to
top ranks.
We extend RBO to handle tied ranks and rankings of different lengths.
Finally, we give examples of the use of the measure in comparing the
results produced by public search engines, and in assessing retrieval
systems in the laboratory.
Full text
http://doi.acm.org/10.1145/1852102.1852106
.