Agreement and Disagreement between True and False-Positive Metrics in Recommender Systems Evaluation

Authors:

Elisa Mena-Maldonado,

Rocío Cañamares,

Pablo Castells,

Yongli Ren,

Mark Sanderson

SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 841 - 850

https://doi.org/10.1145/3397271.3401096

Published: 25 July 2020

Abstract

False-positive metrics can capture an important side of recommendation quality, focusing on the impact of suggestions that are disliked by users, as a complement to common metrics that only measure the number of successful recommendations. In this paper we investigate the extent to which false-positive metrics agree or disagree with true-positive metrics in the offline evaluation of recommender systems. We discover a surprising degree of systematic disagreement that was occasionally noted but not explained in the previous literature. We find an explanation for the discrepancy between the metrics in the effect of popularity biases, which impact false-positive and true-positive metrics in very different ways: instead of rewarding the recommendation of popular items, as true-positive metrics do, false-positive metrics penalize it. We determine precise conditions and cases in the general trends, with a formal explanation for our findings, which we confirm and illustrate empirically in experiments with different datasets.

Supplementary Material

MP4 File (3397271.3401096.mp4)

We present some effects of popularity biases in recommender systems evaluation for true-positive and false-positive metrics. First, we discuss a different perspective for evaluation: the count of false positives. Second, we describe the computation of precision and anti-precision under incomplete relevance knowledge (MNAR) and complete relevance knowledge (MAR). Third, we illustrate the effects of popularity biases using the MovieLens and CM100K datasets. Finally, we summarize the presentation, present some conclusions, and describe the other findings explained further in our paper.
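As a rough illustration of the two metrics named above, the following Python sketch computes precision and anti-precision at a cutoff k for a single user. It is not taken from the paper: it assumes that precision@k is the fraction of the top-k items the user is known to like and anti-precision@k is the fraction known to be disliked, and that unrated items count toward neither (the incomplete-knowledge setting). The function names and the toy data are hypothetical.

```python
from typing import List, Set


def precision_at_k(ranking: List[str], liked: Set[str], k: int) -> float:
    """Fraction of the top-k items the user is known to like (true positives)."""
    return sum(1 for item in ranking[:k] if item in liked) / k


def anti_precision_at_k(ranking: List[str], disliked: Set[str], k: int) -> float:
    """Fraction of the top-k items the user is known to dislike (false positives)."""
    return sum(1 for item in ranking[:k] if item in disliked) / k


# Hypothetical toy data for one user: "x" has no rating at all.
liked = {"a", "b"}
disliked = {"c"}
ranking = ["a", "c", "x", "b"]

p = precision_at_k(ranking, liked, k=4)
ap = anti_precision_at_k(ranking, disliked, k=4)
print(p, ap)
```

With incomplete relevance knowledge the two values need not sum to 1, because the unrated item "x" is counted by neither metric. This is one reason the two metrics can rank systems differently rather than being simple mirror images of each other.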
