Citation
Serradilla García, Francisco and Bobadilla Sancho, Jesus and Sánchez, J.L. and Martínez Murciano, Eduardo
(2008).
Choice of Metrics used in Collaborative Filtering and their Impact on Recommender Systems.
In: "2nd IEEE International Conference on Digital Ecosystems and Technologies", 26/02/2008-29/02/2008, Phitsanulok, Thailand. ISBN 978-1-4244-1489-5. pp. 432-436.
https://doi.org/10.1109/DEST.2008.4635147.
Abstract
The capacity of recommender systems to make correct predictions is essentially determined by the quality and suitability of the collaborative filtering that implements them. The common memory-based metrics are Pearson correlation and cosine, however, their use is not always the most appropriate or sufficiently justified. In this paper, we analyze these two metrics together with the less common mean squared difference (MSD) to discover their advantages and drawbacks in very important aspects such as the impact when introducing different values of k-neighborhoods, minimization of the MAE error, capacity to carry out a sufficient number of predictions, percentage of correct and incorrect predictions and behavior when attempting to recommend the n-best items. The paper lists the results and practical conclusions that have been obtained after carrying out a comparative study of the metrics based on 135 experiments on the MovieLens database of 100,000 ratios.