Enhancing User Rating Database Consistency through Pruning

Authors: Dionisis Margaris, Costas Vassilakis

Volume 34 (2017)


Recommender systems are based on information about users’ past behavior to formulate recommendations about their future actions. However, as time goes by the interests and likings of people may change: people listen to different singers or even different types of music, watch different types of movies, read different types of books and so on. Due to this type of changes, an amount of inconsistency is introduced in the database since a portion of it does not reflect the current preferences of the user, which is its intended purpose. In this paper, we present a pruning technique that removes old aged user be-havior data from the ratings database, which are bound to correspond to invali-dated preferences of the user. Through pruning (1) inconsistencies are removed and data quality is upgraded, (2) better rating prediction generation times are achieved and (3) the ratings database size is reduced. We also propose an algo-rithm for determining the amount of pruning that should be performed, allowing the tuning and operation of the pruning algorithm in an unsupervised fashion. The proposed technique is evaluated and compared against seven aging algo-rithms, which reduce the importance of aged ratings, and a state-of-the-art prun-ing algorithm, using datasets with varying characteristics. It is also validated us-ing two distinct rating prediction computation strategies, namely collaborative filtering and matrix factorization. The proposed technique needs no extra infor-mation concerning the items’ characteristics (e.g. categories that they belong to or attributes’ values), can be used in all rating databases that include a timestamp and has been proved to be effective in any size of users-items database and under two rating prediction computation strategies.