Statistical analysis of k-nearest neighbor collaborative recommendation

Abstract : Collaborative recommendation is an information-filtering technique that attempts to present information items (movies, music, books, news, images, Web pages, etc.) that are likely of interest to the Internet user. Traditionally, collaborative systems deal with situations with two types of variables, users and items. In its most common form, the problem is framed as trying to estimate ratings for items that have not yet been consumed by a user. Despite wide-ranging literature, little is known about the statistical properties of recommendation systems. In fact, no clear probabilistic model even exists allowing us to precisely describe the mathematical forces driving collaborative filtering. To provide an initial contribution to this, we propose to set out a general sequential stochastic model for collaborative recommendation and analyze its asymptotic performance as the number of users grows. We offer an in-depth analysis of the so-called cosine-type nearest neighbor collaborative method, which is one of the most widely used algorithms in collaborative filtering. We establish consistency of the procedure under mild assumptions on the model. Rates of convergence and examples are also provided.
Type de document :
Article dans une revue
The Annals of Statistics, IMS, 2010, 38 (3), pp.1568-1592. 〈10.1214/09-AOS759〉
Liste complète des métadonnées

Littérature citée [16 références]  Voir  Masquer  Télécharger
Contributeur : Laurent Rouvière <>
Soumis le : mardi 13 octobre 2009 - 17:47:05
Dernière modification le : mercredi 20 juin 2018 - 01:18:47
Document(s) archivé(s) le : mercredi 22 septembre 2010 - 13:23:49


Fichiers produits par l'(les) auteur(s)



Gérard Biau, Benoît Cadre, Laurent Rouviere. Statistical analysis of k-nearest neighbor collaborative recommendation. The Annals of Statistics, IMS, 2010, 38 (3), pp.1568-1592. 〈10.1214/09-AOS759〉. 〈hal-00367480v2〉



Consultations de la notice


Téléchargements de fichiers