Representation Learning for Recommender Systems with Application to the Scientific Literature

Abstract : The scientific literature is a large information network linking various actors (laboratories, companies, institutions, etc.). The vast amount of data generated by this network constitutes a dynamic heterogeneous attributed network (HAN), in which new information is constantly produced and from which it is increasingly difficult to extract content of interest. In this article, I present my first thesis works in partnership with an industrial company, Digital Scientific Research Technology. This later offers a scientific watch tool, Peerus, addressing various issues, such as the real time recommendation of newly published papers or the search for active experts to start new collaborations. To tackle this diversity of applications, a common approach consists in learning representations of the nodes and attributes of this HAN and use them as features for a variety of recommendation tasks. However, most works on attributed network embedding pay too little attention to textual attributes and do not fully take advantage of recent natural language processing techniques. Moreover, proposed methods that jointly learn node and document representations do not provide a way to effectively infer representations for new documents for which network information is missing, which happens to be crucial in real time recommender systems. Finally, the interplay between textual and graph data in text-attributed heterogeneous networks remains an open research direction.
Document type :
Conference papers
Complete list of metadatas
Contributor : Robin Brochier <>
Submitted on : Tuesday, March 5, 2019 - 10:07:11 AM
Last modification on : Wednesday, April 3, 2019 - 1:12:41 AM

Links full text




Robin Brochier. Representation Learning for Recommender Systems with Application to the Scientific Literature. PhD Symposium, May 2019, San Francisco, United States. ⟨10.1145/3308560.3314195⟩. ⟨hal-02057104⟩



Record views